Bingbin Liu
@BingbinL
PhD student at CMU machine learning department.
ID:1039263012123758595
https://clarabing.github.io/ 10-09-2018 21:23:05
90 Tweets
701 Followers
212 Following
Dataset choice is crucial in today's ML training pipeline. We (Mengzhou Xia and I) introduce desiderata for 'good' data and explain how our recent algorithm, LESS, fits into the picture. Huge review of data selection algs for pre-training and fine-tuning!
cs.princeton.edu/~smalladi/blog…
Extremely honored and excited to be starting as a research fellow at Kempner Institute at Harvard University! Looking forward to becoming part of the community; please do reach out if you are around! 😊
Will deep learning improve with more data, a larger model, or training for longer?
'Any balanced combination of them' <– in our #NeurIPS2023 spotlight, we reveal this through the lens of gradient-based feature learning in the presence of computational-statistical gaps. 1/5
I’m at NeurIPS Conference!
Giving a talk on this work tomorrow at 4:15pm in Ballroom A-C level 2.
If you’re interested in RLHF or learning from human feedback more generally (in Minecraft ⛏️), stop by. Also come to our poster at 5pm (#1405, Hall B1+B2)!