Yifang Chen
@cloudwaysX
Ph.D. student @uwcse. Previously @usc undergrad. Online Learning, reinforcement learning, bandits, and active learning.
ID:854855741110276096
https://cloudwaysx.github.io/ 20-04-2017 00:34:25
86 Tweets
461 Followers
641 Following
Excite to announce that our work on posterior sampling for pure exploration linear bandits will be at #AISTATS2024 !
arxiv.org/pdf/2310.06069…
Our method extends top-two sampling for MAB and hopefully opens up new promising directions for exploration in RL.
New paper on label-efficient supervised finetuning of LLMs.
We address the expensive prompt annotation cost by humans/proprietary LLMs, saving as much as 50% on FLAN V2.
Paper: arxiv.org/abs/2401.06692
Work led by: Jifan Zhang Yifang Chen Gantavya Bhatt Arnav Das
1/
In NOLA for NeurIPS Conference! Present two papers and one workshop paper. Excited to chat about RL, control, and robotics. DM if you want to meet up : )
1. Optimal Exploration for Model-based RL in Nonlinear Systems (arxiv.org/abs/2306.09210). Thu 10:45-12:45pm #1507. TL;DR: Not all
I'm looking to recruit PhD students to join Georgia Tech to work with me on data aspects of core machine learning starting Fall 2024. If interested, please consider applying by the December 15 deadline to the ML PhD program or the CS PhD program (links on my webpage).
Excited to join Bourns College of Engineering at UC Riverside UCR ECE UC Riverside! I’m also actively hiring Ph.D. students for Fall 2024. Please see yinglunz.com for more details 😀
#NeurIPS2022 When can we find the Nash equilibrium and overcome the curse of multiagent in offline multi-agent RL? Come to our poster Hall J #539 today at 4pm and Hall J #740 on Thursday at 4pm for more details and Q&A!
Paper: arxiv.org/abs/2201.03522 & arxiv.org/abs/2206.00159