Sean Welleck (@wellecks)'s Twitter Profile
Sean Welleck

@wellecks

Assistant Professor at CMU. Marathoner, @thesisreview.

ID:280403336

Link: http://wellecks.com | Joined: 11-04-2011 07:59:23

860 Tweets

2.9K Followers

222 Following

Zhiqing Sun (@EdwardSun0909)

Our research on easy-to-hard generalization will be supported by the OpenAI Superalignment Fast Grant. Congratulations to the team and stay tuned! 😎

Pan Lu (@lupantech)

Excited to announce the AI for Math Workshop at ICML Conference! Join us for groundbreaking discussions on the intersection of AI and mathematics. 🤖🧮

📅 Workshop details: sites.google.com/view/ai4mathwo…

📜 Submit your pioneering work: sites.google.com/view/ai4mathwo…

🏆 Take on our…

Krishna Pillutla (@KrishnaPillutla)

Calling motivated students interested in pursuing MS/PhD in ML/AI, specifically privacy & generative AI!

The research group I'm starting at IIT Madras has openings!

Apply by *Mar 31* directly to Dept. of Data Science & AI, IIT Madras or IIT Madras CSE Dept. at research.iitm.ac.in!

Seungone Kim (@seungonekim)

🔥 I will be joining Carnegie Mellon University Language Technologies Institute | @CarnegieMellon this upcoming Fall, working with Graham Neubig and Sean Welleck on evaluating LLMs & improving them with (human) feedback!

Can't wait to explore what lies ahead during my Ph.D. journey ☺️

Zhiqing Sun (@EdwardSun0909)

As a side product, we developed `gpt-accelera`, which supports batched inference with torch.compile during RL, and 2-D parallelism (tp + fsdp) training that can scale to 34b models. We perform all the training in full fine-tuning with this codebase. (n/n)

github.com/Edward-Sun/gpt…

Sean Welleck (@wellecks)

It's often said that 'evaluation is easier than generation'...

We go one step further: strong evaluators enable generalizing to harder problems! New paper led by Zhiqing Sun and Longhui Yu

Using supervision only on easy problems, 52.5 on MATH with Llemma-34b + re-ranking

Matthew Finlayson (@mattf1n)

Wanna know gpt-3.5-turbo's embed size? We find a way to extract info from LLM APIs and estimate gpt-3.5-turbo’s embed size to be 4096. With the same trick we also develop 25x faster logprob extraction, audits for LLM APIs, and more!
📄 arxiv.org/abs/2403.09539
Here’s how 1/🧵

Akari Asai (@AkariAsai)

๐—›๐—ผ๐˜„ ๐—ฐ๐—ฎ๐—ป ๐˜„๐—ฒ ๐—ฏ๐˜‚๐—ถ๐—น๐—ฑ ๐—บ๐—ผ๐—ฟ๐—ฒ ๐—ฟ๐—ฒ๐—น๐—ถ๐—ฎ๐—ฏ๐—น๐—ฒ ๐—Ÿ๐— -๐—ฏ๐—ฎ๐˜€๐—ฒ๐—ฑ ๐˜€๐˜†๐˜€๐˜๐—ฒ๐—บ๐˜€? Our new position paper advocates for retrieval-augmented LMs (RALMs) as the next gen. of LMs, exploring the promises, limitations, and a roadmap for wider adoption.
arxiv.org/abs/2403.03187 ๐Ÿงต

๐—›๐—ผ๐˜„ ๐—ฐ๐—ฎ๐—ป ๐˜„๐—ฒ ๐—ฏ๐˜‚๐—ถ๐—น๐—ฑ ๐—บ๐—ผ๐—ฟ๐—ฒ ๐—ฟ๐—ฒ๐—น๐—ถ๐—ฎ๐—ฏ๐—น๐—ฒ ๐—Ÿ๐— -๐—ฏ๐—ฎ๐˜€๐—ฒ๐—ฑ ๐˜€๐˜†๐˜€๐˜๐—ฒ๐—บ๐˜€? Our new position paper advocates for retrieval-augmented LMs (RALMs) as the next gen. of LMs, exploring the promises, limitations, and a roadmap for wider adoption. arxiv.org/abs/2403.03187 ๐Ÿงต
account_circle
EleutherAI (@AiEleuther)

Another day, another math LM bootstrapping their data work off of the work done by OpenWebMath and Llemma teams. This makes the three in the past week!

Open data work is 🔥 Not only do people use your data, but high quality data work has enduring impact on data pipelines.

Jerry Liu (@jerryjliu0)

Self-RAG in LlamaIndex 🦙

We’re excited to feature Self-RAG, a special RAG technique where an LLM can do self-reflection for dynamic retrieval, critique, and generation (@AkariAsai et al.).

It’s implemented in LlamaIndex 🦙 as a custom query engine with…

Ximing Lu (@GXiming)

If you're interested in this direction, check out our paper Inference-Time Policy Adapters (IPA 🍺).

IPA guides a frozen LLM such as GPT-3 during decoding time through a lightweight policy adapter trained to optimize an arbitrary user objective with RL.

Sean Welleck (@wellecks)

Teaching a new course on Neural Code Generation with Daniel Fried!

cmu-codegen.github.io/s2024/

Here is the lecture on pretraining and scaling laws:
cmu-codegen.github.io/s2024/static_f…

Albert Jiang (@AlbertQJiang)

Wrote a summary of my thoughts on the plane back from NeurIPS: albertqjiang.github.io/thoughts/

That's the serious stuff. Will do a thread of silly things later.

Akari Asai (@AkariAsai)

Had a great time at the NeurIPS instruction workshop and we are honored to get the Best paper Honorable Mention Award!

Check the details of the paper here:
selfrag.github.io
