Luca Soldaini ๐ŸŽ€(@soldni) 's Twitter Profileg
Luca Soldaini ๐ŸŽ€

@soldni

I like tokens! Lead for OLMo data team at @allen_ai (makin Dolma ๐Ÿ‡), open source science fan, @QueerInAI organizer ๐Ÿค–โ˜•๏ธ๐Ÿ•they/them

ID:1865461842

linkhttps://soldaini.net calendar_today15-09-2013 00:09:49

18,0K Tweets

6,1K Followers

1,0K Following

Pete(@epwalsh) 's Twitter Profile Photo

The full training loop metrics are now available on W&B ๐Ÿ‘‡

Stage 1 pretraining: wandb.ai/ai2-llm/OLMo-7โ€ฆ
Stage 2 annealing: wandb.ai/ai2-llm/OLMo-7โ€ฆ

account_circle
Alexander Doria(@Dorialexander) 's Twitter Profile Photo

Big announcement: pleias releases a massive open corpus of 2 million Youtube videos in Creative Commons (CC-By) on Hugging Face. Youtube-Commons features 30 billion words of audio transcriptions in multiple languages, and soon other modalities huggingface.co/datasets/PleIAโ€ฆ

Big announcement: @pleiasfr releases a massive open corpus of 2 million Youtube videos in Creative Commons (CC-By) on @huggingface. Youtube-Commons features 30 billion words of audio transcriptions in multiple languages, and soon other modalities huggingface.co/datasets/PleIAโ€ฆ
account_circle
Kyle Lo(@kylelostat) 's Twitter Profile Photo

notable stuff:
๐Ÿฆ‰ton of perf boost from mixing instruct data at end (e.g., flan)
๐Ÿ‹anneal learning rate (Fig 9b in arxiv.org/abs/2403.08763)
๐Ÿžchanging data mix boosts MMLU at some cost to other evals

๐Ÿ‡huggingface.co/allenai/dolma
๐Ÿง€huggingface.co/allenai/OLMo-1โ€ฆ

account_circle
Cody Blakeney(@code_star) 's Twitter Profile Photo

Amazing work as always by Luca Soldaini ๐ŸŽ€ and and the folks at AI2. Major improvements on MMLU by tweaking the recipe on data cleaning and mixing. Read the blog and learn from the best. Open science is so important!

account_circle
Nathan Lambert(@natolambert) 's Twitter Profile Photo

Our new position paper showing how social choice theory can give us tools to answer many vexing questions around the preference collection and learning in rlhf!

account_circle