Andreas Mueller (also at mastodon) (@amuellerml)'s Twitter Profile
Andreas Mueller (also at mastodon)

@amuellerml

Machine learner, Python geek and scikit-learn developer.
Principal Research SDE @AzureData @Microsoft

ID:471550563

Link: http://amueller.github.io
Joined: 23-01-2012 00:40:44

9,6K Tweets

49,4K Followers

1,0K Following

Andrej Karpathy (@karpathy)

Congrats to AI at Meta on Llama 3 release!! 🎉
ai.meta.com/blog/meta-llam…
Notes:

Releasing 8B and 70B (both base and finetuned) models, strong-performing in their model class (but we'll see when the rankings come in @ lmsys.org :))
400B is still training, but already encroaching

Yann LeCun (@ylecun)

🥁 Llama3 is out 🥁
8B and 70B models available today.
8k context length.
Trained with 15 trillion tokens on a custom-built 24k GPU cluster.
Great performance on various benchmarks, with Llama3-8B doing better than Llama2-70B in some cases.
More versions are coming over the next

Ibis Project (@IbisData)

We often get questions around why Voltron Data supports the Ibis project -- we've answered them here!

TL;DR: open standards are critical for the composable data ecosystem and tightly coupling Python dataframes to execution engines is bad for everyone

ibis-project.org/posts/why-voda…

Julien Le Dem (@J_)

The rumors are true! I started a(nother) blog. sympathetic.ink

The first post is an adaption of my talk, recalling the past 10+ years of building open source standards and the lessons learned along the way. sympathetic.ink/2024/01/24/Ten…

Nick Erickson (@innixma)

Kaggle's (@Kaggle) latest competition's top 11 highest scoring notebooks all use 🚀@AutoGluon AutoML🚀 to achieve their strong performance!

When I said that AutoGluon 1.0 was the largest jump in the state-of-the-art in 4 years, I meant it.

Competition: kaggle.com/competitions/p…

Andrew Lamb (@andrewlamb1111)

I am going to speak about Apache Arrow, Apache Parquet and Apache Arrow DataFusion at the Data Council this March. Should be a good conference: datacouncil.ai/talks24/buildi…

OtterTune (@OtterTuneAI)

From the rise of vector databases to SQL:2023 to MariaDB troubles and the FAA outage, 2023 was an exciting year in database history. OtterTune CEO Andy Pavlo (@[email protected]) covers all that, plus database VC funding. ottertune.com/blog/databases…

hazyresearch (@HazyResearch)

Thank you so much for the fun keynote, NeurIPS Conference

As in every year, our lab had a blast! We've enjoyed connecting with so many smart, enthusiastic people--and learning about your work. What an exciting time in AI!

Some asked for slides: cs.stanford.edu/~chrismre/pape… and video

Daniel Mas Montserrat (@_danielmas)

What if you could train an MLP with milliseconds instead of hours and still obtain state-of-the-art accuracy?

We introduce HyperFast: a hypernetwork for instant classification of tabular data that matches the accuracy of XGBoost while being much faster!

openreview.net/pdf?id=VRBhaU8…

Andreas Mueller (also at mastodon) (@amuellerml)

I'm excited to share our results on MotherNet, a new hyper-network architecture based on TabPFN that can learn an MLP in-context using a single forward pass. This substantially improves prediction times over predicting with TabPFN directly: arxiv.org/abs/2312.08598

Andreas Mueller (also at mastodon) (@amuellerml)

Really amazing and inspiring talk by Chris Re! Hope the recording will be available soon, I'll have to re-watch it a couple of times, I think. Also you should follow Dillon Niederhut PhD for amazing NeurIPS coverage.

Andreas Mueller (also at mastodon) (@amuellerml)

In other news, I think I'll mostly be leaving this platform for LinkedIn, which seems to have higher quality engagement these days. Though this is obviously the premier platform for trolling and subtweeting (sub-X-ing?)

Andreas Mueller (also at mastodon) (@amuellerml)

Christopher Re at his keynote: 'I think Ilya deserves the Turing Award. Maybe not Employee of the Month, but definitely the Turing Award'. [Most of the keynote was deeply technical and quite inspiring.]

Shao-Hua Sun (@shaohua0116)

Since everyone on my Twitter timeline is attending #NeurIPS2023, I thought it would be helpful to share this Inigo Montoya conference networking tip for initiating conversations, which works great for me.

Dillon Niederhut PhD (@dillonniederhut)

I'm worried that we're heading into an LLM replicability crisis. How many of the results that we've seen are due to very careful prompts? How many would be robust to a small change?

- Meredith Ringel Morris at

Miro Dudik (@MiroDudik)

🚨Deadline for ML/AI postdocs at Microsoft Research NYC is extended to December 15. If you are at and want to chat about these positions, please DM me.

Andreas Mueller (also at mastodon) (@amuellerml)

Amazing tutorial by Nihar Shah on the many issues in peer review; definitely check out the slides and material: cs.cmu.edu/~nihars/tutori… My takeaway is that there's really no hope for the current model of peer review, which was kind of my intuition before...

Frank Hutter @ NeurIPS (@FrankRHutter)

I'm at with my amazing team, excited to be presenting 6 papers at the main track and 2 at workshops, as well as a keynote in the table representation learning workshop. Here's all the info in one tweet, ordered by day. Please come by and chat with us 🙂

Tuesday

Andrej Karpathy (@karpathy)

You know how image generation went from blurry 32x32 texture patches to high-resolution images that are difficult to distinguish from real in roughly a snap of a finger? The same is now happening along the time axis (extending to video) and the repercussions boggle the mind just
