Andreas Mueller (also at mastodon) (@amuellerml) Twitter Tweets • TwiCopy

Andreas Mueller (also at mastodon)

@amuellerml

+ Follow

Machine learner, Python geek and scikit-learn developer.
Principal Research SDE @AzureData @Microsoft

ID:471550563

linkhttp://amueller.github.io calendar_today23-01-2012 00:40:44

9,6K Tweets

49,4K Followers

1,0K Following

Andrej Karpathy

2 weeks ago

Congrats to AI at Meta on Llama 3 release!! 🎉
ai.meta.com/blog/meta-llam…
Notes:

Releasing 8B and 70B (both base and finetuned) models, strong-performing in their model class (but we'll see when the rankings come in @ lmsys.org :))
400B is still training, but already encroaching

thumb_up_off_alt7,9K

chat_bubble_outline0

account_circle

Yann LeCun

2 weeks ago

🥁 Llama3 is out 🥁
8B and 70B models available today.
8k context length.
Trained with 15 trillion tokens on a custom-built 24k GPU cluster.
Great performance on various benchmarks, with Llam3-8B doing better than Llama2-70B in some cases.
More versions are coming over the next

🥁 Llama3 is out 🥁 8B and 70B models available today. 8k context length. Trained with 15 trillion tokens on a custom-built 24k GPU cluster. Great performance on various benchmarks, with Llam3-8B doing better than Llama2-70B in some cases. More versions are coming over the next

thumb_up_off_alt7,4K

chat_bubble_outline0

account_circle

Ibis Project

2 months ago

We often get questions around why Voltron Data supports the Ibis project -- we've answered them here!

TL;DR: open standards are critical for the composable data ecosystem and tightly coupling Python dataframes to execution engines is bad for everyone

ibis-project.org/posts/why-voda…

thumb_up_off_alt27

chat_bubble_outline0

account_circle

Julien Le Dem

3 months ago

The rumors are true! I started a(nother) blog. sympathetic.ink

The first post is an adaption of my talk, recalling the pas 10+ years of building open source standards and the lessons learned along the way. sympathetic.ink/2024/01/24/Ten…

thumb_up_off_alt67

chat_bubble_outline0

account_circle

Nick Erickson

3 months ago

Kaggle's (@Kaggle) latest competition's top 11 highest scoring notebooks all use 🚀@AutoGluon AutoML🚀 to achieve their strong performance!

When I said that AutoGluon 1.0 was the largest jump in the state-of-the-art in 4 years, I meant it.

Competition: kaggle.com/competitions/p…

Kaggle's (@Kaggle) latest competition's top 11 highest scoring notebooks all use 🚀@AutoGluon AutoML🚀 to achieve their strong performance! When I said that AutoGluon 1.0 was the largest jump in the state-of-the-art in 4 years, I meant it. Competition: kaggle.com/competitions/p…

thumb_up_off_alt53

chat_bubble_outline0

account_circle

Andrew Lamb

@andrewlamb1111

3 months ago

I am going to speak about ApacheArrow , Apache Parquet and Apache Arrow DataFusion at the Data Council this March. Should be a good conference datacouncil.ai/talks24/buildi…

thumb_up_off_alt68

chat_bubble_outline0

account_circle

OtterTune

3 months ago

From the rise of vector databases to SQL:2023 to MariaDB troubles and the FAA outage, 2023 was an exciting year in database history. OtterTune CEO Andy Pavlo (@[email protected]) covers all that, plus database VC funding. ottertune.com/blog/databases…

thumb_up_off_alt58

chat_bubble_outline0

account_circle

hazyresearch

4 months ago

Thank you so much for the fun keynote, NeurIPS Conference

As in every year, our lab had a blast! We've enjoyed connecting with so many smart, enthusiastic people--and learning about your work. What an exciting time in AI!

Some asked for slides: cs.stanford.edu/~chrismre/pape… and video

Thank you so much for the fun keynote, @NeurIPSConf As in every year, our lab had a blast! We've enjoyed connecting with so many smart, enthusiastic people--and learning about your work. What an exciting time in AI! Some asked for slides: cs.stanford.edu/~chrismre/pape… and video

thumb_up_off_alt119

chat_bubble_outline0

account_circle

Daniel Mas Montserrat

4 months ago

What if you could train an MLP with milliseconds instead of hours and still obtain state-of-the-art accuracy?

We introduce HyperFast: a hypernetwork for instant classification of tabular data that matches the accuracy of XGBoost while being much faster!

openreview.net/pdf?id=VRBhaU8…

What if you could train an MLP with milliseconds instead of hours and still obtain state-of-the-art accuracy? We introduce HyperFast: a hypernetwork for instant classification of tabular data that matches the accuracy of XGBoost while being much faster! openreview.net/pdf?id=VRBhaU8…

thumb_up_off_alt73

chat_bubble_outline0

account_circle

Andreas Mueller (also at mastodon)

4 months ago

I'm excited to share our results on MotherNet, a new hyper-network architecture based on TabPFN that can learn an MLP in-context using a single forward pass. This substantially improves prediction times over predicting with TabPFN directly: arxiv.org/abs/2312.08598

thumb_up_off_alt90

chat_bubble_outline0

account_circle

Andreas Mueller (also at mastodon)

4 months ago

Really amazing and inspiring talk by Chris Re! Hope the recording will be available soon, I'll have to re-watch it a couple of times, I think. Also you should follow Dillon Niederhut PhD for amazing NeurIPS coverage.

thumb_up_off_alt11

chat_bubble_outline0

account_circle

Andreas Mueller (also at mastodon)

4 months ago

In other news, I think I'll mostly be leaving this platform for LinkedIn, which seems to have higher quality engagement these days. Though this is obviously the premier platform for trolling and subtweeting (sub-X-ing?)

thumb_up_off_alt1

chat_bubble_outline0

account_circle

Andreas Mueller (also at mastodon)

4 months ago

Christopher Re at his #NeurIPS2023 keynote 'I think Ilya deserves the Touring award. Maybe not Employee of the month, but definitely the touring award'. [Most of the keynote was deeply technical and quite inspiring.]

thumb_up_off_alt96

chat_bubble_outline0

account_circle

Shao-Hua Sun

4 months ago

Since everyone on my Twitter timeline is attending #NeurIPS2023 . I thought it would be helpful to share this Inigo Montoya conference networking tip for initiating conversations, which works great for me.

Since everyone on my Twitter timeline is attending #NeurIPS2023. I thought it would be helpful to share this Inigo Montoya conference networking tip for initiating conversations, which works great for me.

thumb_up_off_alt35

chat_bubble_outline0

account_circle

Dillon Niederhut PhD

@dillonniederhut

4 months ago

I'm worried that we're heading into an LLM replicability crisis. How many of the results that we've seen are due to very careful prompts? How many would be robust to a small change?

- Meredith Ringel Morris at #NeurIPS2023

thumb_up_off_alt10

chat_bubble_outline0

account_circle

Miro Dudik

4 months ago

🚨Deadline for ML/AI postdocs at Microsoft Research NYC is extended to December 15. If you are at #NeurIPS2023 and want to chat about these positions, please DM me.

thumb_up_off_alt49

chat_bubble_outline0

account_circle

Andreas Mueller (also at mastodon)

4 months ago

Amazing tutorial by Nihar Shah on the many issues in peer review, definitely check out slides and material! cs.cmu.edu/~nihars/tutori… my take away is that there's really no hope for the current model of peer review. Which was kind of my intuition before... #NeurIPS2023

thumb_up_off_alt10

chat_bubble_outline0

account_circle

Frank Hutter @ NeurIPS

4 months ago

I'm at #NeurIPS2023 with my amazing team, excited to be presenting 6 papers at the main track and 2 at workshops, as well as a keynote in the table representation learning workshop. Here's all the info in one tweet, ordered by day. Please come by and chat with us 🙂

Tuesday

thumb_up_off_alt49

chat_bubble_outline0

account_circle

Andreas Mueller (also at mastodon)

4 months ago

Just made it to NOLA, can't wait to catch up with everyone at #NeurIPS2023 !

thumb_up_off_alt5

chat_bubble_outline0

account_circle

Andrej Karpathy

5 months ago

You know how image generation went from blurry 32x32 texture patches to high-resolution images that are difficult to distinguish from real in roughly a snap of a finger? The same is now happening along the time axis (extending to video) and the repercussions boggle the mind just

thumb_up_off_alt11,5K

chat_bubble_outline0

account_circle