Abhishek Yadav(@abhishek__AI) 's Twitter Profile
Abhishek Yadav

@abhishek__AI

Data analyst by day, AI explorer by night. Passionate about all things data and AI.
Let's learn & grow together!
📖,🚘,🎧,⚽,🏊❤️

ID:715774453423206401

http://Www.futureoflife.org · 01-04-2016 05:35:03

9.3K Tweets

4.3K Followers

1.0K Following

Xenova(@xenovacom) 's Twitter Profile Photo

Introducing MusicGen Web: AI-powered music generation directly in your browser, built with 🤗 Transformers.js! 🎵

Everything runs 100% locally, meaning no calls to an API! 🤯 Served as a static website... this costs $0 to host and run! 🔥

Try it out yourself! 👇

Wing Lian (caseus)(@winglian) 's Twitter Profile Photo

- 13B parameter BitNet + infini-Attention + DenseFormer + MoD + In Context-Pretraining + 2 stage pretraining
- upcycle w c-BTX to an 8 expert sparse MoE + MoA

I’m sure I’m missing about 20 other techniques to throw into a pretrained model/architecture 🤓

In all seriousness…

John Yang(@jyangballin) 's Twitter Profile Photo

SWE-agent is our new system for autonomously solving issues in GitHub repos. It gets similar accuracy to Devin on SWE-bench, takes 93 seconds on avg + it's open source!

We designed a new agent-computer interface to make it easy for GPT-4 to edit+run code
github.com/princeton-nlp/…

Aleksa Gordić 🍿🤖(@gordic_aleksa) 's Twitter Profile Photo

If you're still struggling to understand how transformers work here are some amazing resources! (including mine! :))

First of all, Grant Sanderson (@3blue1brown) just released 2 videos covering, in a fair amount of depth, word embeddings, transformers and their submodules like the embedding mechanism,…

AK(@_akhaliq) 's Twitter Profile Photo

OSWorld

Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Autonomous agents that accomplish complex computer tasks with minimal human interventions have the potential to transform human-computer interaction, significantly enhancing

Stability AI(@StabilityAI) 's Twitter Profile Photo

🎵The Stable Audio 2.0 user guide is here 🎵

Here’s some tips and tricks to get the most out of the 2.0 model. You can access the full guide here: stableaudio.com/user-guide (1/5)

Google DeepMind(@GoogleDeepMind) 's Twitter Profile Photo

Our players were able to walk, turn, kick and stand up faster than manually programmed skills on this type of robot. 🔁

They could also combine movements to score goals, anticipate ball movements and block opponent shots - thereby developing a basic understanding of a 1v1 game.

LangChain(@LangChainAI) 's Twitter Profile Photo

🎙️📹Audio & Video Structured Extraction with Gemini♊️

Google's Gemini 1.5 Pro officially came out of preview yesterday, with support for audio and video inputs, and they work with function calling!

We just released a new short YouTube video outlining how to perform structured…

Haystack(@Haystack_AI) 's Twitter Profile Photo

Cohere Rerank 3 + Haystack 2.0

Cohere just dropped great new models for reranking 👇
· Context length of 4k for significant improvement on longer documents
· Multilingual coverage of 100+ languages
· Improved latency

You can use it right away in Haystack to improve your…

Pratyush Maini(@pratyushmaini) 's Twitter Profile Photo

1/ 🥁Scaling Laws for Data Filtering 🥁

TLDR: Data Curation *cannot* be compute agnostic!
In our CVPR 2024 paper, we develop the first scaling laws for heterogeneous & limited web data.

w/ Sachin Goyal, Zachary Lipton, Aditi Raghunathan, Zico Kolter
📝: arxiv.org/abs/2404.07177

lmsys.org(@lmsysorg) 's Twitter Profile Photo

🔥Exciting news -- GPT-4-Turbo has just reclaimed the No. 1 spot on the Arena leaderboard again! Woah!

We collect over 8K user votes from diverse domains and observe its strong coding & reasoning capability over others. Hats off to OpenAI for this incredible launch!

To offer…

Aran Komatsuzaki(@arankomatsuzaki) 's Twitter Profile Photo

Google presents Best Practices and Lessons Learned on Synthetic Data for Language Models

Provides an overview of synthetic data research, discussing its applications, challenges, and future directions

arxiv.org/abs/2404.07503

Aran Komatsuzaki(@arankomatsuzaki) 's Twitter Profile Photo

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

The first-of-its-kind scalable, real computer environment for multimodal agents, supporting task setup, execution-based evaluation, and interactive learning across various operating…

Nicolas Mejia Petit(@mejia_petit) 's Twitter Profile Photo

🚀 Introducing Mistral-22b-V.01 A breakthrough in AI! 🧠💡
- First-ever MOE to Dense model conversion🔥

This model is NOT an MOE
(It only has 22B params.)

huggingface.co/Vezora/Mistral…

Google AI(@GoogleAI) 's Twitter Profile Photo

Being able to interpret an ML model’s hidden representations is key to understanding its behavior. Today we introduce Patchscopes, an approach that trains LLMs to provide natural language explanations of their own hidden representations. Learn more → goo.gle/4aS5epd

Omar Sanseviero(@osanseviero) 's Twitter Profile Photo

Introducing: Zephyr 141B-A35B 🥁

🔥Mixtral-8x22B fine-tune
🤯 Using ORPO: new alignment algorithm (no SFT, open)
🚀 With 7k instances of (open) data

Very strong IFEval, BBH, AGIEval... Enjoy! 🤗

hf.co/HuggingFaceH4/…

Carlos E. Perez(@IntuitMachine) 's Twitter Profile Photo

1/n Griffin: DeepMind's Soaring Leap Beyond the Transformer Paradigm

Imagine having a conversation with an AI assistant that can engage with you for hours on end, seamlessly incorporating context from your entire dialogue history. Or envision AI models that can analyze books,…

AI at Meta(@AIatMeta) 's Twitter Profile Photo

Today we’re releasing OpenEQA — the Open-Vocabulary Embodied Question Answering Benchmark. It measures an AI agent’s understanding of physical environments by probing it with open vocabulary questions like “Where did I leave my badge?”

More details ➡️ go.fb.me/7vq6hm…

Google DeepMind(@GoogleDeepMind) 's Twitter Profile Photo

Soccer players have to master a range of dynamic skills, from turning and kicking to chasing a ball. How could robots do the same? ⚽

We trained our AI agents to demonstrate a range of agile behaviors using reinforcement learning.

Here’s how. 🧵 dpmd.ai/3vUlgjC
