Abhishek Yadav (@abhishek__AI) Twitter Tweets • TwiCopy

Abhishek Yadav

@abhishek__AI

Data analyst by day, AI explorer by night. Passionate about all things data and AI.
Let's learn & grow together!
📖,🚘,🎧,⚽,🏊❤️

ID:715774453423206401

Link: http://Www.futureoflife.org
Joined: 01-04-2016 05:35:03

9.3K Tweets

4.3K Followers

1.0K Following

Xenova

3 weeks ago

Introducing MusicGen Web: AI-powered music generation directly in your browser, built with 🤗 Transformers.js! 🎵

Everything runs 100% locally, meaning no calls to an API! 🤯 Served as a static website... this costs $0 to host and run! 🔥

Try it out yourself! 👇

Likes: 239 | Replies: 0

Wing Lian (caseus)

3 weeks ago

- 13B parameter BitNet + infini-Attention + DenseFormer + MoD + In Context-Pretraining + 2 stage pretraining
- upcycle w c-BTX to an 8 expert sparse MoE + MoA

I’m sure I’m missing about 20 other techniques to throw into a pretrained model/architecture 🤓

In all seriousness…

Likes: 156 | Replies: 0

John Yang

1 month ago

SWE-agent is our new system for autonomously solving issues in GitHub repos. It gets similar accuracy to Devin on SWE-bench, takes 93 seconds on avg + it's open source!

We designed a new agent-computer interface to make it easy for GPT-4 to edit+run code
github.com/princeton-nlp/…

Likes: 2.3K | Replies: 0

Aleksa Gordić 🍿🤖

3 weeks ago

If you're still struggling to understand how transformers work here are some amazing resources! (including mine! :))

First of all Grant Sanderson just released 2 videos covering in fair amount of depth word embeddings, transformers and its submodules like embedding mechanism,…

Likes: 133 | Replies: 0

AK

3 weeks ago

OSWorld

Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

Autonomous agents that accomplish complex computer tasks with minimal human interventions have the potential to transform human-computer interaction, significantly enhancing

Likes: 459 | Replies: 0

Stability AI

3 weeks ago

🎵The Stable Audio 2.0 user guide is here 🎵

Here’s some tips and tricks to get the most out of the 2.0 model. You can access the full guide here: stableaudio.com/user-guide (1/5)

Likes: 154 | Replies: 0

Google DeepMind

@GoogleDeepMind

3 weeks ago

Our players were able to walk, turn, kick and stand up faster than manually programmed skills on this type of robot. 🔁

They could also combine movements to score goals, anticipate ball movements and block opponent shots - thereby developing a basic understanding of a 1v1 game.

Likes: 89 | Replies: 0

LangChain

3 weeks ago

🎙️📹Audio & Video Structured Extraction with Gemini♊️

Google's Gemini 1.5 Pro officially came out of preview yesterday, with support for audio and video inputs, and they work with function calling!

We just released a new short YouTube video outlining how to perform structured…

Likes: 251 | Replies: 0

Haystack

3 weeks ago

Cohere Rerank 3 + Haystack 2.0

cohere just dropped new great models for reranking 👇
· Context length of 4k for significant improvement on longer documents
· Multilingual coverage of 100+ languages
· Improved latency

You can use it right away in Haystack to improve your…

Likes: 27 | Replies: 0

Pratyush Maini

3 weeks ago

1/ 🥁Scaling Laws for Data Filtering 🥁

TLDR: Data Curation *cannot* be compute agnostic!
In our #CVPR2024 paper, we develop the first scaling laws for heterogeneous & limited web data.

w/Sachin Goyal Zachary Lipton Aditi Raghunathan Zico Kolter
📝:arxiv.org/abs/2404.07177

Likes: 215 | Replies: 0

lmsys.org

3 weeks ago

🔥Exciting news -- GPT-4-Turbo has just reclaimed the No. 1 spot on the Arena leaderboard again! Woah!

We collect over 8K user votes from diverse domains and observe its strong coding & reasoning capability over others. Hats off to OpenAI for this incredible launch!

To offer…

Likes: 1.0K | Replies: 0

Aran Komatsuzaki

@arankomatsuzaki

3 weeks ago

Google presents Best Practices and Lessons Learned on Synthetic Data for Language Models

Provides an overview of synthetic data research, discussing its applications, challenges, and future directions

arxiv.org/abs/2404.07503

Likes: 644 | Replies: 0

Aran Komatsuzaki

@arankomatsuzaki

3 weeks ago

OSWorld: Benchmarking Multimodal Agents for Open-Ended Tasks in Real Computer Environments

The first-of-its-kind scalable, real computer environment for multimodal agents, supporting task setup, execution-based evaluation, and interactive learning across various operating…

Likes: 329 | Replies: 0

Nicolas Mejia Petit

3 weeks ago

🚀 Introducing Mistral-22b-V.01 A breakthrough in AI! 🧠💡
- First-ever MOE to Dense model conversion🔥 #Mistral22bV01

This model is NOT an MOE
(It only has 22B params.)

huggingface.co/Vezora/Mistral…

Likes: 537 | Replies: 0

Google AI

3 weeks ago

Being able to interpret an #ML model’s hidden representations is key to understanding its behavior. Today we introduce Patchscopes, an approach that trains #LLMs to provide natural language explanations of their own hidden representations. Learn more → goo.gle/4aS5epd

Likes: 1.0K | Replies: 0

Omar Sanseviero

3 weeks ago

Introducing: Zephyr 141B-A35B 🥁

🔥Mixtral-8x22B fine-tune
🤯 Using ORPO: new alignment algorithm (no SFT, open)
🚀 With 7k instances of (open) data

Very strong IFEval, BBH, AGIEval... Enjoy! 🤗

hf.co/HuggingFaceH4/…

Likes: 723 | Replies: 0

Carlos E. Perez

3 weeks ago

1/n Griffin: DeepMind's Soaring Leap Beyond the Transformer Paradigm

Imagine having a conversation with an AI assistant that can engage with you for hours on end, seamlessly incorporating context from your entire dialogue history. Or envision AI models that can analyze books,…

Likes: 240 | Replies: 0

AI at Meta

3 weeks ago

Today we’re releasing OpenEQA — the Open-Vocabulary Embodied Question Answering Benchmark. It measures an AI agent’s understanding of physical environments by probing it with open vocabulary questions like “Where did I leave my badge?”

More details ➡️ go.fb.me/7vq6hm…

Likes: 1.1K | Replies: 0

Shubham Saboo

@Saboo_Shubham_

3 weeks ago

Transformers (LLMs) clearly explained with visuals:

Likes: 1.1K | Replies: 0

Google DeepMind

@GoogleDeepMind

3 weeks ago

Soccer players have to master a range of dynamic skills, from turning and kicking to chasing a ball. How could robots do the same? ⚽

We trained our AI agents to demonstrate a range of agile behaviors using reinforcement learning.

Here’s how. 🧵 dpmd.ai/3vUlgjC

Likes: 2.0K | Replies: 0
