Andrew Ng (@AndrewYNg) Twitter Tweets • TwiCopy

Andrew Ng

@AndrewYNg

+ Follow

Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs

ID:216939636

linkhttp://www.andrewng.org calendar_today18-11-2010 03:39:11

1,6K Tweets

1,0M Followers

909 Following

Andrew Ng

1 day ago

New short course with Mistral AI !

Mistral's open-source Mixtral 8x7B model uses a 'mixture of experts' (MoE) architecture. Unlike a standard transformer, an MoE model has multiple expert feed-forward networks (8 in this case), with a gating network selecting two experts at

thumb_up_off_alt1,3K

chat_bubble_outline0

account_circle

Andrew Ng

5 days ago

Meta released Llama 3 on my birthday! 🎂 Best present ever, thanks Meta! 😀

thumb_up_off_alt4,3K

chat_bubble_outline0

account_circle

Andrew Ng

5 days ago

Multi-agent collaboration has emerged as a key AI agentic design pattern. Given a complex task like writing software, a multi-agent approach would break down the task into subtasks to be executed by different roles -- such as a software engineer, product manager, designer, QA

thumb_up_off_alt2,3K

chat_bubble_outline0

account_circle

Andrew Ng

1 week ago

LLMs can take gigabytes of memory to store, which limits what can be run on consumer hardware. But quantization can dramatically compress models, making a wider selection of models available to developers. You can often reduce model size by 4x or more while maintaining reasonable

thumb_up_off_alt2,4K

chat_bubble_outline0

account_circle

Andrew Ng

1 week ago

Planning is a key agentic AI design pattern in which we use a large language model (LLM) to autonomously decide on what sequence of steps to execute to accomplish a larger task. For example, if we ask an agent to do online research on a given topic, we might use an LLM to break

thumb_up_off_alt2,4K

chat_bubble_outline0

account_circle

Andrew Ng

1 week ago

Data preprocessing is critical for building effective RAG systems. Our new short course, Preprocessing Unstructured Data for LLM Applications, taught by Matt Robinson of UnstructuredIO, demonstrates important but sometimes overlooked aspects of RAG systems:

- How to extract

thumb_up_off_alt1,2K

chat_bubble_outline0

account_circle

Andrew Ng

2 weeks ago

The Financial Times has a great article on Renate Nyborg Renate Nyborg's work on Meeno , written by Madhumita Murgia.

The article is paywalled, but I appreciate Renate (as well as Harvard's Ron Ivey)'s leadership speaking about the dangers of the AI fake girlfriend/boyfriend

thumb_up_off_alt298

chat_bubble_outline0

account_circle

Andrew Ng

2 weeks ago

The task-based analysis of how AI affects jobs is a powerful technique for creating business value. It was pioneered by Workhelix’s Erik Brynjolfsson et al. Now, Workhelix has developed technology to apply this at scale, by automatically examining a company’s job descriptions,

thumb_up_off_alt550

chat_bubble_outline0

account_circle

Andrew Ng

2 weeks ago

Tool use, in which an LLM is given functions it can request to call for gathering information, taking action, or manipulating data, is a key design pattern of AI agentic workflows. You may be familiar with LLM-based systems that can perform a web search or execute code. Some of

thumb_up_off_alt1,6K

chat_bubble_outline0

account_circle

Andrew Ng

2 weeks ago

Learn to carry out red teaming attacks against your own LLM-based applications to spot and patch vulnerabilities! In our new short course, Red Teaming LLM Applications, Matteo Dora & Luca Martial of LLM testing company Giskard teach how to simulate malicious actions to

thumb_up_off_alt766

chat_bubble_outline0

account_circle

Andrew Ng

3 weeks ago

I hope everyone in Taiwan 🇹🇼 is okay after the earthquake. My thoughts are with everyone affected. ❤️

thumb_up_off_alt2,0K

chat_bubble_outline0

account_circle

Andrew Ng

3 weeks ago

Last week, I described four design patterns for AI agentic workflows that I believe will drive significant progress this year: Reflection, Tool use, Planning and Multi-agent collaboration. Instead of having an LLM generate its final output directly, an agentic workflow prompts

thumb_up_off_alt2,8K

chat_bubble_outline0

account_circle

Andrew Ng

3 weeks ago

New JavaScript short course: Build a full-stack web application that uses RAG in JavaScript RAG Web Apps with LlamaIndex, taught by Laurie Voss, VP of Developer Relations at LlamaIndex 🦙 and npm co-founder.
- Build a RAG application for querying your own data
- Develop tools to

thumb_up_off_alt1,3K

chat_bubble_outline0

account_circle

Andrew Ng

1 month ago

I’ve been a fan of ⁦AutoGen⁩ as a multiagent programming framework for awhile. It was great hosting two of its leaders ⁦Chi Wang⁩ and ⁦Qingyun Wu⁩ to discuss agent design patterns!

I’ve been a fan of ⁦@pyautogen⁩ as a multiagent programming framework for awhile. It was great hosting two of its leaders ⁦@Chi_Wang_⁩ and ⁦@qingyun_wu⁩ to discuss agent design patterns!

thumb_up_off_alt723

chat_bubble_outline0

account_circle

Andrew Ng

1 month ago

I think AI agentic workflows will drive massive AI progress this year — perhaps even more than the next generation of foundation models. This is an important trend, and I urge everyone who works in AI to pay attention to it.

Today, we mostly use LLMs in zero-shot mode, prompting

I think AI agentic workflows will drive massive AI progress this year — perhaps even more than the next generation of foundation models. This is an important trend, and I urge everyone who works in AI to pay attention to it. Today, we mostly use LLMs in zero-shot mode, prompting

thumb_up_off_alt5,3K

chat_bubble_outline0

account_circle

Andrew Ng

1 month ago

Learn how to build an optimized LLM inference system from the ground up in our new short course, Efficiently Serving LLMs, built in collaboration with Predibase and taught by Travis Addair.

Whether you're serving your own LLM or using a model hosting service, this course will

thumb_up_off_alt804

chat_bubble_outline0

account_circle

Andrew Ng

1 month ago

Our new short course, Knowledge Graphs for RAG, is now available! Knowledge graphs are a data structure that is great at capturing complex relationships between data of multiple types. By enabling more sophisticated retrieval of text than similarity search alone, knowledge graphs

thumb_up_off_alt2,0K

chat_bubble_outline0

account_circle

Andrew Ng

1 month ago

When we get to AGI, it will have come slowly, not overnight.

A NeurIPS Outstanding Paper award recipient, Are Emergent Abilities of Large Language Models a Mirage? (by Rylan Schaeffer, Brando Miranda, Sanmi Koyejo) studies emergent properties of LLMs, and concludes:
'...

thumb_up_off_alt1,6K

chat_bubble_outline0

account_circle

younes

1 month ago

Very excited about this new collaboration with DeepLearning.AI team Andrew Ng, HF team Marc Sun Maria Khalusova !

You will learn how to get started with transformers library and HF ecosystem and learn how to build AI applications from scratch - enjoy!

thumb_up_off_alt158

chat_bubble_outline0

account_circle

Andrew Ng

1 month ago

New short course: Open Source Models with Hugging Face 🤗, taught by Maria Khalusova, Marc Sun, and Younes Belkada! Hugging Face has been a game changer by letting you quickly grab any of hundreds of thousands of already-trained open source models to assemble into new

thumb_up_off_alt1,2K

chat_bubble_outline0

account_circle

fpc ok :)