Andrew Ng(@AndrewYNg) 's Twitter Profileg
Andrew Ng

@AndrewYNg

Co-Founder of Coursera; Stanford CS adjunct faculty. Former head of Baidu AI Group/Google Brain. #ai #machinelearning, #deeplearning #MOOCs

ID:216939636

linkhttp://www.andrewng.org calendar_today18-11-2010 03:39:11

1,6K Tweets

1,0M Followers

909 Following

Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

New short course with Mistral AI !

Mistral's open-source Mixtral 8x7B model uses a 'mixture of experts' (MoE) architecture. Unlike a standard transformer, an MoE model has multiple expert feed-forward networks (8 in this case), with a gating network selecting two experts at

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

Multi-agent collaboration has emerged as a key AI agentic design pattern. Given a complex task like writing software, a multi-agent approach would break down the task into subtasks to be executed by different roles -- such as a software engineer, product manager, designer, QA

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

LLMs can take gigabytes of memory to store, which limits what can be run on consumer hardware. But quantization can dramatically compress models, making a wider selection of models available to developers. You can often reduce model size by 4x or more while maintaining reasonable

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

Planning is a key agentic AI design pattern in which we use a large language model (LLM) to autonomously decide on what sequence of steps to execute to accomplish a larger task. For example, if we ask an agent to do online research on a given topic, we might use an LLM to break

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

Data preprocessing is critical for building effective RAG systems. Our new short course, Preprocessing Unstructured Data for LLM Applications, taught by Matt Robinson of UnstructuredIO, demonstrates important but sometimes overlooked aspects of RAG systems:

- How to extract

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

The Financial Times has a great article on Renate Nyborg Renate Nyborg's work on Meeno , written by Madhumita Murgia.

The article is paywalled, but I appreciate Renate (as well as Harvard's Ron Ivey)'s leadership speaking about the dangers of the AI fake girlfriend/boyfriend

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

The task-based analysis of how AI affects jobs is a powerful technique for creating business value. It was pioneered by Workhelix’s Erik Brynjolfsson et al. Now, Workhelix has developed technology to apply this at scale, by automatically examining a company’s job descriptions,

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

Tool use, in which an LLM is given functions it can request to call for gathering information, taking action, or manipulating data, is a key design pattern of AI agentic workflows. You may be familiar with LLM-based systems that can perform a web search or execute code. Some of

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

Learn to carry out red teaming attacks against your own LLM-based applications to spot and patch vulnerabilities! In our new short course, Red Teaming LLM Applications, Matteo Dora & Luca Martial of LLM testing company Giskard teach how to simulate malicious actions to

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

I hope everyone in Taiwan 🇹🇼 is okay after the earthquake. My thoughts are with everyone affected. ❤️

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

Last week, I described four design patterns for AI agentic workflows that I believe will drive significant progress this year: Reflection, Tool use, Planning and Multi-agent collaboration. Instead of having an LLM generate its final output directly, an agentic workflow prompts

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

New JavaScript short course: Build a full-stack web application that uses RAG in JavaScript RAG Web Apps with LlamaIndex, taught by Laurie Voss, VP of Developer Relations at LlamaIndex 🦙 and npm co-founder.
- Build a RAG application for querying your own data
- Develop tools to

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

I’ve been a fan of ⁦AutoGen⁩ as a multiagent programming framework for awhile. It was great hosting two of its leaders ⁦Chi Wang⁩ and ⁦Qingyun Wu⁩ to discuss agent design patterns!

I’ve been a fan of ⁦@pyautogen⁩ as a multiagent programming framework for awhile. It was great hosting two of its leaders ⁦@Chi_Wang_⁩ and ⁦@qingyun_wu⁩ to discuss agent design patterns!
account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

I think AI agentic workflows will drive massive AI progress this year — perhaps even more than the next generation of foundation models. This is an important trend, and I urge everyone who works in AI to pay attention to it.

Today, we mostly use LLMs in zero-shot mode, prompting

I think AI agentic workflows will drive massive AI progress this year — perhaps even more than the next generation of foundation models. This is an important trend, and I urge everyone who works in AI to pay attention to it. Today, we mostly use LLMs in zero-shot mode, prompting
account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

Learn how to build an optimized LLM inference system from the ground up in our new short course, Efficiently Serving LLMs, built in collaboration with Predibase and taught by Travis Addair.

Whether you're serving your own LLM or using a model hosting service, this course will

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

Our new short course, Knowledge Graphs for RAG, is now available! Knowledge graphs are a data structure that is great at capturing complex relationships between data of multiple types. By enabling more sophisticated retrieval of text than similarity search alone, knowledge graphs

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

When we get to AGI, it will have come slowly, not overnight.

A NeurIPS Outstanding Paper award recipient, Are Emergent Abilities of Large Language Models a Mirage? (by Rylan Schaeffer, Brando Miranda, Sanmi Koyejo) studies emergent properties of LLMs, and concludes:
'...

account_circle
younes(@younesbelkada) 's Twitter Profile Photo

Very excited about this new collaboration with DeepLearning.AI team Andrew Ng, HF team Marc Sun Maria Khalusova !

You will learn how to get started with transformers library and HF ecosystem and learn how to build AI applications from scratch - enjoy!

account_circle
Andrew Ng(@AndrewYNg) 's Twitter Profile Photo

New short course: Open Source Models with Hugging Face 🤗, taught by Maria Khalusova, Marc Sun, and Younes Belkada! Hugging Face has been a game changer by letting you quickly grab any of hundreds of thousands of already-trained open source models to assemble into new

account_circle