Sandro Pezzelle (@sandropezzelle)'s Twitter Profile
Sandro Pezzelle

@sandropezzelle

Assistant Professor at the University of Amsterdam. #NLProc #AI #CogSci #interpretability

ID:1009417448

https://sandropezzelle.github.io · 13-12-2012 18:18:00

374 Tweets

839 Followers

673 Following

Jacob Andreas (@jacobandreas)

Some very cool recent work from Chengxu Zhuang on visually grounded language learning.

Surprisingly, standard text/image repr learning gives great image reprs but doesn't change behavior much on language tasks---how can we use image data for better learning of *language*?

babyLM (@babyLMchallenge)

👶 BabyLM Challenge is back!
Can you improve pretraining with a small data budget?

BabyLMs for better LLMs
& for understanding how humans learn from 100M words

New:
How vision affects learning
Bring your own data
Paper track

babylm.github.io
🧵

Ethan Gotlieb Wilcox (@weGotlieb)

BabyLM is back for round two! This time we have: 1. Multimodal track 2. Bring your own data and 3. Paper track! See the CfP for all the juicy details!

Sandro Pezzelle (@sandropezzelle)

What happy faces! 😊 It was a wonderful adventure organizing this workshop together with Raquel Fernández! Thanks again to Andre Martins, Ivan Titov, Iryna Gurevych, and ELLIS for proposing and supporting it at all stages! Thanks again to all participants 🔥

Yoav Artzi (@yoavartzi)

Folks, some Conference on Language Modeling stats, because looking at these really brightens the mood :)
We received a total of ⭐️1036⭐️ submissions (for the first ever COLM!!!!). What is even more exciting is the nice distribution of topics and keywords. Exciting times ahead! ❤️

Sandro Pezzelle (@sandropezzelle)

Thanks Andre, Ivan Titov, Iryna Gurevych, and ELLIS for proposing and supporting the organization of this workshop! It was a pleasure to make it happen together with Raquel Fernández and all participants 🔥

Michael Hanna (@michaelwhanna)

Circuits are a hot topic in interpretability, but how do you find a circuit and guarantee it reflects how your model works?

We (Sandro Pezzelle, Yonatan Belinkov, and I) introduce a new circuit-finding method, EAP-IG, and show it finds more faithful circuits: arxiv.org/abs/2403.17806 1/8

noahdgoodman (@noahdgoodman)

New work where language models learn to ask questions? So they can better understand user needs? With an amazing method name? Oh, yes!

Sonia Joseph (@soniajoseph_)

I'm excited to release Prisma, a mechanistic interpretability library for multimodal models like CLIP and ViTs. Incubated at Blake Richards's lab & in collab with Neel Nanda.

Recent mech interp work has focused on language, but many techniques transfer. Behold, the dogit lens:

Sandro Pezzelle (@sandropezzelle)

was an absolute blast! 🔥 Our own contributions: a great talk (Alberto Testoni) and a must-see poster (@ecekt2) at the main conference + a timely tutorial (@michaelwhanna) and an workshop on Thursday! ☀️ Well done!

Wilker Aziz (@wilkeraziz)

_Unsure_ about what workshop to attend? Join us this Friday at the workshop

Location: Bastion 2 room of the Corinthia

uncertainlp.github.io/program

Elias Stengel-Eskin (@EliasEskin)

Thrilled to share our work as a keynote at the UncertainNLP workshop!

I'll be covering the following work (ranging from semantic parsing to VQA and multi-LLM discussion/collaboration) under the umbrella of using uncertainty to rephrase/refine inputs and select outputs.

Short 🧵
