Kyle Mahowald (@kmahowald)'s Twitter Profile
Kyle Mahowald

@kmahowald

UT Austin linguist https://t.co/1GaRxR8rOu. cognition, psycholinguistics, data, NLP, crosswords. He/him.

ID: 22515678

Link: http://mahowak.github.io · Joined: 02-03-2009 18:28:50

511 Tweets

1.6K Followers

722 Following

Jessy Li (@jessyjli):

A 🔥🔥🔥 event at UT Austin crossing AI/NLP, linguistics, philosophy, and media! It's virtual; free registration with the link below 👇

Eunsol Choi (@eunsolc):

Can LLMs comprehensively capture information spread across multiple documents?
Can LLMs distinguish confusing entity mentions?

Please check out our preprint on multi-document reasoning for LLMs, focusing on entity disambiguation!

Jessy Li (@jessyjli):

Congrats to Venkat for successfully defending his thesis today! David Beaver and I are so proud of his excellent work.
Venkat will join Ithaca College as a faculty member this fall 🎉
Huge thanks to friends, collaborators & committee members Kyle Mahowald and Malihe Alikhani

Tiago Pimentel (@tpimentelms):

Have you ever wondered what the impact of having so many near-duplicate subwords in your model's vocabulary (e.g., _book vs. book vs. books vs. Book 📚) is on its performance? If yes, check out Anton Schäfer's new paper on it! :)

Ted Gibson, Language Lab MIT (@LanguageMIT):

We are gruntled editors of Open Mind, a completely free, open-access cognitive science journal published by MIT Press: send us your papers!!

Ethan Gotlieb Wilcox (@weGotlieb):

BabyLM is back for round two! This time we have: 1. a multimodal track, 2. bring your own data, and 3. a paper track! See the CfP for all the juicy details!

Kyle Mahowald (@kmahowald):

Super exciting to see resources like this being built for studying constructions more systematically in computational linguistics!

Dan Roberts (@danintheory):

Do LLMs really need to be so L?

That's a rejected title for a new paper w/ Andrey Gromov, Kushal Tirumala, Hassan Shapourian, Paolo Glorioso on pruning open-weight LLMs: we can remove up to *half* the layers of Llama-2 70B w/ essentially no impact on performance on QA benchmarks.

1/

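A minimal sketch of the general idea of layer pruning is below, assuming a Hugging Face Llama-style checkpoint. This is an illustration only, not the paper's method: the paper selects which contiguous block of layers to drop using a layer-similarity criterion and then lightly fine-tunes to recover performance, and the model name and block indices here are placeholders.

```python
# Minimal layer-pruning sketch (illustrative only, not the paper's selection
# criterion): drop a contiguous block of decoder layers from a Llama-style model.
import torch.nn as nn
from transformers import AutoModelForCausalLM

model = AutoModelForCausalLM.from_pretrained("meta-llama/Llama-2-7b-hf")  # placeholder checkpoint

start, n_drop = 20, 8  # hypothetical block of layers to remove
kept = [
    layer
    for i, layer in enumerate(model.model.layers)
    if not (start <= i < start + n_drop)
]
model.model.layers = nn.ModuleList(kept)
model.config.num_hidden_layers = len(kept)  # keep the config consistent

# The pruned model can then be evaluated on QA benchmarks and, as in the paper,
# lightly fine-tuned to "heal" any damage from the removed block.
```
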
Kyle Mahowald (@kmahowald):

Very cool survey on mental experience. A real surprise to me that some see a new name visually written in their mind when they first hear it…even more surprised it turns out you can live for years with someone who does that and have no idea that’s what’s been going on in there!

Kanishka Misra 😶‍🌫️ (@kanishkamisra):

☀️minicons 🌖 now supports sequence scoring with Vision-Language Models!!

Looking forward to seeing how people use it (if at all!) -- feedback always welcome!

🐧🐦

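For context, minicons' existing text-only scorer interface looks roughly like the sketch below; the newly announced vision-language scoring is analogous, but its exact class name and signature aren't shown here, and the model choice is just a placeholder.

```python
# Sketch of minicons' text-only sequence scoring; the new vision-language
# interface announced above is analogous but not shown here.
from minicons import scorer

# Wrap an autoregressive LM (model name and device are illustrative).
lm = scorer.IncrementalLMScorer("distilgpt2", "cpu")

stimuli = [
    "The keys to the cabinet are on the table.",
    "The keys to the cabinet is on the table.",
]

# Total log-probability of each sentence (sum of token log-probs).
print(lm.sequence_score(stimuli, reduction=lambda x: x.sum(0).item()))
```
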
Christopher Potts (@ChrisGPotts):

I know I am late in the project cycle for this, but I do have suggested edits for the team behind nature.com/articles/d4158… My overall comment is that the central claims in the original are lacking in empirical support.

Trends in Cognitive Sciences (@TrendsCognSci):

Dissociating language and thought in large language models

Feature Review by Kyle Mahowald (@kmahowald), Anna Ivanova (@neuranna), Idan Blank (@IbanDlank), Nancy Kanwisher (@Nancy_Kanwisher), Joshua Tenenbaum, & Evelina Fedorenko (@ev_fedorenko)

doi.org/10.1016/j.tics…
