Tom Sherborne (@tomsherborne):

🚨 new paper 🚨

Can we train for flat minima with less catastrophic forgetting?

We propose Trust Region Aware Minimization (TRAM) for smoothness in both parameters and representations. TL;DR: representations matter as much as parameters!

arxiv.org/abs/2310.03646 w/@nsaphra @pdasigi @haopeng_nlp
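For context on the method: below is a minimal PyTorch sketch of the SAM-style "perturb then descend" update that sharpness-aware training builds on. This is an illustration under assumptions, not the paper's algorithm: the function name sam_step and the fixed parameter-space radius rho are mine, and TRAM's key ingredient, a trust region informed by representation distance, is deliberately left out; see the paper for the real method.

```python
# Minimal sharpness-aware (SAM-style) update sketch in PyTorch.
# NOTE: plain SAM with a fixed parameter-space radius `rho`; TRAM's
# representation-aware trust region is NOT implemented here.
import torch

def sam_step(model, loss_fn, batch, base_optimizer, rho=0.05, eps=1e-12):
    inputs, targets = batch

    # First pass: gradient at the current weights.
    loss = loss_fn(model(inputs), targets)
    loss.backward()

    # Climb to the approximate worst point inside the rho-ball.
    with torch.no_grad():
        grad_norm = torch.norm(torch.stack(
            [p.grad.norm(2) for p in model.parameters() if p.grad is not None]))
        scale = rho / (grad_norm + eps)
        perturbations = []
        for p in model.parameters():
            if p.grad is None:
                continue
            e = p.grad * scale
            p.add_(e)                      # w <- w + e(w)
            perturbations.append((p, e))
    model.zero_grad()

    # Second pass: gradient at the perturbed weights.
    loss_fn(model(inputs), targets).backward()

    # Restore weights, then descend with the perturbed gradient.
    with torch.no_grad():
        for p, e in perturbations:
            p.sub_(e)
    base_optimizer.step()
    model.zero_grad()
    return loss.item()
```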
Tom Sherborne (@tomsherborne):

The camera-ready for TRAM is now live! See you at #ICLR2024 in Vienna (as a spotlight poster)

now featuring:
* Vision experiments (better ImageNet→CIFAR/Cars/Flowers transfer)
* More ablations (XL model, unusual combinations)
* Pictures (see below!)
w/ @nsaphra @pdasigi @haopeng_nlp 

openreview.net/forum?id=kxebD…
Igor Kotenkov (@stalkermustang):

@Francis_YAO_ @yizhongwyz @Guangxuan_Xiao @haopeng_nlp Really important one. Hope to see a framework that detects and stores only 10-15% of heads' cache to support longer context (with sliding attention). There are likely other heads we'd want to store, but I doubt it's more than 20% for most tasks.
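The policy being asked for, full KV cache for a small set of "retrieval heads" and a sliding window for the rest, can be sketched in a few lines. Everything here is a hypothetical illustration, not code from the paper: truncate_kv, retrieval_heads, and the window size are assumptions, and detecting the heads is out of scope.

```python
# Sketch: per-head KV-cache retention. Flagged "retrieval heads" keep
# their full cache; all other heads keep only a sliding window.
# `retrieval_heads` is a hypothetical precomputed set (e.g. ~10-15%
# of heads); downstream attention must handle ragged per-head lengths.
import torch

def truncate_kv(keys, values, retrieval_heads, window=1024):
    """keys/values: tensors of shape [n_heads, seq_len, head_dim]."""
    n_heads, seq_len, _ = keys.shape
    if seq_len <= window:
        return list(keys), list(values)
    kept_k, kept_v = [], []
    for h in range(n_heads):
        if h in retrieval_heads:
            kept_k.append(keys[h])            # full cache
            kept_v.append(values[h])
        else:
            kept_k.append(keys[h, -window:])  # sliding window only
            kept_v.append(values[h, -window:])
    return kept_k, kept_v
```

With ~10-15% of heads kept in full, the cache cost for the remaining heads stays constant in sequence length, which is where the long-context savings would come from.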
Tom Sherborne (@tomsherborne):

TRAM is accepted to ICLR 2024 as a Spotlight! See you in Vienna 🇦🇹! Thanks to Naomi Saphra, Pradeep Dasigi, Hao Peng, and the Allen Institute for AI (AllenNLP)

Vision experiments, more discussion, and visuals are coming soon in the camera-ready!

Tom Sherborne (@tomsherborne):

I'll be at #ICLR2024 next week in Vienna presenting TRAM as a Spotlight Poster! Come find me in Halle B, Thu 9 May, 10:45AM-12:45PM CEST

Let's talk about SAM, OOD generalisation, PhDing at EdinburghNLP, or working at Cohere

w/ @nsaphra @pdasigi @haopeng_nlp

TTIC (@TTIC_Connect):

Friday, April 12, 2024 at 11:00 am CT: TTIC/@UChicagoCS NLP Seminar presents Hao Peng (@haopeng_nlp) of @IllinoisCS with a talk titled 'Pushing the Boundaries of Length Generalization and Reasoning Capabilities of Open LLMs.' Please join us in Room 529, 5th floor at TTIC.
MichiganAI (@michigan_AI):

🎙️ Speaker Announcement🎙️
We're pleased to announce the keynote speakers for the 17th Midwest Speech & Language Days Symposium #MSLD2024, happening at @UMich, April 15-16:

🌟Eric Fosler-Lussier @EricFos
🌟Hao Peng @haopeng_nlp
🌟Betsy Sneller @betsysneller
🌟Emma Strubell @strubell
Heng Ji (@hengjinlp):

Can we let an LLM simulate a human tutor to guide reasoning and problem solving? My amazing PhD student Xingyao Wang has just finished another line of innovative work, based on collaborations with my new wonderful colleague Hao Peng

arxiv.org/abs/2309.10691

Genglin Liu (@genglin_liu):

Many thanks to my wonderful collaborators/advisor Xingyao Wang, @lifan__yuan, Yangyi Chen, and Hao Peng!

Check out our paper for more details: arxiv.org/abs/2311.09731

HamiltonHuaji (生活西化, 恐怖分子) (@HamiltonHuaji):

@Francis_YAO_ @yizhongwyz @Guangxuan_Xiao @haopeng_nlp If you freeze everything but retrieval heads during continual pretraining, can you still get the same perfect retrieval accuracy as full-parameter training?
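As a sketch of what that experiment could look like in PyTorch: freeze every tensor, then re-enable only the attention projections in layers containing retrieval heads, masking gradients so rows belonging to other heads stay fixed. The module paths (model.layers[i].attn.q_proj etc.) and the retrieval_heads mapping are assumptions about a generic transformer, not any particular codebase.

```python
# Sketch: continual pretraining with everything frozen except selected
# "retrieval heads". Heads share fused projection matrices, so we
# unfreeze whole tensors and zero the gradient rows of other heads.
# `retrieval_heads` maps layer index -> set of head indices (assumed
# precomputed by some detection procedure).
import torch

def freeze_except_retrieval_heads(model, retrieval_heads, n_heads, head_dim):
    for p in model.parameters():
        p.requires_grad = False
    for layer_idx, heads in retrieval_heads.items():
        attn = model.layers[layer_idx].attn   # assumed module layout
        for proj in (attn.q_proj, attn.k_proj, attn.v_proj):
            assert proj.weight.shape[0] == n_heads * head_dim
            proj.weight.requires_grad = True
            mask = torch.zeros_like(proj.weight)
            for h in heads:
                mask[h * head_dim:(h + 1) * head_dim] = 1.0
            # Keep gradients only for rows of the selected heads.
            proj.weight.register_hook(lambda g, m=mask: g * m)
```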
