EdinburghNLP (@EdinburghNLP)'s Twitter Profile
EdinburghNLP

@EdinburghNLP

The Natural Language Processing Group at the University of Edinburgh

ID:862649978170245120

Link: http://edinburghnlp.inf.ed.ac.uk/ · Joined: 11-05-2017 12:45:56

988 Tweets

11.3K Followers

140 Following

Giwon Hong (@GiwonHong413849)

Introducing The Hallucinations Leaderboard! 🚀 An open effort to measure the LLMs’ tendency to generate hallucinations across various tasks, like open-domain QA, instruction following, and summarisation! 🧵1/N

📄 Paper: arxiv.org/abs/2404.05904
🤗 Leaderboard: huggingface.co/spaces/halluci…

Pasquale Minervini 🚀 looking for postdocs! (@PMinervini)

@YiTayML @yuzhaouoe Yeah! However, other (open-source) pre-training code bases we analysed were doing plain causal masking, where the likelihood of each token was conditioned on all previous tokens in the pre-training chunk. This (sub-optimal, as we found in arxiv.org/abs/2402.13991) choice probably…

Aryo Pradipta Gema (@aryopg)

Fine-tune your LLM unless you have access to GPT-4!
In our SemEval 2024 NLI4CT solution, we evaluated LLMs with ICL, CoT, and a novel PEFT method. And yet, GPT-4 produces surprisingly good results, ranking joint-first in the leaderboard!
👉 arxiv.org/abs/2404.00484
🧵 1/9
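
For flavour, here is a minimal zero-shot CoT prompt of the kind such an evaluation might use (the wording and the clinical snippet are hypothetical, not the paper's actual NLI4CT template):

# Hypothetical zero-shot CoT prompt for an NLI4CT-style entailment query;
# the template wording and the example text below are illustrative only.
premise = "Outcomes were assessed at week 12 and week 24."
statement = "The primary outcome was measured at 12 weeks."

prompt = (
    "Clinical trial report:\n"
    f"{premise}\n\n"
    f"Statement: {statement}\n\n"
    "Does the report entail the statement? "
    "Let's think step by step, then answer Entailment or Contradiction."
)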

Pasquale Minervini 🚀 looking for postdocs! (@PMinervini)

Llama 3 was trained using intra-document causal masking, as suggested by Yu Zhao's paper 'Analysing The Impact of Sequence Composition on Language Model Pre-Training'! 🚀🚀🚀 arxiv.org/abs/2402.13991

Tom Hosking (@tomhosking)

HRQ-VAE is now available in Pythae!

Hierarchical Residual Quantization learns a recursive clustering of a dense vector space, end-to-end. And it learns more meaningful clusters than doing quantization post-hoc.

📝 Paper: aclanthology.org/2022.acl-long.…
🤖 Code: github.com/clementchadebe…
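
As a rough intuition for the method, here is a toy greedy residual-quantisation sketch (illustrative only; HRQ-VAE learns its codebooks end-to-end, and the real implementation is in the Pythae code linked above):

import torch

def hrq_encode(x, codebooks):
    """Toy residual quantisation in the spirit of HRQ-VAE.

    x:         (dim,) vector to encode.
    codebooks: list of (K, dim) tensors, one per hierarchy level;
               level 0 captures coarse structure, deeper levels refine it.
    Returns the chosen code indices and the reconstruction.
    """
    residual = x.clone()
    indices, recon = [], torch.zeros_like(x)
    for codebook in codebooks:
        # Pick the code closest to what is left to explain.
        dists = torch.cdist(residual.unsqueeze(0), codebook).squeeze(0)
        idx = int(torch.argmin(dists))
        indices.append(idx)
        recon = recon + codebook[idx]
        residual = residual - codebook[idx]  # next level sees the residual
    return indices, recon

# Two levels of 8 codes each over a 4-d space (random toy codebooks).
torch.manual_seed(0)
codebooks = [torch.randn(8, 4), 0.1 * torch.randn(8, 4)]
indices, recon = hrq_encode(torch.randn(4), codebooks)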

Pasquale Minervini 🚀 looking for postdocs! (@PMinervini)

If you're working on LLM pre-training, you may want to take these findings into account! E.g., intra-document causal masking is a one-line drop-in replacement of 'classic' causal masking; it's fully supported by e.g., Flash Attention (github.com/Dao-AILab/flas…); and provides…
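
For concreteness, a minimal sketch of the masking difference, assuming each token in a packed chunk carries the id of the document it came from (the `doc_ids` name is illustrative; in practice FlashAttention's variable-length interface handles this without materialising the full mask):

import torch

def intra_document_causal_mask(doc_ids: torch.Tensor) -> torch.Tensor:
    """Boolean attention mask for one packed pre-training chunk.

    doc_ids: (seq_len,) integer id of the document each token belongs to.
    Returns: (seq_len, seq_len) mask where True means position j may be
    attended to from position i. Plain causal masking keeps only the
    lower-triangular constraint; intra-document masking additionally
    blocks attention across document boundaries.
    """
    seq_len = doc_ids.shape[0]
    causal = torch.tril(torch.ones(seq_len, seq_len, dtype=torch.bool))
    same_doc = doc_ids.unsqueeze(0) == doc_ids.unsqueeze(1)
    return causal & same_doc

# Example: a chunk packing two documents of lengths 3 and 2.
doc_ids = torch.tensor([0, 0, 0, 1, 1])
mask = intra_document_causal_mask(doc_ids)
# The first token of document 1 can only attend to itself:
assert mask[3].tolist() == [False, False, False, True, False]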

Rohit Saxena (@rohit_saxena)

🚨New paper🚨
Excited to share that 'Select and Summarize: Scene Saliency for Movie Script Summarization' has been accepted at Findings. 🎬
w. Frank Keller
arxiv.org/abs/2404.03561…

Pasquale Minervini 🚀 looking for postdocs! (@PMinervini)

Excited to share our latest work on improving LLM pre-training! 🚀 The amazing Yu Zhao et al. found that focusing on how pre-training sequences are composed and attended over can significantly improve the generalisation properties of LLMs on a wide array of downstream tasks,…

fly51fly (@fly51fly)

[CL] Learning to Plan and Generate Text with Citations
arxiv.org/abs/2404.03381
- Long-form generation in information-seeking scenarios has seen increasing demand for verifiable systems that generate responses with supporting evidence like citations. Most approaches rely on…

Dongwei Jiang (@Dongwei__Jiang)

📢 New paper at NAACL:

🤔 > LLMs still struggle with complex reasoning problems. Can we use reasoning nuggets from mathematical proofs to help with it?

Introducing LeanReasoner, a framework that uses Lean to solve complex natural language logical reasoning problems 🧵
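
As a flavour of the target formalism, a hypothetical toy inference of the kind such a pipeline might discharge in Lean (not an example from the paper):

-- Hypothetical toy target: "Every student passed; Alice is a student;
-- therefore Alice passed." (Illustrative; not from the LeanReasoner paper.)
variable (Person : Type) (Student Passed : Person → Prop) (alice : Person)

example (h1 : ∀ p, Student p → Passed p) (h2 : Student alice) :
    Passed alice :=
  h1 alice h2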

Antonio Valerio Miceli Barone (@AVMiceliBarone)

New paper:

Scaling Behavior of Machine Translation with Large Language Models under Prompt Injection Attacks
arxiv.org/abs/2403.09832

Edoardo Ponti (@PontiEdoardo)

If you are curious to discover more about Dynamic Memory Compression, I will give a preview during my keynote talk at the MOOMIN workshop at EACL.

See you on Thursday, March 21st at 9:30 AM!

moomin-workshop.github.io/program

Irina Saparina (@irisaparina)

Next week I’ll be in Malta 🇲🇹 to present our work on Improving Generalization in Semantic Parsing by Increasing Natural Language Variation at #EACL2024!

1/3

EdinburghNLP (@EdinburghNLP)

Dynamic Memory Compression (DMC) retrofits LLMs like Llama 2 7/13/70B into models with the same downstream performance but with +370% throughput (tok/s) during generation.

Simone Scardapane (@s_scardapane)

*Conditional computation in NNs: principles and research trends*
by Alessio Devoto, Valerio Marsocci, Jary Pomponi, Pasquale Minervini

Our latest tutorial on increasing modularity in NNs with conditional computation, covering MoEs, token selection, & early exits.

arxiv.org/abs/2403.07965
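
As a taste of the first topic, a minimal top-1 token-routing MoE layer (an illustrative sketch of the idea, not code from the tutorial):

import torch
import torch.nn as nn

class TinyMoE(nn.Module):
    """Minimal top-1 token-routing mixture-of-experts layer."""

    def __init__(self, dim: int, num_experts: int):
        super().__init__()
        self.router = nn.Linear(dim, num_experts)  # scores each token
        self.experts = nn.ModuleList(
            nn.Sequential(nn.Linear(dim, 4 * dim), nn.GELU(),
                          nn.Linear(4 * dim, dim))
            for _ in range(num_experts)
        )

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        # x: (num_tokens, dim). Each token is sent to exactly one expert,
        # so compute scales with tokens, not with the number of experts.
        gates = self.router(x).softmax(dim=-1)   # (tokens, experts)
        top_gate, top_idx = gates.max(dim=-1)    # top-1 routing
        out = torch.zeros_like(x)
        for e, expert in enumerate(self.experts):
            sel = top_idx == e
            if sel.any():
                out[sel] = top_gate[sel].unsqueeze(1) * expert(x[sel])
        return out

moe = TinyMoE(dim=16, num_experts=4)
y = moe(torch.randn(10, 16))  # 10 tokens routed among 4 experts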

Tom Sherborne (@tomsherborne)

CR for TRAM is now live! See you at #ICLR2024 in Vienna (as a spotlight poster)

now feat.
* Vision exps (Better Imagenet→CIFAR/Cars/Flowers transfer)
* +ablations (XL model, weird combos)
* Pictures (see below!)
w/ Naomi Saphra, Pradeep Dasigi, Hao Peng

openreview.net/forum?id=kxebD…

Edoardo Ponti (@PontiEdoardo)

Can open-source LLMs execute *chains of instructions* in a single query? Not so well, we found.

However, they can learn this ability by:
- augmenting examples from public SFT mixtures with chains of instructions automatically
- performing *sequential instruction tuning* on them.…
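
A minimal sketch of the augmentation step described above (field names and the prompt format are hypothetical, not the paper's):

def chain_examples(examples):
    """examples: list of {'instruction': str, 'response': str} dicts
    drawn from an SFT mixture. Returns one training example whose
    prompt asks for all instructions to be executed in order, and
    whose target answers them in order."""
    prompt_steps = [
        f"{i + 1}. {ex['instruction']}" for i, ex in enumerate(examples)
    ]
    target_steps = [
        f"{i + 1}. {ex['response']}" for i, ex in enumerate(examples)
    ]
    prompt = (
        "Complete the following instructions in order:\n"
        + "\n".join(prompt_steps)
    )
    return {"instruction": prompt, "response": "\n".join(target_steps)}

chained = chain_examples([
    {"instruction": "Translate 'bonjour' to English.", "response": "Hello."},
    {"instruction": "Now uppercase your previous answer.", "response": "HELLO."},
])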
