Ehsan Shareghi (@EhsanShareghi)'s Twitter Profile
Ehsan Shareghi

@EhsanShareghi

Teaching/working on #NLProc. Currently an assistant professor at Monash University, previously a postdoc at the University of Cambridge. Opinions are my own.

ID:1365097706704703488

https://eehsan.github.io · Joined 26-02-2021 00:33:57

70 Tweets

186 Followers

161 Following

Yinhong Liu (@YinhongLiu2)'s Twitter Profile Photo

🔥New paper!📜
Struggle to align LLM evaluators with human judgements?🤔
Introducing PairS🌟: by exploiting transitivity, we push the potential of pairwise preference for efficient ranking evaluations with better alignment!🧑‍⚖️
📖arxiv.org/abs/2403.16950
💻github.com/cambridgeltl/p…
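The transitivity idea can be sketched in a few lines (a toy illustration, not the PairS implementation; `prefer` stands in for an LLM pairwise judge and uses length as a dummy quality proxy):

```python
def prefer(a: str, b: str) -> bool:
    """Stand-in for an LLM pairwise judgment: is `a` better than `b`?"""
    return len(a) > len(b)  # toy proxy; a real judge would prompt an LLM

def merge_rank(items: list[str]) -> list[str]:
    """Rank items best-first using only pairwise comparisons.

    If the judge's preferences are transitive, merge sort recovers the
    full ranking in O(n log n) comparisons instead of comparing all
    O(n^2) pairs.
    """
    if len(items) <= 1:
        return items
    mid = len(items) // 2
    left, right = merge_rank(items[:mid]), merge_rank(items[mid:])
    merged = []
    while left and right:
        merged.append(left.pop(0) if prefer(left[0], right[0]) else right.pop(0))
    return merged + left + right

summaries = ["ok", "a detailed summary", "short one"]
print(merge_rank(summaries))  # best-first under the toy criterion
```

Swapping `prefer` for an actual LLM call is where the alignment gains (and the comparison budget savings) would come from.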

Ehsan Shareghi (@EhsanShareghi)'s Twitter Profile Photo

Simple working idea: take a mixture of training data and train a task router that guides each input to the right mode of solving. A single LoRA (not an MoE) instruction-tuned to make both Task Routing and Task Solving decisions. More: raven-lm.github.io #EACL2024 #NLProc
Ehsan Shareghi (@EhsanShareghi)'s Twitter Profile Photo

Expanding a Language Agent with an uncertainty estimation mechanism not only improves the agent's performance🤯 but also reduces the number of calls it makes to external tools (far more economical🫰). Uncertainty-Aware Language Agent (UALA) paper & code to follow soon!
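The gist can be sketched as follows (a hypothetical illustration of uncertainty gating, not the UALA method itself; the threshold and the logprob-based score are assumptions):

```python
def mean_uncertainty(logprobs: list[float]) -> float:
    """Mean negative log-probability of the answer tokens (higher = less sure)."""
    return -sum(logprobs) / len(logprobs)

def act(question: str, answer_logprobs: list[float], threshold: float = 0.5) -> str:
    """Answer directly when confident; otherwise fall back to an external tool."""
    if mean_uncertainty(answer_logprobs) <= threshold:
        return "direct"   # keep the free-form answer, save a tool call
    return "tool"         # uncertain: route the question to e.g. a search tool

print(act("capital of France?", [-0.01, -0.02]))  # confident -> "direct"
print(act("obscure trivia?", [-1.2, -2.3]))       # uncertain -> "tool"
```

Gating tool use on the model's own confidence is what makes the agent both cheaper and, when the confidence signal is reliable, more accurate.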

Shunyu Yao (@ShunyuYao12)'s Twitter Profile Photo

🧠🦾ReAct -> 🔥FireAct

Most language agents prompt LMs
- ReAct, AutoGPT, ToT, Generative Agents, ...
- which is expensive, slow, and non-robust😢

Most fine-tuned LMs not for agents...

FireAct asks: WHY NOT?

Paper, code, data, ckpts: fireact-agent.github.io

(1/5)

Ehsan Shareghi (@EhsanShareghi)'s Twitter Profile Photo

It is a waste of reviewers' efforts if an AC/SAC chooses to add a meta-review/decision inconsistent with the reviews/scores. This nullifies the rebuttal authors provide too. AC/SAC could allocate better reviewers at the start, or engage in the discussion. Not healthy!

Ehsan Shareghi (@EhsanShareghi)'s Twitter Profile Photo

We (Fangyu Liu, Nigel Collier) reported related observations earlier at CogSci23 (arxiv.org/pdf/2208.11981…) - LLMs (those we tested) do not have a reliable sense of directionality. This is quite problematic for asymmetric relations. Good to see other works in this space.

Ehsan Shareghi (@EhsanShareghi)'s Twitter Profile Photo

RL(HF) has great potential. But forming the right reward, or striking the right balance during optimisation, is quite a challenge. This just scratches the surface (work in progress), showing its potential for improving the quality of Structured Explanations.
Paper: arxiv.org/pdf/2309.08347…

Ehsan Shareghi (@EhsanShareghi)'s Twitter Profile Photo

Speech encoders are foundation models too! We investigate (1) robustness in low-resource conditions, (2) where they potentially capture content and prosodic information, and (3) their representational properties. #INTERSPEECH2023
Paper: arxiv.org/pdf/2305.17733…
Christopher Manning (@chrmanning)'s Twitter Profile Photo

But most AI people work in the quiet middle: We see huge benefits from people using AI in healthcare, education, …, and we see serious AI risks & harms but believe we can minimize them with careful engineering & regulation, just as happened with electricity, cars, planes, ….

Ehsan Shareghi (@EhsanShareghi)'s Twitter Profile Photo

I don't think anyone sane actually assumed small models trained on instruction data from an LLM would clone the LLM's 'capabilities' too. It is just a rewiring step to make them behave similarly (i.e., to follow instructions). That is ground zero, but it unlocks many new possibilities.

Ehsan Shareghi (@EhsanShareghi)'s Twitter Profile Photo

Translating natural language into symbolic first-order logic is very foundational IMO. In this work we used the latest from the NLP/AI world (SFT+RLHF) to train a small LM that both corrects GPT-3.5 and works as a standalone translation tool.
arxiv.org/pdf/2305.15541…

Ehsan Shareghi (@EhsanShareghi)'s Twitter Profile Photo

Need a verifier to auto-correct the outputs of an LLM? PiVe offers a simple recipe to construct data and train such a verifier. It can be used standalone or iteratively with the LLM to improve output accuracy!
Paper: arxiv.org/pdf/2305.12392…
Code: github.com/Jiuzhouh/PiVe

Ehsan Shareghi (@EhsanShareghi)'s Twitter Profile Photo

This may not age well ... have been thinking about this for a while now ... I will learn from your thoughts/views on this

Ehsan Shareghi (@EhsanShareghi)'s Twitter Profile Photo

A categorical dive into commonsense about the physical world, quantifying human norms and comparing them against pre-trained large language models. Associative learning from data is not sufficient for: mereotopology, affordances, non-symmetric relations (cause-effect, troponymy).

Nigel Collier (@nigelhcollier)'s Twitter Profile Photo

Delighted to announce our paper 'On Reality and the Limits of Language Data' in collaboration with Ehsan Shareghi and Fangyu Liu at . We've spent the last 9 months reading and thinking about the limitations of pre-trained language m…lnkd.in/d6RSeVXN lnkd.in/dT-t3n22
