Shunyu Yao (@ShunyuYao12) Twitter Tweets • TwiCopy

1 month ago

More important and relevant in 2024

thumb_up_off_alt3

account_circle

You can now apply SWE-agent to any local repository and use any text file as the input issue instead of having to use GitHub repos/issues. Lots of people were asking for this! More information in the latest release notes: github.com/princeton-nlp/…

thumb_up_off_alt11

repeat5

account_circle

Shunyu Yao

1 month ago

Attention is all we need.
Memory is all we have.

thumb_up_off_alt29

repeat2

account_circle

Ofir Press

@OfirPress

1 month ago

It's been just 10 days since we launched SWE-agent but we already have 1.5k people in our Discord and lots of contributors on GitHub.

We've been making the agent easier to use and there are lots more exciting updates coming soon, including a web UI! Join us :)

account_circle

Tianbao Xie

@TianbaoX

1 month ago

🤔Can we assess agents across various apps & OS w.o. crafting new envs?

OSWorld🖥️: A unified, real computer env for multimodal agents to evaluate open-ended computer tasks with arbitrary apps and interfaces on Ubuntu, Windows, & macOS.

+ annotated 369 real-world computer tasks…

account_circle

Shunyu Yao

1 month ago

Guys, before achieving AGI, we need to solve zork-1?

thumb_up_off_alt18

repeat2

account_circle

Ruibo Liu

@RuiboLiu

1 month ago

Thanks Aran for sharing our work!

This is a survey paper I’ve been thinking about for a long time, as we have seen an increasing need for synthetic data. As we will probably run out of fresh tokens soon, the audience of this paper should be everyone who cares about AI progress.

account_circle

Shunyu Yao

1 month ago

Will visit AGI House for the first time this Saturday and talk about SWE-agent, Agent-Computer Interface (ACI), and answer questions😃

thumb_up_off_alt51

repeat7

account_circle

Shunyu Yao

1 month ago

When I first saw Tree of Thoughts I also asked myself this😀 great exploration into if next-token prediction can simulate search, and if you're interested in this you probably also wanna check out arxiv.org/abs/2309.02427 last paragraph

thumb_up_off_alt74

account_circle

Ofir Press

@OfirPress

1 month ago

You can now download & run SWE-agent (on any GitHub issue) in 1 line!

Check our repo for deets: github.com/princeton-nlp/…

Join our Discord to hear first about updates like this: discord.gg/AVEFbBn2rH

thumb_up_off_alt83

repeat7

account_circle

Shunyu Yao

1 month ago

in some sense, math is the first programming language, and mathematician's mind (+scratchpad) is the first compiler

thumb_up_off_alt45

repeat1

account_circle

Sarah Catanzaro

@sarahcat21

1 month ago

Purposeful pretraining and interface design may be the keys to unlocking reliable LM-based agents.

thumb_up_off_alt20

repeat2

account_circle

Shunyu Yao

1 month ago

deliver like cole

thumb_up_off_alt2

account_circle

Shunyu Yao

1 month ago

People still surprised by such things across pairs among ReAct ToT Reflexion CoALA WebShop SWE-bench SWE-agent😂

thumb_up_off_alt96

repeat3

account_circle

Shunyu Yao

1 month ago

Swe-agent achieves ~ Devin while only using < 2min and $4 per case

thumb_up_off_alt9

account_circle

Sanjeev Arora

@prfsanjeevarora

1 month ago

Fantastic work from our Princeton PLI team. Agent-based AI is clearly a big next step.

thumb_up_off_alt43

repeat6

account_circle

Karthik Narasimhan

@karthik_r_n

1 month ago

SWE-agent is finally out. A few highlights:
1. Agent-Computer Interface (ACI) design will be critical for the success of AI agents, much like HCI is critical for how effective humans are with computers.
2. You can use SWE-agent out of the box on any github issue.
(1/2)

account_circle

Shunyu Yao

1 month ago

What we call reasoning in AI is algorithm in CS

thumb_up_off_alt40

repeat3

account_circle

Shunyu Yao

1 month ago

A neat idea and setup

thumb_up_off_alt6