Seth Lazar (@sethlazar)'s Twitter Profile
Seth Lazar

@sethlazar

ANU Philosophy Prof working on normative philosophy of computing. This place is bad. Find my work at linktree below

ID:351808995

Link: http://linktr.ee/sethlazar
Joined: 09-08-2011 19:21:17

1,6K Tweets

6,7K Followers

1,9K Following

Max Lamparth (@MLamparth)

Our new op-ed in Foreign Affairs with Jacquelyn Schneider argues against using language models for military strategy & decisions. We base our discussion on our research and how we create and train these models. Inherent limitations make them unreliable for high-stake applications.

David Hering (@hering_david)

Increasingly convinced there was a prelapsarian Tower of Babel moment where everyone used the same plug, but people angered the god of electricity and now we have this

ACM FAccT (@FAccTConference)

⌛️ Only a few days left before registration fees increase for ! Register by **Monday** to take advantage of early bird rates: cvent.me/xBrvwq

Jeremy Howard (@jeremyphoward)

There's a new bill, SB-1047 'Safe and Secure Innovation for Frontier Artificial Intelligence Models Act'.

I think it could do a great deal of harm to startups, American innovation, open source, and safety. So I've written a response to the authors: 🧵
answer.ai/posts/2024-04-…

Astribot (@Astribot_Inc)

Meet Astribot S1: the Next-Gen AI Robot.

- AI-powered. Unparalleled agility, dexterity and accuracy.
- The future of AI Robot is here, and it's Naturally Yours.

➡ Astribot.com

Astribot S1: Hello World! youtu.be/AePEcHIIk9s?si… via YouTube

Seth Lazar (@sethlazar)

Proud of MINT Lab PhD student JakeStone for this excellent paper with Brent Mittelstadt on ignoring legitimacy in algorithmic decisions, accepted to ACM FAccT 2024! arxiv.org/abs/2404.15680

Seth Lazar (@sethlazar)

New from me in The Guardian. Not sure AI summarisation is quite ready yet to support communicative democracy: theguardian.com/commentisfree/… (thanks to Matt Boulos for some inspiration here)

Seth Lazar (@sethlazar)

Would love this in Oz, cannot remember last time a domestic flight I booked out of CBR wasn’t delayed or cancelled.

Hannah Rose Kirk (@hannahrosekirk)

Today we're launching PRISM, a new resource to diversify the voices contributing to alignment. We asked 1500 people around the world for their stated preferences over LLM behaviours, then we observed their contextual preferences in 8000 convos with 21 LLMs arxiv.org/abs/2404.16019

chrisrohlf (@chrisrohlf)

The vast majority of people expressing concern over AI + cyber have no experience or background in cyber security. If you’re in this camp I’ve got some sobering news for you, sophisticated and low skill attackers alike are already compromising “critical infrastructure” and thats

AK (@_akhaliq)

OpenAI presents The Instruction Hierarchy

Training LLMs to Prioritize Privileged Instructions

Today's LLMs are susceptible to prompt injections, jailbreaks, and other attacks that allow adversaries to overwrite a model's original instructions with their own malicious prompts.

Jeremy Howard (@jeremyphoward)

Today at Answer.AI we've got something new for you: FSDP/QDoRA. We've tested it with AI at Meta Llama3 and the results blow away anything we've seen before.

I believe that this combination is likely to create better task-specific models than anything else at any cost. 🧵

Robert Gorwa | @rg@someone.elses.computer (@rgorwa)

Friends: I'm indescribably chuffed to share that my first book is finally done and out shortly!

'The Politics of Platform Regulation' 🔓🔜is my 6+yearlong empirical exploration of how governments worldwide try to shape tech's trust & safety practices...
global.oup.com/academic/produ…

Yoav Artzi (@yoavartzi)

We created reviewing guidelines for Conference on Language Modeling. Not intended to automate the committee work, or dictate constraints. But, to inspire a thoughtful reviewing process, for an exciting and impactful program of the highest possible quality. We have a wonderful program committee ❤️
