Twitter #AIsafety hashtag • TwiCopy

Lab Horizons

8 hours ago

Exploring the frontier of AI safety, new policy forum discussed a roadmap to preempt the risks of advanced AI agents, advocating for robust regulations to secure humanity’s oversight. Read more: labhorizons.co.uk/2024/04/outsma… #LabHorizons #ai #artificialintelligence #ai safety

thumb_up_off_alt0

chat_bubble_outline0

account_circle

Technical AI Safety Conference (TAIS)

13 hours ago

In his talk at #TAIS2024 , Manuel Baltieri shared the concepts of active inference and the free energy principle, highlighting their significance in #AIsafety . He explained how these ideas contribute to defining 'what agents are' and 'what agents do', particularly emphasizing the…

In his talk at #TAIS2024, @manuelbaltieri shared the concepts of active inference and the free energy principle, highlighting their significance in #AIsafety. He explained how these ideas contribute to defining 'what agents are' and 'what agents do', particularly emphasizing the…

thumb_up_off_alt7

chat_bubble_outline0

account_circle

I N F I N I M I N D

23 hours ago

Maximizing the Power of AI: Building Truth-Seeking Machines
#AIethics #TruthSeekingAI #AISafety #AIprinciples #AIforHumanity #FutureofTechnology #MachineLearning #AIInnovation #TechnologyEthics #BuildingBetterAI

thumb_up_off_alt0

chat_bubble_outline0

account_circle

AllAboutAI

6 days ago

The US and UK unite for AI safety. A new partnership aims to set global standards and
ensure AI technologies are developed responsibly.
Hit the link in bio and get ready for your mind to be blown!
#A ISafety #TechCollaboration #A llAboutAI #A rtificialIntelligence #A

thumb_up_off_alt1

chat_bubble_outline0

account_circle

LongLimbsLenore

3 days ago

AI Safety Fundamentals

@aisafety_course

thumb_up_off_alt3

chat_bubble_outline0

account_circle

Zhijing Jin@ICLR/COLING/NAACL

4 days ago

I'll be at #ICLR2024 from Mon to Sun!

Happy to chat w/ students interested in applying for PhDs, and connect w/ researchers in #LLMs #Causality #AISafety .

I'll be presenting our paper on Correlation-to-Causation (Corr2Cause) Inference for LLMs on Thu: arxiv.org/abs/2306.05836🎉

I'll be at #ICLR2024 from Mon to Sun!

Happy to chat w/ students interested in applying for PhDs, and connect w/ researchers in #LLMs #Causality #AISafety.

I'll be presenting our paper on Correlation-to-Causation (Corr2Cause) Inference for LLMs on Thu: arxiv.org/abs/2306.05836🎉

thumb_up_off_alt103

chat_bubble_outline0

account_circle

Sanjay Puri

4 hours ago

Is the US set to lead in global tech and innovation? Joe Morelle critiques the nation's R&D strategy in our latest episode, highlighting the need for a cohesive approach to technological innovation. Click for the full episode

#AIRegulation #AISafety #AIStandard

thumb_up_off_alt0

chat_bubble_outline0

account_circle

Sanjay Puri

2 days ago

How can AI be a force for good? Dive into a conversation with Trooper Sanders of Benefits Data Trust on crafting trustworthy AI. His unique insights provide a roadmap for ethical AI regulation. Click here for more.

#AIRegulation #AISafety #AIStandard

thumb_up_off_alt0

chat_bubble_outline0

account_circle

dataonmatrix

4 hours ago

Let’s explore some of the key dangers associated with AI.

#artificialintelligence #AI #AI risks #EthicalAI #AI regulation #TechEthics #AI safety #FutureofAI #AI accountability #JobDisplacement #DataSecurity #security #risks

Let’s explore some of the key dangers associated with AI.

#artificialintelligence #AI #AIrisks #EthicalAI #AIregulation #TechEthics #AIsafety #FutureofAI #AIaccountability #JobDisplacement #DataSecurity #security #risks

thumb_up_off_alt0

chat_bubble_outline0

account_circle

DollarStore Cowboy

@DollrStorCowboy

1 day ago

AI Safety Fundamentals

@aisafety_course

thumb_up_off_alt0

chat_bubble_outline0

account_circle

Google Cloud Security

@GoogleCloudSec

1 day ago

Heather Adkins - Ꜻ - Spes consilium non est joined a thought-provoking panel of experts to discuss the effective ways organizations can harness the power of AI and the state of AI safety with regards to legislation. Stay tuned for more insights from RSA!

#RSAC #GenAI #AISafety #Cybersecurity

@argvee joined a thought-provoking panel of experts to discuss the effective ways organizations can harness the power of AI and the state of AI safety with regards to legislation. Stay tuned for more insights from RSA!

#RSAC #GenAI #AISafety #Cybersecurity

thumb_up_off_alt1

chat_bubble_outline0

account_circle

DailyAI

@DailyAIOfficial

4 days ago

dailyai.com/2024/04/dhs-la…

#AI #AI Safety #OpenSource #HomelandSecurity #AI ethics #TechPolicy #FutureOfAI #LLMs

dailyai.com/2024/04/dhs-la…

#AI #AISafety #OpenSource #HomelandSecurity #AIethics #TechPolicy #FutureOfAI #LLMs

thumb_up_off_alt6

chat_bubble_outline0

account_circle

NformAI

1 week ago

AI is changing the game in public safety - from predicting crimes to optimizing emergency responses. Read our latest article to find out how. #AISafety #PublicSecurity #TechInnovation

AI is changing the game in public safety - from predicting crimes to optimizing emergency responses. Read our latest article to find out how. #AISafety #PublicSecurity #TechInnovation

thumb_up_off_alt0

chat_bubble_outline0

account_circle

Technical AI Safety Conference (TAIS)

4 days ago

At #TAIS2024 , Dan Hendrycks, director of Center for AI Safety, unveiled his presentation on the WMDP Benchmark, focusing on measuring and mitigating malicious usage through unlearning. He introduced CUT, a cutting-edge unlearning technique. Watch now: youtu.be/cHPlQTJqtGw
#AIsafety

At #TAIS2024, @DanHendrycks, director of @ai_risks, unveiled his presentation on the WMDP Benchmark, focusing on measuring and mitigating malicious usage through unlearning. He introduced CUT, a cutting-edge unlearning technique. Watch now: youtu.be/cHPlQTJqtGw
#AIsafety

thumb_up_off_alt6

chat_bubble_outline0

account_circle

Technical AI Safety Conference (TAIS)

1 day ago

In their talk at #TAIS2024 , James Fox and @mattmacdermott1 explored the interconnectedness of causality, agency and #AIsafety . They illustrated potential real-world implementations of their theoretical insights by presenting their strategies for creating 'agency detectors'.…

In their talk at #TAIS2024, @James_D_Fox and @mattmacdermott1 explored the interconnectedness of causality, agency and #AIsafety. They illustrated potential real-world implementations of their theoretical insights by presenting their strategies for creating 'agency detectors'.…

thumb_up_off_alt2

chat_bubble_outline0

account_circle

Sanjay Puri

1 week ago

Is the US set to lead in global tech and innovation? Joe Morelle critiques the nation's R&D strategy in our latest episode, highlighting the need for a cohesive approach to technological innovation. Click for the full episode

#AIRegulation #AISafety #AIStandard

thumb_up_off_alt2

chat_bubble_outline0

account_circle

Xuanli He

1 week ago

🚨 New Paper! (arxiv.org/abs/2404.19597)🚨 We uncover significant vulnerabilities in Multilingual LLMs (MLLMs) (e.g., BLOOM, Llama2, Llama3, Gemma, and GPT-3.5-turbo) to cross-lingual transferable backdoor attacks. #AIsafety #LLMs #backdoors

🚨 New Paper! (arxiv.org/abs/2404.19597)🚨 We uncover significant vulnerabilities in Multilingual LLMs (MLLMs) (e.g., BLOOM, Llama2, Llama3, Gemma, and GPT-3.5-turbo) to cross-lingual transferable backdoor attacks. #AIsafety #LLMs #backdoors

thumb_up_off_alt23

chat_bubble_outline0

account_circle

Rick Roane

@ChiefAdversary

1 week ago

AI deepfake tech and products are exploding with success right now, but so is the threat to innocent bystanders. What more can we do to prevent abuse before product launches and adoption? #Deepfake #AI #ArtificialIntelligence #Auventic #ClearedContact #AI Safety Auventic, Inc.…

AI deepfake tech and products are exploding with success right now, but so is the threat to innocent bystanders. What more can we do to prevent abuse before product launches and adoption? #Deepfake #AI #ArtificialIntelligence #Auventic #ClearedContact #AISafety @auventic…

thumb_up_off_alt13

chat_bubble_outline0

account_circle

SPAR

21 hours ago

SPAR is now accepting technical safety and AI governance mentees for Summer 2024 (June 14 - Sep 7)!

Apply here:tinyurl.com/SPAR-Mentee by 5/24!

SPAR provides opportunities to work with mentors to develop valuable experience in #AIsafety

thumb_up_off_alt1

chat_bubble_outline0

account_circle

Diana Wolf Torres

4 hours ago

Check out the latest article in my newsletter: Unraveling the Paperclip Alignment Problem: A Cautionary Tale in AI Development linkedin.com/pulse/unraveli… via LinkedIn

#ethicalai #aisafety #deeplearning

thumb_up_off_alt0

chat_bubble_outline0

account_circle