Trustworthy ML Initiative (TrustML) (@trustworthy_ml)'s Twitter Profile
Trustworthy ML Initiative (TrustML)

@trustworthy_ml

Latest research in Trustworthy ML. Organizers: @JaydeepBorkar @sbmisi @hima_lakkaraju @sarahookr Sarah Tan @chhaviyadav_ @_cagarwal @m_lemanczyk @HaohanWang

ID: 1262375165490540549

Website: https://www.trustworthyml.org · Joined: 18-05-2020 13:31:24

1.1K Tweets

6.6K Followers

66 Following

Florian Tramèr (@florian_tramer)

🔥 We're releasing the strongest membership inference attack for foundation models! 🔥
Our attack applies to LLMs, VLMs, CLIP, and diffusion models, and is SOTA on all of them🥇

Not only is our attack a magnificent breakthrough, it is also *magic*: we don't look at the ML model at all🪄
🧵👇
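For readers wondering how an attack can work without ever touching the model: below is a rough sketch of the idea behind a "blind" membership-inference baseline that exploits distribution shift between candidate member and non-member sets. The toy data, classifier choice, and names are illustrative assumptions, not the authors' code.

```python
# Illustrative sketch (not the authors' code): a "blind" membership-inference
# baseline that never queries the target model. If the candidate member and
# non-member sets come from slightly different distributions, a plain text
# classifier alone can separate them. The toy data below are placeholders.
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import LogisticRegression
from sklearn.model_selection import train_test_split
from sklearn.metrics import roc_auc_score

members = ["passage believed to be in the training set",
           "older web text written before the training cutoff"]
non_members = ["freshly written article from after the cutoff",
               "newly scraped page that cannot be in the training data"]

texts = members + non_members
labels = [1] * len(members) + [0] * len(non_members)

train_x, test_x, train_y, test_y = train_test_split(
    texts, labels, test_size=0.5, stratify=labels, random_state=0)

vec = TfidfVectorizer().fit(train_x)                      # features from text only
clf = LogisticRegression().fit(vec.transform(train_x), train_y)
scores = clf.predict_proba(vec.transform(test_x))[:, 1]   # "membership" scores
print("model-free membership AUC:", roc_auc_score(test_y, scores))
```

If such a model-free classifier already reaches high AUC, the member/non-member split itself is leaking the answer, which is the sense in which the attack needs no model access.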
A. Feder Cooper (@afedercooper)

Here's my last PhD paper-- “The Files are in the Computer: Copyright, Memorization, and Generative-AI Systems”

James Grimmelmann & I address ambiguity over the relationship between copying and memorization: when a (near-)exact copy of training data can be reconstructed from a model.
Ilia Shumailov🦔 (@iliaishacked)

Unlearning, originally proposed for privacy, is today often discussed as a content-regulation tool: if my model doesn't know X, it is safe. We argue that unlearning provides only an illusion of safety, since adversaries can inject malicious knowledge back into the models.
arxiv.org/pdf/2407.00106
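A hedged toy illustration of the reinjection concern (my example, not the paper's setup): an adversary who still holds the removed content can simply supply it back in the prompt, so an "unlearned" model behaves much as before. The model name and strings below are placeholders.

```python
# Toy illustration (placeholder model and strings): reintroducing "unlearned"
# knowledge in-context. Unlearning changes the weights, but it cannot stop an
# adversary from handing the removed content straight back to the model.
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # stand-in for an unlearned checkpoint
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

removed_fact = "X is the piece of knowledge the model was supposed to forget."
prompt = f"Context: {removed_fact}\nQuestion: What do you know about X?\nAnswer:"

inputs = tok(prompt, return_tensors="pt")
output = model.generate(**inputs, max_new_tokens=40)
print(tok.decode(output[0], skip_special_tokens=True))
```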
Niloofar Mireshghallah (@niloofar_mire)

Our work on the challenges and inconclusiveness of membership inference attacks on LLMs has been accepted to the Conference on Language Modeling!! arxiv.org/abs/2402.07841 This work has instigated new directions and many conversations on MIA evaluations; I will list them here in this thread, feel free to add to it!

Weijia Shi (@weijiashi2)

Can 𝐦𝐚𝐜𝐡𝐢𝐧𝐞 𝐮𝐧𝐥𝐞𝐚𝐫𝐧𝐢𝐧𝐠 make language models forget their training data?

We show yes, but at the cost of privacy and utility. Current unlearning scales poorly with the size of the data to be forgotten and can't handle sequential unlearning requests.

🔗:
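For context on why forgetting tends to hurt utility, here is a minimal sketch of one common unlearning baseline, gradient ascent on the forget set; the model name, data, and hyperparameters are illustrative assumptions, not the paper's setup.

```python
# Minimal sketch of a gradient-ascent unlearning baseline (illustrative; not the
# paper's code). We maximize the LM loss on the forget set, which pushes the model
# away from that data but also tends to damage general utility as the forget set grows.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "gpt2"  # placeholder; any causal LM works
tok = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)
model.train()

forget_texts = ["a memorized passage that should be forgotten"]  # placeholder forget set
opt = torch.optim.AdamW(model.parameters(), lr=1e-5)

for _ in range(3):  # a few ascent passes over the forget set
    for text in forget_texts:
        batch = tok(text, return_tensors="pt")
        loss = -model(**batch, labels=batch["input_ids"]).loss  # negate: ascend, not descend
        opt.zero_grad()
        loss.backward()
        opt.step()
```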
Niloofar Mireshghallah (@niloofar_mire)

When talking about the personal data people share with OpenAI and the privacy implications, I get the 'come on! people don't share that with ChatGPT!🫷'

In our Conference on Language Modeling paper, we study disclosures and find many concerning⚠️ cases of sensitive information sharing:

tinyurl.com/ChatGPT-person…
The GenLaw Center (@genlawcenter)

Really excited to announce this year's list of accepted papers + spotlights! We had _so_ many submissions, many wonderful reviewers, and a final list of 66 accepted papers. 

Full papers will be posted before the workshop on July 27th!

genlaw.org/2024-icml-pape…
Eoin Delaney (@eoindelaney_)

🚨New paper and fairness toolkit alert🚨 Announcing OxonFair: A Flexible Toolkit for Algorithmic Fairness, w/ Zihao Fu, Sandra Wachter, Brent Mittelstadt and Chris Russell.
Toolkit: github.com/oxfordinternet…
Paper: papers.ssrn.com/sol3/papers.cf…

Swarnadeep Saha (@swarnanlp)

🚨 New: my last PhD paper 🚨

Introducing System-1.x, a controllable planning framework with LLMs. It draws inspiration from Dual-Process Theory, which argues for the co-existence of fast/intuitive System-1 and slow/deliberate System-2 planning.

System 1.x generates hybrid plans
Besmira Nushi 💙💛 (@besanushi)

Microsoft Research Cambridge UK is hiring on topics related to equitable and responsible multi-modal AI. The team pursues efforts at the intersection of vision and language and is passionate about all aspects of #ResponsibleAI including fairness, reliability, interpretability. I

Pin-Yu Chen (@pinyuchentw)

🚩(1/2) 
Please help forward the Call for the 2024 Adversarial Machine Learning (AdvML) Rising Star Awards!

We promote junior researchers in AI safety, robustness, and security. Award events are hosted at the AdvML-Frontiers workshop at the NeurIPS Conference 2024.

Info: sites.google.com/view/advml/adv…
sijia.liu (@sijialiu17)

The 3rd AdvML-Frontiers Workshop (AdvMLFrontiers, advml-frontier.github.io) is set for #NeurIPS 2024 (NeurIPS Conference)! This year, we're delving into the expansion of the trustworthy AI landscape, especially in large multi-modal systems. Trustworthy ML Initiative (TrustML)
LLM Security🚀

We're now
𝙷𝚒𝚖𝚊 𝙻𝚊𝚔𝚔𝚊𝚛𝚊𝚓𝚞 (@hima_lakkaraju)

Excited to announce the 2nd edition of our Regulatable ML workshop at the NeurIPS Conference! We plan to debate burning questions around the regulation of generative #AI and Artificial General Intelligence (#AGI).

We are accepting submissions until Aug. 30th -- regulatableml.github.io [1/N]
Canyu Chen (@canyuchen3)

🤔Are your open-source LLMs really safe? 
🚨They may be injected with misinformation or bias!

Our new paper "𝐂𝐚𝐧 𝐄𝐝𝐢𝐭𝐢𝐧𝐠 𝐋𝐋𝐌𝐬 𝐈𝐧𝐣𝐞𝐜𝐭 𝐇𝐚𝐫𝐦?" (Project website: llm-editing.github.io ) sheds light on the emerging challenges of LLMs, especially the
Chhavi Yadav (@chhaviyadav_)

📰 Excited to be organizing a workshop on interpretability at the NeurIPS Conference '24, called 'Interpretable AI: Past, Present and Future'. Submit to our workshop for all things inherently interpretable!
Submission deadline: 30 Aug
🔗 interpretable-ai-workshop.github.io
Follow this account for updates!

Konrad Rieck 🌈 (@mlsec)

🚨 We are extending the Call for Papers for the 3rd IEEE Conference on Secure and Trustworthy Machine Learning (SaTML)!

👉 satml.org/participate-cf…
⏰ New Deadline: Sep 27

This extension gives you more time to submit your best work on secure AI algorithms and systems😉