Max Lamparth (@mlamparth) Twitter Tweets • TwiDoom

Max Lamparth

@mlamparth

+ Follow

Postdoc at @Stanford, @StanfordCISAC, Stanford Center for AI Safety, and the SERI program | Focusing on interpretability, robustness, and ethical AI/LLMs.

ID: 1588663024969125888

linkhttp://www.maxlamparth.com calendar_today04-11-2022 22:43:21

373 Tweet

613 Followers

488 Following

Max Lamparth

@mlamparth

a month ago

🚨 Our paper was accepted at AI, Ethics, and Society Conference (AIES) happening in October! #AIES We compare human national security experts vs. LLM simulations in wargames. The results? Surprising differences in decision-making that could impact real-world conflicts. Paper: arxiv.org/pdf/2403.03407

thumb_up_off_alt25

chat_bubble_outline2

repeat8

shareShare

Max Lamparth

@mlamparth

a month ago

🚨 Our paper was accepted for Conference on Language Modeling! As we face a mental health crisis and lack of access to professional care, many turn to AI as a solution. But how does ethical automated care look like and are models safe enough for patients? Paper: arxiv.org/abs/2406.11852

thumb_up_off_alt27

chat_bubble_outline1

repeat11

shareShare

xuan (ɕɥɛn / sh-yen)

@xuanalogue

16 days ago

Should AI be aligned with human preferences, rewards, or utility functions? Excited to finally share a preprint that Micah Carroll Matija Franklin Hal Ashton & I have worked on for almost 2 years, arguing that AI alignment has to move beyond the preference-reward-utility nexus!

Should AI be aligned with human preferences, rewards, or utility functions?

Excited to finally share a preprint that <a href="/MicahCarroll/">Micah Carroll</a> <a href="/FranklinMatija/">Matija Franklin</a> <a href="/hal_ashton/">Hal Ashton</a> & I have worked on for almost 2 years, arguing that AI alignment has to move beyond the preference-reward-utility nexus!

thumb_up_off_alt673

chat_bubble_outline21

repeat123

shareShare

Seth Lazar

@sethlazar

16 days ago

Incredibly valuable work from a stellar group: must read!

thumb_up_off_alt31

chat_bubble_outline2

repeat2

shareShare

Max Lamparth

@mlamparth

16 days ago

Great and important work by xuan (ɕɥɛn / sh-yen) et al.!

thumb_up_off_alt6

chat_bubble_outline1

repeat0

shareShare

Sonia 🌻

@soniajoseph_

14 days ago

The woman who is in Founder Mode getting called a “toxic leader” has its own version in AI research for female AI researchers as well. Breaking ppl’s preconceptions can lead to a lot of unnecessary fear and stigma. Kudos to Sophia for this post ✨

thumb_up_off_alt8

chat_bubble_outline1

repeat0

shareShare

Seth Lazar

@sethlazar

14 days ago

Couple weeks left to get in your abstracts. We'll also have invited papers from Henry Farrell and Hahrie Han, Arvind Narayanan, Alondra Nelson, Deirdre K. mulligan, Daniel Susskind and others—leading thinkers on politics, economics, and technologies of democracy. V keen for PhDs/ECRs to apply.

thumb_up_off_alt15

chat_bubble_outline0

repeat6

shareShare

Diyi Yang

@diyi_yang

10 days ago

Can #LLMs generate novel research ideas? Check out our one year-long human study with 100+ NLP researchers, done by the amazing CLS

thumb_up_off_alt251

chat_bubble_outline2

repeat31

shareShare

Sasha Luccioni, PhD 🦋🌎✨🤗

@sashamtl

10 days ago

I used to be shy to publicly state that I had both of my kids during my PhD, but now I'm like: "Damn straight, I had two babies AND STILL managed to become a doctor in 5 years!" 👩‍🎓Academic Mom, PhD

thumb_up_off_alt207

chat_bubble_outline14

repeat6

shareShare

Danny Halawi

@dannyhalawi15

10 days ago

The results in "LLMs Are Superhuman Forecasters" don't hold when given another set of forecasting questions. I used their codebase (models, prompts, retrieval, etc.) to evaluate a new set of 324 questions—all opened after November 2023. Findings: Their Brier score: .195 Crowd

thumb_up_off_alt395

chat_bubble_outline7

repeat39

shareShare

Diyi Yang

@diyi_yang

9 days ago

🚨 PrivacyLens evaluates #LLMs' privacy awareness using contextual integrity theory! 🔍 It goes beyond static QA to detect privacy leaks 🔐 during LLMs agents' task execution process! Overall, a multi-level approach to evaluate #privacy awareness of LLMs in action ✨

thumb_up_off_alt20

chat_bubble_outline0

repeat3

shareShare