Arvind Narayanan (@random_walker) Twitter Tweets • TwiCopy

Arvind Narayanan

@random_walker

+ Follow

Princeton CS prof. Director @PrincetonCITP. I write about the societal impact of AI, tech ethics, & social media platforms.
BOOK: AI Snake Oil. Views mine.

ID:10834752

linkhttps://www.cs.princeton.edu/~arvindn/ calendar_today04-12-2007 11:14:14

12,0K Tweets

119,1K Followers

413 Following

Follow People

Kristian Lum

Assoc Research Professor U Chicago | handmaid of corp tech | @FAccTConference OG | Past Twitter META, UPenn CS faculty & @hrdag | [email protected]

+ Follow

Deb Raji

AI accountability, audits & eval. Keen on participation & practical outcomes. Fellow @mozilla, CS PhDing @UCBerkeley. forever @AJLUnited, @hashtag_include ✝️

+ Follow

Thomas G. Dietterich

Distinguished Professor (Emeritus), Oregon State Univ.; Former President, Assoc. for the Adv. of Artificial Intelligence; Robust AI & Comput. Sustainability,

+ Follow

Prof. Anima Anandkumar

Bren Professor @caltech, Sr Director of #AI research @nvidia, Fmr Principal Scientist @awscloud, AI+Science, PDE, Neural operators. Views my own.

+ Follow

Boaz Barak

Computer Scientist. See also https://t.co/EXWR5k634w, https://t.co/SEVX6it6z3 ( @[email protected] , boaz.barak in threads ). Opinions my own.

+ Follow

Michael Lones

@michael_lones

16 hours ago

Great to have been involved in this initiative led by Sayash Kapoor and Arvind Narayanan to (hopefully!) improve the use of machine learning in science. Further thoughts in my Substack post: fetchdecodeexecute.substack.com/p/reforms-a-gu…

thumb_up_off_alt6

chat_bubble_outline0

repeat4

shareShare

account_circle

Peter Henderson

@PeterHndrsn

16 hours ago

To my mind, unconstrained military use of AI is one of the most risky & is underemphasized in policymaking. Military use must be a central part of AI Safety discussions. Glad to see a couple of new pieces emphasizing this point.

ft.com/content/da03f8…

foreignaffairs.com/united-states/…

account_circle

Ethan Zuckerman

@EthanZ

1 day ago

With my brilliant friends at Knight First Amendment Institute, I filed suit against Meta today, asking a federal court to find that CDA section 230 gives users rights to control what they see on social media via third party tools. See our complaint at knightcolumbia.org/cases/zuckerma…

account_circle

Jessica Hullman

@JessicaHullman

1 day ago

Lots of practical advice to help researchers doing ML-based science avoid unintentional irreproducibility and overgeneralization in this new paper led by Sayash Kapoor

thumb_up_off_alt21

chat_bubble_outline0

repeat9

shareShare

account_circle

rishi

@RishiBommasani

1 day ago

REFORMS is an exceptional work by an ensemble cast spanning institutions and disciplines! Check it out!

The approach also directly inspired our work on open foundation models, where we worked towards consensus across folks from different institutions:
crfm.stanford.edu/open-fms/

thumb_up_off_alt13

chat_bubble_outline0

repeat3

shareShare

account_circle

Sayash Kapoor

@sayashk

1 day ago

Excited to share that our paper introducing the REFORMS checklist is now out Science Advances!

In it, we:
- review common errors in ML for science
- create a checklist of 32 items applicable across disciplines
- provide in-depth guidelines for each item

science.org/doi/10.1126/sc…

account_circle

Musa al-Gharbi

@Musa_alGharbi

2 days ago

MTurk is basically junk responses. People often lie about their background characteristics. And they often choose the same answer for most questions, regardless of content, such that you can ask people opposing questions and get completely incoherent results (even after screening…

account_circle

Scott Condron

@_ScottCondron

1 day ago

- Agents are costly and that should be jointly optimized with task accuracy
- Simple baselines like retrying, retrying with different temps, retrying with better models outperform complex Agents on the Pareto frontier of cost/accuracy
- reproducibility & benchmarks continue to be…

thumb_up_off_alt12

chat_bubble_outline0

repeat2

shareShare

account_circle

Veniamin Veselovsky

@VminVsky

2 days ago

put your mouth where your money is

amazing how adding cost on an axis can directionally shift interpretations on many ai agent approaches!

thumb_up_off_alt14

chat_bubble_outline0

repeat6

shareShare

account_circle

Hailey Schoelkopf

@haileysch__

2 days ago

Rigorously evaluating “agents” takes thought!

great work debunking the (cost-normalized) performance of popular coding agents by Arvind Narayanan Sayash Kapoor Benedikt Ströbl .

thumb_up_off_alt25

chat_bubble_outline0

repeat4

shareShare

account_circle

Arvind Narayanan

@random_walker

2 days ago

We fully recognize that there are downsides to reporting dollar costs in evals, given that costs change quickly. The difference in our perspective comes down to model evaluation vs downstream evaluation (our focus is the latter). aisnakeoil.com/p/ai-leaderboa…

thumb_up_off_alt11

chat_bubble_outline0

repeat2

shareShare

account_circle