Adina Williams (@adinamwilliams) 's Twitter Profile
Adina Williams

@adinamwilliams

Computational linguistics, cognitive science, NLP; semantics + syntax; @AIatMeta (FAIR NYC); formerly @nyuling
@[email protected]

ID: 214639688

linkhttp://www.adinawilliams.com calendar_today11-11-2010 21:55:03

1,1K Tweet

3,3K Followers

928 Following

Hannah Rose Kirk (@hannahrosekirk) 's Twitter Profile Photo

Today we're launching PRISM, a new resource to diversify the voices contributing to alignment. We asked 1500 people around the world for their stated preferences over LLM behaviours, then we observed their contextual preferences in 8000 convos with 21 LLMs arxiv.org/abs/2404.16019

Today we're launching PRISM, a new resource to diversify the voices contributing to alignment. We asked 1500 people around the world for their stated preferences over LLM behaviours, then we observed their contextual preferences in 8000 convos with 21 LLMs arxiv.org/abs/2404.16019
NextGenAISafety Workshop @ ICML'24 (@ng_ai_safety) 's Twitter Profile Photo

🚀Thrilled to launch the Workshop on the Next Generation of AI Safety at #ICML2024! Dive into the future of AI safety. CFP & more details 👉 icml-nextgenaisafety.github.io #NextGenAISafety #ICML2024

Hannah Rose Kirk (@hannahrosekirk) 's Twitter Profile Photo

New week, new blog! In collab with MLCommons, we wrote up a summary of the PRISM alignment project, covering what motivated us to collect feedback from 1,500 humans around the world, how we collected it & which challenges we encountered on the road 🙇‍♀️ mlcommons.org/2024/05/prism/

alphaXiv (@askalphaxiv) 's Twitter Profile Photo

🚨 New from Meta AI 🚨 Want to learn more about VLMs? Ask Florian Bordes from AI at Meta about their latest paper “An Introduction to Vision-Language Models”. This week’s paper of the week 👇 alphaxiv.org/abs/2405.17247

Hannah Rose Kirk (@hannahrosekirk) 's Twitter Profile Photo

🌎Introducing LINGOLY, our new reasoning benchmark that stumps even top LLMs (best models only reach ~35% accuracy)🥴 In a colab between University of Oxford, Stanford University and UK Linguistic Olympiad puzzle authors, we stress test LLMs on over 90 low-resource and extinct languages...

🌎Introducing LINGOLY, our new reasoning benchmark that stumps even top LLMs (best models only reach ~35% accuracy)🥴
In a colab between <a href="/UniofOxford/">University of Oxford</a>, <a href="/Stanford/">Stanford University</a> and UK Linguistic Olympiad puzzle authors, we stress test LLMs on over 90 low-resource and extinct languages...
Adina Williams (@adinamwilliams) 's Twitter Profile Photo

I've got a cool opportunity to share! We're looking for folks to create safety prompts for our upcoming safety benchmark MLCommons AI Safety Initiative, and we have some funding to share with qualified orgs! Check it out or sign up here by Jul 19: mlcommons.org/ai-safety/ai-s…

Vipul Gupta (@vipul_1011) 's Twitter Profile Photo

🚨There is serious lack of robustness with MMLU! In our new work we find that “Changing Answer Order Can Decrease MMLU Accuracy” and the accuracy of top models can drop by 10-20%📉 This means leaderboards might not be as reliable as we thought! 📄arxiv.org/abs/2406.19470 (1/N)

🚨There is serious lack of robustness with MMLU!

In our new work we find that “Changing Answer Order Can Decrease MMLU Accuracy” and the accuracy of top models can drop by 10-20%📉
This means leaderboards might not be as reliable as we thought!
📄arxiv.org/abs/2406.19470

(1/N)
Usman Gohar (@usmangohar) 's Twitter Profile Photo

🚨Excited to share that the V2 of the social impact paper, led by the incredible Irene Solaiman and Zeerak Talat (زیرک تلت) Zeerak@ mastodon|bsky, is finally out! In this work, we present a guide for evaluating the social impact of Gen AI systems across categories & modalities. 🧵(1/7) arxiv.org/pdf/2306.05949

Roger Levy (@roger_p_levy) 's Twitter Profile Photo

Video recordings from the #NSF-sponsored workshop New Horizons in Language Science: Large Language Models, Language Structure, and the Cognitive & Neural Basis of language are now publicly available! All videos are linked to from the workshop website: newhorizonsinlanguagescience.github.io 1/

Ethan Gotlieb Wilcox (@wegotlieb) 's Twitter Profile Photo

🐘🙋 New Preprint🙋 🐘 Bigger is not always better: The importance of human-scale language modeling for psycholinguistics osf.io/preprints/psya…

Alex Warstadt (@a_stadt) 's Twitter Profile Photo

I'm at #CogSci2024 giving a poster TODAY from 13-14:15 about the past and future of BabyLM! Come chat about computational models of language acquisition and data-efficient pretraining!

Adina Williams (@adinamwilliams) 's Twitter Profile Photo

fyi: Meta's funding a few new grants for faculty to create evaluation benchmarks (deadline Sept 7). Could be of interest! #NLProc llama.meta.com/llm-evaluation…

Paul Röttger (@paul_rottger) 's Twitter Profile Photo

We won OUTSTANDING PAPER at #ACL2024 for our work on evaluating values and opinions in LLMs 🥳 Thank you to the reviewers and awards committee, and again to my amazing co-authors, especially joint first author Valentin Hofmann Check out the poster + paper + full author list below 👇

We won OUTSTANDING PAPER at #ACL2024 for our  work on evaluating values and opinions in LLMs 🥳

Thank you to the reviewers and awards committee, and again to my amazing co-authors, especially joint first author <a href="/vjhofmann/">Valentin Hofmann</a>

Check out the poster + paper + full author list below 👇
WikiResearch (@wikiresearch) 's Twitter Profile Photo

"Are Female Carpenters like Blue Bananas? A Corpus Investigation of Occupation Gender Typicality" finds relationship between gender mentioning in Wikipedia and Reddit and actual (from BLS-Labor Statistics) femaleness of occupations. (Ju et al, 2023) arxiv.org/pdf/2408.02948 Adina Williams