Emma Strubell (@strubell) Twitter Tweets • TwiCopy

Emma Strubell

@strubell

assistant professor @LTIatCMU & visiting scientist @allen_ai. natural language processing and efficient ML. she/her/dad (2 dogs). 🏳️‍🌈. hiking and food. BLM.

ID:71544226

Link: http://strubell.github.io | Joined: 04-09-2009 14:13:53

1.0K Tweets

4.0K Followers

930 Following

Follow People

(((ل()(ل() 'yoav))))👾

Delip Rao e/σ

Busy inventing the shipwreck. Affiliations: @JHU, @Penn, @UCSC, @Amazon, @Twitter || Art: #NLProc, Vision, Speech, #DeepLearning || Life: 道元, improv, running 🌈

Sasha Rush

Professor, Programmer in NYC. Cornell, Hugging Face 🤗 https://t.co/hD1g4W4wR4

Yoav Artzi

NLP, ML researcher/professor @cs_cornell (NYC🚡@cornell_tech) @ https://t.co/9YnWry7yHs / @asapptech / asso. faculty director @arxiv 🟦☁️ https://t.co/y19dwNI8Ek 🧶 @yoavartzi

Kyunghyun Cho

a combination of a mediocre scientist, a mediocre manager, a mediocre advisor & a mediocre PC at @nyuniversity (@CILVRatNYU) & @genentech (@PrescientDesign).

Sang Choe

@sangkeun_choe

6 months ago

High-quality data is a key to successful pretrain/finetuning in the GPT era, but manual data curation is expensive💸 We tackle data quality challenges involving large models and datasets with ScAlable Meta leArning (SAMA) #NeurIPS2023 💫

Arxiv: arxiv.org/abs/2310.05674
🧵 (1/n)

Luca Soldaini 🎀

@soldni

6 months ago

Just released v0.9.0 of the Dolma toolkit 🍇 Lots of goodies (dataset tokenization support, new taggers, data analysis, etc), but the one I'm most proud of is that we now have....

✨ proper documentation 💫

check it out at github.com/allenai/dolma/…, or `pip install dolma` 😊

Language Technologies Institute | @CarnegieMellon

@LTIatCMU

6 months ago

The LTI is hosting an information session for applicants to the MLT and PhD programs on Nov 8, 2023, 12-1 PM ET. If you would like to attend, please RSVP and send us your questions through this form: forms.gle/fGjDiTAKypwD2f…

Sanket Vaibhav Mehta (SVM)

@sanketvmehta

7 months ago

Our paper (w/ Darshan Patil, Sarath Chandar & Emma Strubell) “An Empirical Investigation of the Role of Pre-training in Lifelong Learning” is now officially published in #JMLR (will be presented at #NeurIPS2023 Journal-to-Conference Track)!
Paper 👉 jmlr.org/papers/v24/22-…
🧵👇 (1/n)

AllenNLP

@ai2_allennlp

9 months ago

The deadline for Spring 2024 Research Internships at AllenNLP is July 15th, in two weeks. If you think 2024 is a great time to do NLP research with top mentors, apply at boards.greenhouse.io/thealleninstit…!

Jesse Dodge

@JesseDodge

11 months ago

Today Google announced PaLM 2. In their 91 page paper they repeatedly say the training data is key ('we find that the data mixture is a critical component of the final model') while providing almost no information about how it was constructed, how it was sourced, or its contents.

Allen Institute for AI

@allen_ai

11 months ago

Today we're thrilled to announce our new undertaking to collaboratively build the best open language model in the world: AI2 OLMo.

Uniquely open, 70B parameters, coming early 2024 – join us!

blog.allenai.org/announcing-ai2…

Cohere For AI

@CohereForAI

1 year ago

Our cross-institutional collaboration, Efficient Methods for Natural Language Processing, has been accepted for publication at TACL! 🎉

You can find the pre-print at: arxiv.org/abs/2209.00099

26 likes, 0 replies, 7 retweets

Roy Schwartz

@royschwartzNLP

1 year ago

AI models are becoming dangerously powerful. How can we effectively regulate them?

We propose a simple regulation to address the spread of misinformation⚠️: any AI-generated photorealistic image must have a visible watermark 🔖
tinyurl.com/zsf7zc3h

👇
(1/n)

@timnitGebru@dair-community.social on Mastodon

@timnitGebru

1 year ago

“It’s not okay to install these by default,” says David Gray Widder...who became one of the department’s most vocal voices against Mites. “I don’t want to live in a world where one’s employer installing networked sensors...”
technologyreview.com/2023/04/03/107…

by Eileen Guo & Tate Ryan-Mosley

Emma Strubell

@strubell

1 year ago

We, members of the research community, have the power to shape what is or is not considered good science.

Check out our blog post for discussion and recommendations on what to do about the rise of closed models like GPT-4:

46 likes, 0 replies, 6 retweets

Leon Derczynski ✍🏻🌲☕

@LeonDerczynski

1 year ago

ChatGPT not best at many language tasks. It's outranked by other systems on many NLP benchmarks in current evaluation. For 77.5% of tasks examined, other systems are better than ChatGPT.

opensamizdat.com/posts/chatgpt_…
