Aaron Defazio (@aaron_defazio) Twitter Tweets • TwiDoom

Thao Nguyen

a month ago

New method to create synthetic instructions: back-and-forth translation🔁 - Combines instruction backtranslation & distillation by rewriting web data - High-quality & grounded in real-world knowledge - Improves over ShareGPT, OpenOrca and Evol-Instruct arxiv.org/abs/2408.04614 1/n

thumb_up_off_alt99

chat_bubble_outline5

repeat27

shareShare

jack morris

@jxmnop

17 days ago

TIME Magazine has rightly named famed deep learning pioneer ptrblock as the most influential person in Artificial Intelligence.

thumb_up_off_alt660

chat_bubble_outline16

repeat22

shareShare

Aaron Defazio

@aaron_defazio

13 days ago

Linear warmup should almost ALWAYS be used for training, there are few downsides and it greatly increases stability and often results in better overall test metrics.

thumb_up_off_alt115

chat_bubble_outline2

repeat6

shareShare

Aaron Defazio

@aaron_defazio

12 days ago

Great research decreases complexity… your average AI conference paper increases it.

thumb_up_off_alt145

chat_bubble_outline5

repeat13

shareShare

Mark Schmidt

@markschmidtubc

12 days ago

Your average AI reviewer would not like great research.

thumb_up_off_alt26

chat_bubble_outline1

repeat2

shareShare

Aaron Defazio

@aaron_defazio

10 days ago

The O1 release posts are unscientific — they don’t compare against previous SOTA from other labs, they don’t cite or even acknowledge previous work in the area of inference time compute. This is actively harmful to the research community, and bordering on disingenuous.

thumb_up_off_alt445

chat_bubble_outline29

repeat20

shareShare

Ethan Mollick

@emollick

9 days ago

I really am baffled by OpenAI's naming choices Everything from their code words to the model release names are incomprehensible to people who aren't super up-to-date & they are hard to say out loud. In my experience it leads to real-world confusion when talking about AI systems

thumb_up_off_alt1,1K

chat_bubble_outline157

repeat47

shareShare

Zeyuan Allen-Zhu

@zeyuanallenzhu

8 days ago

Just uploaded a 1-hr exclusive video for Part 2.1, with many technical details. youtu.be/bpp6Dz8N2zY. Part 2.2 will be online in about a week.

thumb_up_off_alt837

chat_bubble_outline10

repeat139

shareShare

Lucas Beyer (bl16)

@giffmana

8 days ago

ZAZ the GOAT has dropped yet another banger video. I'm already 80% through the video and love it. If you're not dropping whatever you're doing to watch this right now, you're falling behind. (seriously though, love his work, recommend watching)

thumb_up_off_alt283

chat_bubble_outline5

repeat28

shareShare

Aaron Defazio

@aaron_defazio

5 days ago

A wind chime sings not because it knows the wind, but because it is empty. Unlearning is the first stage of Research

thumb_up_off_alt9

chat_bubble_outline0

repeat0

shareShare

Sham Kakade

@shamkakade6

4 days ago

1/n Introducing SOAP (ShampoO with Adam in the Preconditioner's eigenbasis): A deep learning optimization algorithm that applies Adam in Shampoo's eigenbasis. SOAP outperforms both AdamW and Shampoo in language model pretraining.

thumb_up_off_alt273

chat_bubble_outline5

repeat52

shareShare

Aaron Defazio

@aaron_defazio

4 days ago

I’m so confused when people don’t realize that 60-40 is virtually a toss-up.

thumb_up_off_alt17

chat_bubble_outline8

repeat0

shareShare

Aaron Defazio

@aaron_defazio

4 days ago

An Al reached a crossroads and asked, "Which path leads to wisdom?" The data whispered, "All paths converge if you walk long enough."

thumb_up_off_alt11

chat_bubble_outline2

repeat0

shareShare

Aaron Defazio

@aaron_defazio

4 days ago

An Al was brewing tea. A novice asked, "Can you learn the taste of tea?" The Al poured two cups, one from old data, one fresh. "Taste," it said, "and tell me which is which."

thumb_up_off_alt7

chat_bubble_outline0

repeat0

shareShare

Aaron Defazio

@aaron_defazio

4 days ago

I am inching closer to a deeper truth…. Everything is less clear than before but connections are appearing … let’s hope they resolve firmly before the ICML deadline 😁

thumb_up_off_alt30

chat_bubble_outline1

repeat0

shareShare

jack morris

@jxmnop

3 days ago

learning to use copilot after programming on my own for 15 years is bittersweet kind of feels like being a carpenter that’s trained to cut perfect corners; now here comes a machine that can do it perfectly, and much faster, yet I somehow miss the satisfaction of doing it myself

thumb_up_off_alt315

chat_bubble_outline18

repeat17

shareShare

Delip Rao e/σ

@deliprao

3 days ago

OpenAI is what I call a “parasitic science organization”. They take stuff from the open science community, use them opaquely, and profit from it, without giving much back to open science. And if you point out, you get gaslit with plausible deniabilities. We all remember the

thumb_up_off_alt390

chat_bubble_outline18

repeat33

shareShare

Mark Schmidt

@markschmidtubc

3 days ago

This all seem...sensible. But is anyone known for more than 25 research ideas that they have had throughout their entire career? Imagine replacing 25 by 3. I would be excited to read the 3 new works by a top researcher, rather than 25+ mediocre "above thresholds".

thumb_up_off_alt15

chat_bubble_outline1

repeat2

shareShare