NeelNanda5 : Great work from my MATS scholars @calsmcdougall an • TwiCopy

Neel Nanda

@NeelNanda5

+ Follow

Mechanistic Interpretability lead @DeepMind. Formerly @AnthropicAI, independent. In this to reduce AI X-risk. Neural networks can be understood, let's go do it!

calendar_today30-06-2022 15:18:58

1,8K Tweets

13,4K Followers

89 Following

Neel Nanda

@NeelNanda5

4 weeks ago

Great work from my MATS scholars Callum McDougall and Joseph Bloom, in honour of today's special occasion!

Turns out SAEs contain wild features, like a Neel Nanda feature, and this perseverance feature:
lesswrong.com/posts/BK8AMsNH…

Great work from my MATS scholars @calsmcdougall and @JBloomAus, in honour of today's special occasion! Turns out SAEs contain wild features, like a Neel Nanda feature, and this perseverance feature: lesswrong.com/posts/BK8AMsNH…

account_circle