Rob Miles (✈️ Berkeley)

@robertskmiles

Explaining AI Alignment to anyone who'll stand still for long enough, on YouTube and Discord.

Music, movies, microcode, and high-speed pizza delivery

Joined 15-04-2010 04:10:41

11.2K Tweets

16.1K Followers

777 Following

I think most people (quite reasonably) think 'We built ChatGPT, so we must basically understand how it works'.
This is not true at all. Humans did not build ChatGPT. In a way it would be closer to say we 'grew' it. We have basically no idea how it does what it does.

We're like a plane company CEO; we don't know how to build jets, we know how to hire engineers. Being good at paying engineers doesn't cause you to understand how jets actually work, at all. Especially since these 'engineers' never speak a word or document anything.

Their results are bizarre and inhuman. Neel Nanda trained a tiny transformer to do addition, then spent weeks figuring out what it was doing - one of the only times in history someone has understood how a transformer works.
This is the algorithm it created. To *add two numbers*!