@armenagha : If you were interested in my cryptic posts on how to train Chameleon-like models up to 4x faster, check out our MoMa paper which covers a detailed overview of most of our architectural improvements. tl;dr adaptive compute in 3-dim, modality, width, depth. • TwiDoom

Armen Aghajanyan

@armenagha

+ Follow

ex-RS FAIR/MSFT

ID: 1515424688

calendar_today14-06-2013 05:43:07

591 Tweet

11,11K Takipçi

266 Takip Edilen

Armen Aghajanyan

@armenagha

2 months ago

If you were interested in my cryptic posts on how to train Chameleon-like models up to 4x faster, check out our MoMa paper which covers a detailed overview of most of our architectural improvements. tl;dr adaptive compute in 3-dim, modality, width, depth.

thumb_up_off_alt216

chat_bubble_outline3

repeat24

shareShare