Vaibhav (VB) Srivastav (@reach_vb) 's Twitter Profile
Vaibhav (VB) Srivastav

@reach_vb

GPU poor @Huggingface | F1 fan | Here for @at_sofdog’s wisdom | *opinions my own

ID: 874987512850128897

linkhttps://huggingface.co calendar_today14-06-2017 13:50:54

5,5K Tweet

15,15K Takipçi

220 Takip Edilen

Vaibhav (VB) Srivastav (@reach_vb) 's Twitter Profile Photo

Mistral released Pixtral 12B Vision Language Model 🔥 Some notes on the release: 1. Text backbone: Mistral Nemo 12B 2. Vision Adapter: 400M 3. Uses GeLU (for vision adapter) & 2D RoPE (for vision encoder) 4. Larger vocabulary - 131,072 5. Three new special tokens - `img`,

Mistral released Pixtral 12B Vision Language Model 🔥
Some notes on the release:

1. Text backbone: Mistral Nemo 12B
2. Vision Adapter: 400M
3. Uses GeLU (for vision adapter) & 2D RoPE (for vision encoder)
4. Larger vocabulary - 131,072
5. Three new special tokens  - `img`,