Ross Wightman(@wightmanr) 's Twitter Profileg
Ross Wightman

@wightmanr

Computer Vision @ 🤗. Ex head of Software, Firmware Engineering at a Canadian 🦄. Currently building ML, AI systems or investing in startups that do it better.

ID:557902603

linkhttp://rwightman.com/ calendar_today19-04-2012 17:34:53

3,9K Tweet

18,6K Takipçi

1,1K Takip Edilen

Follow People
Ali Hassani(@AliHassaniJr) 's Twitter Profile Photo

Fused neighborhood attention now supports backward pass! Upgrade today to get all the features and all the speed. Get up to 844%, 385% and 447% improvement in 1D/2D/3D forward+backward pass at the op level. Training w/ spatio-temporal attention should be a breeze now.

Fused neighborhood attention now supports backward pass! Upgrade today to get all the features and all the speed. Get up to 844%, 385% and 447% improvement in 1D/2D/3D forward+backward pass at the op level. Training w/ spatio-temporal attention should be a breeze now.
account_circle
Leo Tronchon(@LeoTronchon) 's Twitter Profile Photo

With no changes from Llama2 to Llama3 architecture, the importance of data has never been so clear 🦙
While Idefics2 was heavily advertised, our efforts on data didn't shine ☀️ as much, so here are the 3 most important data updates we did:
1. 🍵 The Cauldron, a compilation of 50…

With no changes from Llama2 to Llama3 architecture, the importance of data has never been so clear 🦙 While Idefics2 was heavily advertised, our efforts on data didn't shine ☀️ as much, so here are the 3 most important data updates we did: 1. 🍵 The Cauldron, a compilation of 50…
account_circle
clem 🤗(@ClementDelangue) 's Twitter Profile Photo

Not everyone is aware but the fact that llama3 has been added to HF on day one means that right away, you can easily run it on @aws Sagemaker, @microsoft AzureML, Google Cloud Vertex and the Hugging Face hosted solutions (autotrain, spaces, serverless and dedicated endpoints).

Not everyone is aware but the fact that llama3 has been added to HF on day one means that right away, you can easily run it on @aws Sagemaker, @microsoft AzureML, @googlecloud Vertex and the @huggingface hosted solutions (autotrain, spaces, serverless and dedicated endpoints).
account_circle
Leo Tronchon(@LeoTronchon) 's Twitter Profile Photo

Today we release Idefics2 our newest 8B Vison-Language Model!
💪 With only 8B parameters Idefics is one of the strongest open models out there
📋 We used multiple OCR datasets, including PDFA and IDL from Ross Wightman and Pablo Montalvo, and increased resolution up to 980x980 to improve

Today we release Idefics2 our newest 8B Vison-Language Model! 💪 With only 8B parameters Idefics is one of the strongest open models out there 📋 We used multiple OCR datasets, including PDFA and IDL from @wightmanr and @m_olbap, and increased resolution up to 980x980 to improve
account_circle
Philipp Schmid(@_philschmid) 's Twitter Profile Photo

Introducing Idefics2, the strongest Vision-Language-Model (VLM) < 10B! 🚀 Idefics2 comes with significantly enhanced capabilities in OCR, document understanding, and visual reasoning. 💬📄🖼️

TL;DR;
📚 8B base and instruction variant
🖼️ Image + text inputs ⇒ Text output
📷…

Introducing Idefics2, the strongest Vision-Language-Model (VLM) < 10B! 🚀 Idefics2 comes with significantly enhanced capabilities in OCR, document understanding, and visual reasoning. 💬📄🖼️ TL;DR; 📚 8B base and instruction variant 🖼️ Image + text inputs ⇒ Text output 📷…
account_circle
Vik Paruchuri(@VikParuchuri) 's Twitter Profile Photo

I wrote a blog post on going from not knowing anything about deep learning last year to training state of the art OSS models - vikas.sh/post/how-i-got… .

Hope it helps you.

tldr; read the deep learning book, implemented papers + taught, built open source tools

account_circle
Julien Chaumond(@julien_c) 's Twitter Profile Photo

Did you know that you can now view PDFs hosted on the Hugging Face hub, directly on the hub?

Read here Stas Bekman's Machine Learning Engineering Open Book

Did you know that you can now view PDFs hosted on the @huggingface hub, directly on the hub? Read here @StasBekman's Machine Learning Engineering Open Book
account_circle
Brigitte 🤗(@BrigitteTousi) 's Twitter Profile Photo

“[Open-source AI] is really fundamental because it allows everyone to seize the technology, to diminish the fear of limited understanding or of not being qualified to use AI” - Remi Cadene 🔥🔥
Great piece in Bloomberg Opinion by Parmy Olson on the open-source landscape bloomberg.com/opinion/articl…

account_circle