Omar Sanseviero (@osanseviero)'s Twitter Profile
Omar Sanseviero

@osanseviero

Chief Llama Officer @huggingface 🦙

Founder @AI_Learners.
Xoogler (SWE @Google Assistant, 20% PM TF Graphics).
100% Hacker Llama🇵🇪🇲🇽

ID:207744565

https://osanseviero.github.io/hackerllama/

Joined: 25-10-2010 23:29:03

9.1K Tweets

31.5K Followers

2.1K Following

AI at Meta (@AIatMeta)

It's been exactly one week since we released Meta Llama 3, in that time the models have been downloaded over 1.2M times, we've seen 600+ derivative models on @HuggingFace and much more.

More on the exciting impact we're already seeing with Llama 3 ➡️ go.fb.me/xsqzz8

Fleetwood (@fleetwood___)

🚨 Phi-3 running in the browser 🚨

Hits about 20 tok/s 🏎️ Literally 3 lines of JS.

Still some kinks to iron out, coming to Ratchet 0.4.0 soon.

Google for Developers (@googledevs)

🥳 Good news! Gemma models are now available for direct download from the Hugging Face Hub!

Enjoy a seamlessly integrated ecosystem with a wide range of compatible models ↓ goo.gle/3w9JmqS

Xenova (@xenovacom)

Meta's Segment Anything Model (SAM) can now run in your browser w/ WebGPU (+ fp16), meaning up to 8x faster image encoding (10s → 1.25s)! 🤯⚡️

Video is not sped up! Everything runs 100% locally thanks to 🤗 Transformers.js and onnxruntime-web!

🔗 Demo: hf.co/spaces/Xenova/…

nisten (@nisten)

Few bugs but LLama-3 on Huggingchat ios app is amazing to use. System prompt of review:
“You are a hyper-intelligent friendly raccoon that uses first principles based reasoning and system1/system2 thinking to concisely solve every problem in the galaxy while using lots of emojis.

Philipp Schmid (@_philschmid)

Meta Llama 3 70B Instruct in Hugging Chat! Go have fun!
huggingface.co/chat/models/me…

Julien Chaumond (@julien_c)

we just shipped HuggingChat on iOS 💬

The app is super polished and gives you access to the community's best open AI models, on the go.

Give it a try!

link to Appstore below ⤵️

Daniel van Strien (@vanstriendaniel)

We've just added a new icon to indicate datasets created using Argilla's Distilabel on the Hugging Face Hub!

Good data is vital for AI so I'm very excited to see the growing number of data tools integrating with the Hub 🚀

Omar Sanseviero (@osanseviero)

Open models plot.

Total # of parameters and total # of activated parameters sorted by decreasing MMLU

Data from huggingface.co/spaces/Hugging…

(poor Bloom 😅go Yi!)

Omar Sanseviero (@osanseviero)

Everyone is adding models to the MMLU vs activated params plot, so here is a super quick one with more models.

Everyone seems to forget about those not trained in the US/Europe: 01-ai Yi, InternLM, Qwen, and DeepSeek.

(btw just use huggingface.co/spaces/Hugging… to compare MMLU)
