Peter Wang 🦋
@pwang
Chief AI & Co-founder @AnacondaInc; invented @pyscript_dev, @PyData @Bokeh @Datashader. Former physicist. A student of the human condition. bsky: @wang.social
ID:8472272
https://anaconda.com 27-08-2007 22:12:02
39,3K Tweets
48,2K Followers
2,3K Following
Had a lot of fun Data Council. Thanks again to Pete Soderling for the conf and Tristan Zajonc for being an amazing track host. My talk is now on YT. Tho I’m thinking of renaming it to “What? It’s just a file?! 🤯” instead.
youtu.be/9O2pfXkCDmU?si…
I agree with Andrej Karpathy that smaller models would be welcome as well… What is the best way to build a quality 1B-3B model that could be used in an MoE configuration?
Is it best to quantize and distill or extract from a big existing model, or is it better to train up a small model