Parul Pandey (@pandeyparul) 's Twitter Profile
Parul Pandey

@pandeyparul

Author | Principal Data Scientist @h2oai | @kaggle Grandmaster(Notebooks) | Mom

ID: 232607343

linkhttps://linktr.ee/parulpandey calendar_today31-12-2010 17:17:13

1,1K Tweet

7,7K Takipçi

1,1K Takip Edilen

Anna Zink (@annalzink) 's Twitter Profile Photo

There are good arguments for removing race from medical algorithms, but there may be unintended consequences. Our PNAS paper finds that race-blind algorithms can *worsen* racial inequalities, bc they can't adjust for racial disparities in data quality. shorturl.at/7ugW5

Parul Pandey (@pandeyparul) 's Twitter Profile Photo

Extremely excited to share that ‘Machine Learning for High Risk Applications’ is now also available in Korean 🇰🇷. Thanks to the team at O'Reilly.

Extremely excited to share that ‘Machine Learning for High Risk Applications’ is now also available in Korean 🇰🇷. Thanks to the team at O'Reilly.
Parul Pandey (@pandeyparul) 's Twitter Profile Photo

Machine Learning Street Talk has been dropping some seriously cool interviews back-to-back which I’ll highly recommend: 1️⃣ Prof. Subbarao Kambhampati (కంభంపాటి సుబ్బారావు) spilling the tea on ChatGPT’s reasoning skills (or lack thereof). 2️⃣ Sayash Kapoor on AI existential risk and government policy 3️⃣ Sara Hooker

Machine Learning Street Talk has been dropping some seriously cool interviews back-to-back which I’ll highly recommend: 
1️⃣ Prof. <a href="/rao2z/">Subbarao Kambhampati (కంభంపాటి సుబ్బారావు)</a>  spilling the tea on ChatGPT’s reasoning skills (or lack thereof). 
2️⃣ <a href="/sayashk/">Sayash Kapoor</a> on AI existential risk and government policy
3️⃣ <a href="/sarahookr/">Sara Hooker</a>
Parul Pandey (@pandeyparul) 's Twitter Profile Photo

Excellent paper reinforcing Goodhart's Law: When a measure becomes a target, it ceases to be a good measure. The authors unveil the "benchmark blindspot" where contaminated test data inflates LLM scores. Their solution? A "retro-holdout" dataset creation method, applied to

Excellent paper reinforcing Goodhart's Law: When a measure becomes a target, it ceases to be a good measure. The authors unveil the "benchmark blindspot" where contaminated test data inflates LLM scores. Their solution? A "retro-holdout" dataset creation method, applied to
Parul Pandey (@pandeyparul) 's Twitter Profile Photo

Finally got my hands on a book that’s been on my radar for so long 😀! Why Machines Learn by Anil Ananthaswamy beautifully blends storytelling with intuitive explanations of technical machine learning concepts, combined with interesting anecdotes about the journey from Rosenblatt’s

Finally got my hands on a book that’s been on my radar for so long 😀! Why Machines Learn by <a href="/anilananth/">Anil Ananthaswamy</a>  beautifully blends storytelling with intuitive explanations of technical machine learning concepts, combined with interesting anecdotes about the journey from Rosenblatt’s
Parul Pandey (@pandeyparul) 's Twitter Profile Photo

This is a great example of why writing publicly is so valuable. From my own journey, I can attest that writing allows you to share your learnings, build a network, and also learn along the way. Congrats Leonie 👏

Parul Pandey (@pandeyparul) 's Twitter Profile Photo

This is a great introductory video by Yann Dubois on building LLMs. It covers often-overlooked aspects like evaluation, costs, and data, and not just architecture. It’s part of Stanford’s CS229 Machine Learning course for Summer 2024. Link: youtube.com/watch?v=9vM4p9…

This is a great introductory video by <a href="/yanndubs/">Yann Dubois</a>  on building LLMs. It covers often-overlooked aspects like evaluation, costs, and data, and not just architecture. It’s part of Stanford’s CS229 Machine Learning course for Summer 2024.

Link: youtube.com/watch?v=9vM4p9…
Parul Pandey (@pandeyparul) 's Twitter Profile Photo

The debate between open-source and closed-source AI has been gaining a lot of attention lately, but H2O.ai has been pioneering open-source AI for over a decade. Our CEO shared a great blog post in April this year, where he explains our long-standing commitment to open-source

The debate between open-source and closed-source AI has been gaining a lot of attention lately, but <a href="/h2oai/">H2O.ai</a>  has been pioneering open-source AI for over a decade.

Our CEO shared a great blog post in April this year, where he explains our long-standing commitment to open-source
Yuntian Deng (@yuntiandeng) 's Twitter Profile Photo

We're providing free access to OpenAI's new o1 reasoning model through our WildChat chatbot: 🔗huggingface.co/spaces/yuntian… Also, proud that WildChat was referenced in the safety evaluation for o1!

Parul Pandey (@pandeyparul) 's Twitter Profile Photo

This book is a visual treat. It covers topics like embeddings, attention mechanisms, fine-tuning techniques, LoRA, quantization - all illustrated with colorful graphics that make it a joy to read.

This book is a visual treat. It covers topics like embeddings, attention mechanisms, fine-tuning techniques, LoRA, quantization -  all illustrated with colorful graphics that make it a joy to read.
Cyril Zakka, MD (@cyrilzakka) 's Twitter Profile Photo

I mean no disrespect to any of the authors in this thread but the claims being made are wild and/or taken out of context. The AgentClinic dataset is based on an online dataset (read contamination) I helped create, from a resource aimed at helping medical students study for the