Itay Itzhak (@itay_itzhak_) 's Twitter Profile
Itay Itzhak

@itay_itzhak_

NLProc, deep learning, and machine learning. Ph.D. student @TechnionLive and @HebrewU

ID: 1195653141934542848

linkhttp://itay1itzhak.github.io calendar_today16-11-2019 10:41:51

81 Tweet

229 Followers

202 Following

Moran Mizrahi (@moranmiz) 's Twitter Profile Photo

🚀 Excited to share our latest paper about the sensitivity of LLMs to prompts! arxiv.org/abs/2401.00595 Our work may partly explain why some models seem less accurate than their formal evaluation may suggest. 🧐 Guy Kaplan, Dan H.M 🎗, Rotem Dror, Hyadata Lab (Dafna Shahaf), Gabriel Stanovsky

🚀 Excited to share our latest paper about the sensitivity of LLMs to prompts!
arxiv.org/abs/2401.00595

Our work may partly explain why some models seem less accurate than their formal evaluation may suggest. 🧐

<a href="/guymkaplan/">Guy Kaplan</a>, <a href="/malk_dan/">Dan H.M 🎗</a>, <a href="/DrorRotem/">Rotem Dror</a>, <a href="/HyadataLab/">Hyadata Lab (Dafna Shahaf)</a>, <a href="/GabiStanovsky/">Gabriel Stanovsky</a>
Adi Simhi (@adisimhi) 's Twitter Profile Photo

Excited to share our new paper- Constructing Benchmarks and Interventions for Combating Hallucinations in LLMs Check out our work here: arxiv.org/abs/2404.09971 This work was done with Jonathan Herzig Idan Szpektor Yonatan Belinkov

Asaf Yehudai (@asafyehudai) 's Twitter Profile Photo

🎉Excited to share our paper When LLMs are Unfit, Use FastFit: Fast and Effective Text Classification with Many Classes was accepted to NAACL HLT 2024! 🚀SOTA results ⚡Fast training & inference 🎯High accuracy 📄Paper:arxiv.org/abs/2404.12365 💻Package:github.com/IBM/fastfit

🎉Excited to share our paper 

When LLMs are Unfit, Use FastFit: Fast and Effective Text Classification with Many Classes

was accepted to <a href="/naaclmeeting/">NAACL HLT 2024</a>!

🚀SOTA results
⚡Fast training &amp; inference
🎯High accuracy
📄Paper:arxiv.org/abs/2404.12365
💻Package:github.com/IBM/fastfit
Adi Simhi (@adisimhi) 's Twitter Profile Photo

LLMs are often said to "hallucinate", "confabulate", or produce untruthful responses, which led to much work trying to mitigate such behavior. But what does it mean for an LM to hallucinate? And how can we effectively intervene in model internals to combat hallucinations?

Michael Toker (@michael_toker) 's Twitter Profile Photo

What if we could visualize language models’ computation process - with images? Introducing Diffusion Lens: a method for peeking into the internals of the text encoders of text-to-image pipelines. arxiv.org/abs/2403.05846 Demo: huggingface.co/spaces/tokeron… [1/9]

What if we could visualize language models’ computation process - with images? 
Introducing Diffusion Lens: a method for peeking into the internals of the text encoders of text-to-image pipelines.
arxiv.org/abs/2403.05846
Demo: huggingface.co/spaces/tokeron…
[1/9]
Zachary Bamberger (@zacharybamberg1) 's Twitter Profile Photo

Paper release 🧵: We (Yonatan Belinkov , Chaim Baskin, Ofek Glick and I) are proud to introduce DEPTH: Discourse Education through Pre-Training Hierarchically. Code: github.com/zbambergerNLP/… Paper: arxiv.org/abs/2405.07788

Hadas Orgad (@orgadhadas) 's Twitter Profile Photo

Our paper Diffusion Lens got accepted to #ACL2024 main conference! 🌴⭐️ Visualize LLMs computation process with our live demo >> huggingface.co/spaces/tokeron… For a quick TL;DR checkout Michael Toker's thread or project website - tokeron.github.io/DiffusionLensW…

Dana Arad 🎗️ (@dana_arad4) 's Twitter Profile Photo

Excited to share Diffusion Lens got accepted to #ACL2024 main conference!  🎉 Check out our demo: huggingface.co/spaces/tokeron… and paper: tokeron.github.io/DiffusionLensW…

Gabriel Stanovsky (@gabistanovsky) 's Twitter Profile Photo

Check out SEAM🤐 : a challenging LLM benchmark for multi-doc tasks and a stochastic approach to evaluation which addresses the brittleness of few-shot evaluation. Evaluate your model at seam-benchmark.github.io

Shachar Don-Yehiya (@shachar_don) 's Twitter Profile Photo

Human feedback is critical for language models development 💬, but collecting it is costly 🤑 We find that users naturally include feedback when interacting with chat models, and we can automatically extract it! arxiv.org/abs/2407.10944 W Leshem Choshen 🤖🤗 Omri Abend 🧵👇

Human feedback is critical for language models development 💬, but collecting it is costly 🤑

We find that users naturally include feedback when interacting with chat models, and we can automatically extract it!

arxiv.org/abs/2407.10944

W <a href="/LChoshen/">Leshem Choshen 🤖🤗</a> <a href="/AbendOmri/">Omri Abend</a> 🧵👇
Ori Yoran (@oriyoran) 's Twitter Profile Photo

Can AI agents solve realistic, time-consuming web tasks such as “Which gyms near me have fitness classes on the weekend, before 7AM?" We introduce AssistantBench, a benchmark with 214 such tasks. Our new GPT-4 based agent gets just 25% accuracy! assistantbench.github.io

David Bau (@davidbau) 's Twitter Profile Photo

Time to study #llama3 405b, but gosh it's big! Please retweet: if you have a great experiment but not enough GPU, here is an opportunity to apply for shared #NDIF research resources. Deadline July 30: ndif.us/405b.html You'll help NDIF test, we'll help you run 405b

Itay Itzhak (@itay_itzhak_) 's Twitter Profile Photo

Are you in #Bangkok? Come to hear about our #TACL paper on emergent cognitive biases in LLMs! Let's chat at: - Convention Center A1 (Poster) – Today, 16:00 - Lotus Suite 5-7 (Talk) – Tuesday, 11:15 Looking forward to geeking out on biases, LLMs, and more! 🚀 #ACL2024 #NLProc

Are you in #Bangkok? Come to hear about our #TACL paper on emergent cognitive biases in LLMs! Let's chat at:

- Convention Center A1 (Poster) – Today, 16:00
- Lotus Suite 5-7 (Talk) – Tuesday, 11:15 

Looking forward to geeking out on biases, LLMs, and more! 🚀 #ACL2024 #NLProc
Itay Itzhak (@itay_itzhak_) 's Twitter Profile Photo

Had a blast presenting at #ACL2024! Thanks to everyone who joined the discussions on LMs biases. Stay tuned for more on the origins of biases - we've got exciting work coming! 🔍 #NLProc

Dana Arad 🎗️ (@dana_arad4) 's Twitter Profile Photo

Excited about pursuing a graduate degree in AI or Machine Learning? In just two weeks, you have a chance to hear about the research happening in our lab and chat with students and faculty 👩🏻‍🎓

Excited about pursuing a graduate degree in AI or Machine Learning? In just two weeks, you have a chance to hear about the research happening in our lab and chat with students and faculty 👩🏻‍🎓