Madhur Panwar (@mdrpanwar)'s Twitter Profile
Madhur Panwar

@mdrpanwar

Research Fellow at @MSFTResearch | Previously at @Adobe, Speech Lab @NTUsg | NLP | Math + CS @bitspilaniindia

ID: 1915027248

Website: https://mdrpanwar.github.io

Joined: 28-09-2013 19:12:01

2 Tweets

39 Followers

219 Following

Is it possible to infer the information content of LLMs' activations w/o intervention experiments? 🤔 Absolutely! ✅ We propose a method to map activations to specific input subsets, revealing the encoded information w/o altering the model. 🌟 See the linked 🧵 for more.