Furong Huang (@furongh) 's Twitter Profile
Furong Huang

@furongh

Associate professor of @umdcs @umiacs @ml_umd at UMD. Researcher in #AI/#ML, AI #Alignment, #RLHF, #Trustworthy ML, #EthicalAI, AI #Democratization, AI for ALL.

ID: 195674678

linkhttps://furong-huang.com/ calendar_today27-09-2010 09:11:38

1,1K Tweet

5,5K Followers

2,2K Following

Ahmad Beirami (@abeirami) 's Twitter Profile Photo

Dear reviewers: Please engage with author rebuttal! Please read & ask clarifying questions as needed. If your score remains unchanged after rebuttal, please provide brief feedback as well on why! (Junior) authors put a ton of effort in the last week! Let's not despair them.

Furong Huang (@furongh) 's Twitter Profile Photo

RLHF is helpful when the model is very good, and you just need to collect user feedback to “patch things.” RL is indeed important for more complex tasks, like reasoning and planning. Having LLMs in the RL loop would be helpful for generalization and “warm-start” RL.

Miguel Vasco (@omiguelvasco) 's Twitter Profile Photo

Can we train vision-based reinforcement learning agents that achieve super-human performance against human players in racing games? Check out our latest work to be presented at RL_Conference! Paper: rlj.cs.umass.edu/2024/papers/Pa… Website: ai.sony/publications/A… A very fast 🧵(1/7)

Ahmad Beirami (@abeirami) 's Twitter Profile Photo

Token-wise MDPs/RL has recently found lots of interest in alignment, including - Our controlled decoding (arxiv.org/abs/2310.17022) -Souradip Chakraborty's transfer Q-star (arxiv.org/abs/2405.20495) -Xiangyu Qi's token-wise constrained fine-tuning (arxiv.org/abs/2406.05946)

Glen Berseth (@glenberseth) 's Twitter Profile Photo

Andy Barto answering a question RL_Conference that I have been thinking about for 2 years. Turns out ML (supervised learning) is a special case of RL we constructed for some reason...

Andy Barto answering a question  <a href="/RL_Conference/">RL_Conference</a> that I have been thinking about for 2 years. Turns out ML (supervised learning) is a special case of RL we constructed for some reason...
Furong Huang (@furongh) 's Twitter Profile Photo

Come and talk to Yuhang Zhou about Mementos at #ACL2024NLP. 📅 Aug 13, 4 pm at Poster Session 5. ACL 2024 🤩Check out our dataset on Huggingface 👉 huggingface.co/datasets/furon…

ICLR 2025 (@iclr_conf) 's Twitter Profile Photo

We are now accepting nominations for reviewers and ACs at ICLR 2025. To nominate yourself or someone else, please complete this form: forms.gle/VKmG1DJgWzKTY9…

Ananda Theertha Suresh (@th33rtha) 's Twitter Profile Photo

Excited to share the slides of the 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐦𝐨𝐝𝐞𝐥 𝐢𝐧𝐟𝐞𝐫𝐞𝐧𝐜𝐞: 𝐭𝐡𝐞𝐨𝐫𝐲 & 𝐚𝐥𝐠𝐨𝐫𝐢𝐭𝐡𝐦𝐬 tutorial that Ahmad Beirami and I gave at #ISIT2024. We covered basics of 𝘪𝘯𝘧𝘦𝘳𝘦𝘯𝘤𝘦, 𝘢𝘭𝘪𝘨𝘯𝘮𝘦𝘯𝘵 and 𝘦𝘧𝘧𝘪𝘤𝘪𝘦𝘯𝘤𝘺 from a theoretical lens.

Excited to share the slides of the 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐦𝐨𝐝𝐞𝐥 𝐢𝐧𝐟𝐞𝐫𝐞𝐧𝐜𝐞: 𝐭𝐡𝐞𝐨𝐫𝐲 &amp; 𝐚𝐥𝐠𝐨𝐫𝐢𝐭𝐡𝐦𝐬 tutorial that <a href="/abeirami/">Ahmad Beirami</a> and I gave at #ISIT2024. We covered basics of 𝘪𝘯𝘧𝘦𝘳𝘦𝘯𝘤𝘦, 𝘢𝘭𝘪𝘨𝘯𝘮𝘦𝘯𝘵 and 𝘦𝘧𝘧𝘪𝘤𝘪𝘦𝘯𝘤𝘺 from a theoretical lens.
Furong Huang (@furongh) 's Twitter Profile Photo

Thought experiment: Give an idea to students and the AI Scientist, then wait a whole month (which feels like eternity these days 🤡). Who'll write the better paper? Spoiler: Humans might need to start applying for internships with their AI overlords. 🤣 #AI #AcademicChallenge

Jordan Boyd-Graber (@boydgraber) 's Twitter Profile Photo

I'm bummed that family obligations prevented me from presenting this epic paper. This work represented a long journey for me. I first began working on the language of Diplomacy in 2015, and I struggled for years to get funding to build a bot that could play it ...

I'm bummed that family obligations prevented me from presenting this epic paper.  This work represented a long journey for me.  I first began working on the language of Diplomacy in 2015, and I struggled for years to get funding to build a bot that could play it ...
Vikash Sehwag (@vsehwag_) 's Twitter Profile Photo

It only took 10 years! From barely managing cifar10 to realistic image synthesis. The contrast also relates to my last ten years of research work: From barely knowing about deep learning in undergraduate to actively contributing to this field. I imagine soon we will be able to

It only took 10 years!
From barely managing cifar10 to realistic image synthesis.

The contrast also relates to my last ten years of research work: From barely knowing about deep learning in undergraduate to actively contributing to this field.

I imagine soon we will be able to
John Langford (@johnclangford) 's Twitter Profile Photo

We are doing a post-covid restart of the NYC ML Symposium (events.nyas.org/event/ml2024/s… ) on Oct 18th. It's a great chance to meet folks in the area, so drop by if interested. Abstracts for posters are due Sept 6th, in about 10 days. (These are super quick & easy.)

Furong Huang (@furongh) 's Twitter Profile Photo

Registration is open for the 15th Annual Machine Learning Symposium on Oct. 18! Join NYAS in NYC for presentations, short talks, & poster sessions showcasing the latest in #ML research. Register now for early bird pricing or submit an abstract: bit.nyas.org/3YeLtFF

Registration is open for the 15th Annual Machine Learning Symposium on Oct. 18! Join <a href="/NYASciences/">NYAS</a> in NYC for presentations, short talks, &amp; poster sessions showcasing the latest in #ML research. Register now for early bird pricing or submit an abstract: bit.nyas.org/3YeLtFF
Furong Huang (@furongh) 's Twitter Profile Photo

To all my students: If sending funny videos and cute pics is ‘pebbling’, then consider me your dedicated penguin 🐧, sprinkling your inbox with intellectual pebbles ❤️! “Enjoyed reading this paper just came out today—thought you might too!” “This blog is visionary—perfect for