Furong Huang (@furongh) Twitter Tweets • TwiDoom

Furong Huang

@furongh

a month ago

It was a lot of fun catching up with folks at MSR-NYC! Thanks for the invitation John Langford!

thumb_up_off_alt33

chat_bubble_outline0

repeat0

shareShare

Dear reviewers: Please engage with author rebuttal! Please read & ask clarifying questions as needed. If your score remains unchanged after rebuttal, please provide brief feedback as well on why! (Junior) authors put a ton of effort in the last week! Let's not despair them.

thumb_up_off_alt91

chat_bubble_outline2

repeat14

shareShare

Furong Huang

@furongh

a month ago

RLHF is helpful when the model is very good, and you just need to collect user feedback to “patch things.” RL is indeed important for more complex tasks, like reasoning and planning. Having LLMs in the RL loop would be helpful for generalization and “warm-start” RL.

thumb_up_off_alt18

chat_bubble_outline0

repeat2

shareShare

Miguel Vasco

@omiguelvasco

a month ago

Can we train vision-based reinforcement learning agents that achieve super-human performance against human players in racing games? Check out our latest work to be presented at RL_Conference! Paper: rlj.cs.umass.edu/2024/papers/Pa… Website: ai.sony/publications/A… A very fast 🧵(1/7)

thumb_up_off_alt34

chat_bubble_outline1

repeat10

shareShare

Ahmad Beirami

@abeirami

a month ago

Token-wise MDPs/RL has recently found lots of interest in alignment, including - Our controlled decoding (arxiv.org/abs/2310.17022) -Souradip Chakraborty's transfer Q-star (arxiv.org/abs/2405.20495) -Xiangyu Qi's token-wise constrained fine-tuning (arxiv.org/abs/2406.05946)

thumb_up_off_alt15

chat_bubble_outline0

repeat5

shareShare

Pablo Samuel Castro

@pcastr

a month ago

Third keynote by Andy Barto RL_Conference , arguing that it was always RL, with a standing ovation at the end!

Third keynote by Andy Barto <a href="/RL_Conference/">RL_Conference</a> , arguing that it was always RL, with a standing ovation at the end!

thumb_up_off_alt174

chat_bubble_outline4

repeat17

shareShare

Glen Berseth

@glenberseth

a month ago

Andy Barto answering a question RL_Conference that I have been thinking about for 2 years. Turns out ML (supervised learning) is a special case of RL we constructed for some reason...

Andy Barto answering a question <a href="/RL_Conference/">RL_Conference</a> that I have been thinking about for 2 years. Turns out ML (supervised learning) is a special case of RL we constructed for some reason...

thumb_up_off_alt158

chat_bubble_outline4

repeat17

shareShare

Furong Huang

@furongh

a month ago

Really wish I could have been there. It looks like everyone had a fantastic time at the conference!

thumb_up_off_alt14

chat_bubble_outline3

repeat0

shareShare

Pablo Samuel Castro

@pcastr

a month ago

Really nice initiative by Ben Eysenbach , who prepared these posters (hung around RL_Conference ) of notable women in RL !

Really nice initiative by <a href="/ben_eysenbach/">Ben Eysenbach</a> , who prepared these posters (hung around <a href="/RL_Conference/">RL_Conference</a> ) of notable women in RL !

thumb_up_off_alt249

chat_bubble_outline4

repeat41

shareShare

Furong Huang

@furongh

a month ago

Come and talk to Yuhang Zhou about Mementos at #ACL2024NLP. 📅 Aug 13, 4 pm at Poster Session 5. ACL 2024 🤩Check out our dataset on Huggingface 👉 huggingface.co/datasets/furon…

thumb_up_off_alt25

chat_bubble_outline1

repeat10

shareShare

ICLR 2025

@iclr_conf

a month ago

We are now accepting nominations for reviewers and ACs at ICLR 2025. To nominate yourself or someone else, please complete this form: forms.gle/VKmG1DJgWzKTY9…

thumb_up_off_alt185

chat_bubble_outline3

repeat49

shareShare

Ananda Theertha Suresh

@th33rtha

a month ago

Excited to share the slides of the 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐦𝐨𝐝𝐞𝐥 𝐢𝐧𝐟𝐞𝐫𝐞𝐧𝐜𝐞: 𝐭𝐡𝐞𝐨𝐫𝐲 & 𝐚𝐥𝐠𝐨𝐫𝐢𝐭𝐡𝐦𝐬 tutorial that Ahmad Beirami and I gave at #ISIT2024. We covered basics of 𝘪𝘯𝘧𝘦𝘳𝘦𝘯𝘤𝘦, 𝘢𝘭𝘪𝘨𝘯𝘮𝘦𝘯𝘵 and 𝘦𝘧𝘧𝘪𝘤𝘪𝘦𝘯𝘤𝘺 from a theoretical lens.

Excited to share the slides of the 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐦𝐨𝐝𝐞𝐥 𝐢𝐧𝐟𝐞𝐫𝐞𝐧𝐜𝐞: 𝐭𝐡𝐞𝐨𝐫𝐲 & 𝐚𝐥𝐠𝐨𝐫𝐢𝐭𝐡𝐦𝐬 tutorial that <a href="/abeirami/">Ahmad Beirami</a> and I gave at #ISIT2024. We covered basics of 𝘪𝘯𝘧𝘦𝘳𝘦𝘯𝘤𝘦, 𝘢𝘭𝘪𝘨𝘯𝘮𝘦𝘯𝘵 and 𝘦𝘧𝘧𝘪𝘤𝘪𝘦𝘯𝘤𝘺 from a theoretical lens.

thumb_up_off_alt64

chat_bubble_outline3

repeat12

shareShare

Furong Huang

@furongh

a month ago

Thought experiment: Give an idea to students and the AI Scientist, then wait a whole month (which feels like eternity these days 🤡). Who'll write the better paper? Spoiler: Humans might need to start applying for internships with their AI overlords. 🤣 #AI #AcademicChallenge

thumb_up_off_alt10

chat_bubble_outline1

repeat0

shareShare

Jordan Boyd-Graber

@boydgraber

a month ago

I'm bummed that family obligations prevented me from presenting this epic paper. This work represented a long journey for me. I first began working on the language of Diplomacy in 2015, and I struggled for years to get funding to build a bot that could play it ...

thumb_up_off_alt70

chat_bubble_outline4

repeat10

shareShare

Vikash Sehwag

@vsehwag_

a month ago

It only took 10 years! From barely managing cifar10 to realistic image synthesis. The contrast also relates to my last ten years of research work: From barely knowing about deep learning in undergraduate to actively contributing to this field. I imagine soon we will be able to

thumb_up_off_alt90

chat_bubble_outline0

repeat10

shareShare

Furong Huang

@furongh

a month ago

Twitter friends, any recommendations for an intro/survey to probabilistic sampling?

thumb_up_off_alt9

chat_bubble_outline1

repeat0

shareShare

John Langford

@johnclangford

22 days ago

We are doing a post-covid restart of the NYC ML Symposium (events.nyas.org/event/ml2024/s… ) on Oct 18th. It's a great chance to meet folks in the area, so drop by if interested. Abstracts for posters are due Sept 6th, in about 10 days. (These are super quick & easy.)

thumb_up_off_alt134

chat_bubble_outline8

repeat26

shareShare

Furong Huang

@furongh

15 days ago

Giving a keynote on Oct 18th, 2024. Looking forward to catching up with old friends and meeting new ones!

thumb_up_off_alt9

chat_bubble_outline0

repeat2

shareShare

Furong Huang

@furongh

15 days ago

Registration is open for the 15th Annual Machine Learning Symposium on Oct. 18! Join NYAS in NYC for presentations, short talks, & poster sessions showcasing the latest in #ML research. Register now for early bird pricing or submit an abstract: bit.nyas.org/3YeLtFF

Registration is open for the 15th Annual Machine Learning Symposium on Oct. 18! Join <a href="/NYASciences/">NYAS</a> in NYC for presentations, short talks, & poster sessions showcasing the latest in #ML research. Register now for early bird pricing or submit an abstract: bit.nyas.org/3YeLtFF

thumb_up_off_alt16

chat_bubble_outline0

repeat3

shareShare

Furong Huang

@furongh

15 days ago

To all my students: If sending funny videos and cute pics is ‘pebbling’, then consider me your dedicated penguin 🐧, sprinkling your inbox with intellectual pebbles ❤️! “Enjoyed reading this paper just came out today—thought you might too!” “This blog is visionary—perfect for

thumb_up_off_alt25

chat_bubble_outline1

repeat1

shareShare