Calvin Luo (@calvinyluo) 's Twitter Profile
Calvin Luo

@calvinyluo

PhD Student @BrownUniversity. Former @GoogleAI Resident. @UofT Alum.

ID: 1124164329627058176

Joined: 03-05-2019 04:10:36

28 Tweets

751 Followers

178 Following

Jacky Liang (@jackyliang42) 's Twitter Profile Photo

How can robots perform a wide variety of novel tasks from natural language? Excited to present Code as Policies - using language models to directly write robot policy code from language instructions. See paper, colabs, blog, and demos at code-as-policies.github.io long 🧵👇
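The "language model writes robot policy code" loop described above can be sketched in miniature. Everything here — the `Robot` API, the prompt text, and `query_llm` — is a hypothetical stand-in for illustration, not the actual Code as Policies prompts or APIs (those are in the paper and colabs):

```python
# Minimal sketch: prompt an LLM with an API description plus an instruction,
# then execute the code it returns against the robot API.
PROMPT = """# Available API:
# move_to(x, y): move the gripper to position (x, y)
# grasp(): close the gripper
# Instruction: {instruction}
# Write Python code using only the API above."""

def query_llm(prompt):
    # Placeholder for a real language-model call; returns a canned policy
    # so the sketch runs end-to-end.
    return "move_to(0.5, 0.2)\ngrasp()"

class Robot:
    """Toy robot that records API calls instead of moving hardware."""
    def __init__(self):
        self.log = []
    def move_to(self, x, y):
        self.log.append(("move_to", x, y))
    def grasp(self):
        self.log.append(("grasp",))

def run_instruction(instruction, robot):
    code = query_llm(PROMPT.format(instruction=instruction))
    # Execute the generated policy with only the robot API in scope.
    exec(code, {"move_to": robot.move_to, "grasp": robot.grasp})
    return robot.log

robot = Robot()
print(run_instruction("pick up the red block", robot))
# → [('move_to', 0.5, 0.2), ('grasp',)]
```

Restricting `exec` to the whitelisted API names mirrors the spirit of the approach: the model writes code, but only against the exposed robot primitives.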

Ofir Nachum (@ofirnachum) 's Twitter Profile Photo

My second post is up now (as promised, updates are very infrequent). "Paper Writing: A View from the Trenches" ofirnachum.github.io/posts/view-fro…

Saurabh Saxena (@srbhsxn) 's Twitter Profile Photo

📢 Code, checkpoints and a colab for multitask (object detection, semantic segmentation, keypoint detection and captioning) Pix2Seq are now available! github.com/google-researc… github.com/google-researc… x.com/_akhaliq/statu…

Calvin Luo (@calvinyluo) 's Twitter Profile Photo

An elegant paper that proposes a novel compression objective inspired by minimum description length to discover and learn skills with maximally common structure. Really exciting implications for hierarchical reinforcement learning - I #LOVE this work!

Sangnie Bhardwaj (@sangnie) 's Twitter Profile Photo

Join us at the WiML Un-Workshop breakout session on "Role of Mentorship and Networking"! Do not miss the chance to talk with leading researchers Samy Bengio, Susan Zhang, Hugo Larochelle, Sharon Y. Li, Pablo Samuel Castro, John Langford, and others! #ICML2023 WiML

Archit Sharma (@archit_sharma97) 's Twitter Profile Photo

High-quality human feedback for RLHF is expensive 💰. AI feedback is emerging as a scalable alternative, but are we using AI feedback effectively? Not yet; RLAIF improves perf *only* when LLMs are SFT'd on a weak teacher. Simple SFT on a strong teacher can outperform RLAIF! 🧵->

Nate Gillman (@gillmanlab) 's Twitter Profile Photo

Excited to share our latest preprint: “Self-Correcting Self-Consuming Loops for Generative Model Training”. It's a step towards generative AI models that can learn from the universe of data they generate!! 🤖(1/n)

Zheng-Xin Yong (Yong) (@yong_zhengxin) 's Twitter Profile Photo

Presenting LexC-Gen that generates data for extremely low-resource languages. 🤗 You only need 7B-LLMs and bilingual lexicons. 🔥 Our synthetic data are competitive with expert-translated data on sentiment and topic classification. Paper + Code: batsresearch.github.io/lexcgen/ [1/n]

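The lexicon-based generation idea can be loosely illustrated with a toy word-substitution step. The made-up lexicon, sentences, and `lexicalize` function below are illustrative stand-ins only — LexC-Gen itself conditions an LLM on bilingual lexicon entries rather than doing direct substitution:

```python
# Toy sketch: turn labeled high-resource-language data into synthetic
# low-resource-language data via a bilingual lexicon, keeping the label.
# The target "language" here is invented purely for illustration.
LEXICON = {"the": "to", "food": "kema", "is": "ra", "good": "pai"}

def lexicalize(sentence, lexicon):
    """Substitute each word via the bilingual lexicon, keeping unknowns."""
    return " ".join(lexicon.get(w, w) for w in sentence.lower().split())

labeled = [("the food is good", "positive")]
synthetic = [(lexicalize(s, LEXICON), y) for s, y in labeled]
print(synthetic)  # → [('to kema ra pai', 'positive')]
```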
Aaron Lou (@aaron_lou) 's Twitter Profile Photo

Announcing Score Entropy Discrete Diffusion (SEDD) w/ Chenlin Meng and Stefano Ermon. SEDD challenges the autoregressive language paradigm, beating GPT-2 on perplexity and quality! arXiv: arxiv.org/abs/2310.16834 Code: github.com/louaaron/Score… Blog: aaronlou.com/blog/discrete-… 🧵1/n
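The discrete-diffusion setting that SEDD operates in can be sketched by its forward corruption process: tokens are progressively replaced by an absorbing mask state. This is a generic illustration of absorbing-state discrete diffusion, not SEDD's score-entropy objective:

```python
import random

MASK = "[MASK]"

def corrupt(tokens, t, rng):
    """Forward absorbing-state corruption: each token is independently
    replaced by MASK with probability t (t=0 keeps all, t=1 masks all).
    The reverse model is trained to undo this masking."""
    return [MASK if rng.random() < t else tok for tok in tokens]

rng = random.Random(0)
tokens = ["the", "cat", "sat"]
print(corrupt(tokens, 0.0, rng))  # → ['the', 'cat', 'sat']
print(corrupt(tokens, 1.0, rng))  # → ['[MASK]', '[MASK]', '[MASK]']
```

Unlike autoregressive decoding, generation here runs the corruption in reverse, denoising all positions rather than emitting tokens left to right.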

Haotian Fu (@haotiannnnnnnnn) 's Twitter Profile Photo

Excited to present LAST: A new framework for discovering skills from demonstrations! By combining LLM initial segmentation + temporal variational inference + MDL, an agent is able to discover reusable skills from long-horizon task trajectories that help downstream task learning.
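The minimum-description-length criterion mentioned above can be illustrated with a toy scorer that trades off codebook size against skill reuse. This is illustrative only — the fixed per-skill cost is an assumption, and LAST combines MDL with LLM segmentation and variational inference rather than this direct computation:

```python
import math
from collections import Counter

def description_length(segments):
    """Toy MDL score: bits to define each unique skill plus bits to
    entropy-code the trajectory as a sequence of skill labels."""
    counts = Counter(segments)
    total = len(segments)
    # Codebook cost: assumed fixed cost per unique skill definition.
    model_bits = 16 * len(counts)
    # Data cost under the empirical skill distribution.
    data_bits = -sum(c * math.log2(c / total) for c in counts.values())
    return model_bits + data_bits

# A segmentation that reuses two skills compresses the trajectory better
# than one that treats every step as its own skill.
reused = ["reach", "grasp", "reach", "grasp", "reach", "grasp"]
unique = ["s1", "s2", "s3", "s4", "s5", "s6"]
print(description_length(reused) < description_length(unique))  # → True
```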