Berkeley AI Research (@berkeley_ai)'s Twitter Profile
Berkeley AI Research

@berkeley_ai

We're graduate students, postdocs, faculty and scientists at the cutting edge of artificial intelligence research.

ID:891077171673931776

http://bair.berkeley.edu/ · Joined 28-07-2017 23:25:27

772 Tweets

152.2K Followers

190 Following

Berkeley AI Research(@berkeley_ai) 's Twitter Profile Photo

Registration is now open for an exciting workshop organized by Aditi Krishnapriyan and Jennifer Listgarten at the Simons Institute for the Theory of Computing June 10th-14th in Berkeley, AI≡Science: Strengthening the Bond Between the Sciences and Artificial Intelligence. simons.berkeley.edu/workshops/aisc…

Xiuyu Li(@xiuyu_l) 's Twitter Profile Photo

Handling long context in LLMs is expensive, but can we cut the cost by learning them offline for a specific set/genre of documents?

Introducing LLoCO, our new technique that learns documents offline through context compression and in-domain finetuning using LoRA, which achieves…

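For readers unfamiliar with the in-domain finetuning half of this recipe, here is a minimal LoRA sketch using Hugging Face transformers and peft. The base model and hyperparameters are illustrative placeholders, not LLoCO's actual settings, and the context-compression step is omitted.

```python
# Minimal sketch of LoRA-based in-domain finetuning, one ingredient of the
# LLoCO recipe described above (context compression is not shown).
# Model name and hyperparameters are illustrative, not the paper's settings.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

base = "meta-llama/Llama-2-7b-hf"  # assumed base model, for illustration only
tokenizer = AutoTokenizer.from_pretrained(base)
model = AutoModelForCausalLM.from_pretrained(base)

lora_cfg = LoraConfig(
    r=16,                                 # low-rank update dimension
    lora_alpha=32,                        # scaling factor
    target_modules=["q_proj", "v_proj"],  # attention projections to adapt
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_cfg)
model.print_trainable_parameters()  # only the small LoRA adapters are trained
```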
Jiayi Pan(@pan_jiayipan) 's Twitter Profile Photo

New paper from @Berkeley_AI on Autonomous Evaluation and Refinement of Digital Agents!

We show that VLM/LLM-based evaluators can significantly improve the performance of agents for web browsing and device control, advancing the state of the art by 29% to 75%.

arxiv.org/abs/2404.06474 [🧵]

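A hedged sketch of the general evaluate-then-refine loop the tweet describes: a judge model scores an agent's trajectory and the agent retries on failure. The helper names (agent, judge, run functions) are hypothetical placeholders, not the paper's API.

```python
# Illustrative evaluate-then-refine loop. The evaluator is any LLM/VLM callable
# that returns text; the agent is any callable that produces a trajectory.
def evaluate_trajectory(task, trajectory, judge):
    """Ask a judge model whether the trajectory completed the task."""
    prompt = (f"Task: {task}\nTrajectory:\n{trajectory}\n"
              "Did the agent succeed? Answer yes or no.")
    return judge(prompt).strip().lower().startswith("yes")

def refine(task, agent, judge, max_attempts=3):
    feedback = None
    for _ in range(max_attempts):
        trajectory = agent(task, feedback=feedback)       # hypothetical agent call
        if evaluate_trajectory(task, trajectory, judge):  # autonomous evaluation
            return trajectory
        feedback = "The previous attempt failed; try a different approach."
    return trajectory  # best effort after max_attempts
```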
Shishir Patil(@shishirpatil_) 's Twitter Profile Photo

📢Excited to release GoEx⚡️a runtime for LLM-generated actions like code, API calls, and more. Featuring 'post-facto validation' for assessing LLM actions after execution 🔍 Key to our approach is 'undo' 🔄 and 'damage confinement' abstractions to manage unintended actions &…

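To illustrate the 'undo' and 'post-facto validation' abstractions in spirit, here is a minimal sketch; it is not the GoEx API, just one way such a runtime guard could look.

```python
# Hedged sketch: every LLM-proposed action is paired with a compensating
# reversal, and post-facto validation decides whether to keep or roll back.
from dataclasses import dataclass
from typing import Callable

@dataclass
class ReversibleAction:
    execute: Callable[[], object]   # performs the action (e.g., an API call)
    undo: Callable[[], None]        # compensating action that reverses it

def run_with_validation(action: ReversibleAction,
                        validate: Callable[[object], bool]):
    result = action.execute()
    if not validate(result):        # post-facto validation after execution
        action.undo()               # roll back to confine damage
        raise RuntimeError("Action failed validation and was undone.")
    return result
```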
Jason Hu(@onjas_buidl) 's Twitter Profile Photo

🚀 Introducing RouterBench, the first comprehensive benchmark for evaluating LLM routers! 🎉
In a collaboration between Martian and Prof. Kurt Keutzer at UC Berkeley, we've created the first holistic framework to assess LLM routing systems. 🧵1/8

To read more:…

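As a rough illustration of what evaluating an LLM router involves, here is a hedged sketch that scores a router on quality and cost over precomputed per-model results; the data layout is invented for the example, not RouterBench's schema.

```python
# Illustrative router evaluation: given per-model quality scores and costs for
# each query, measure the quality/cost trade-off of a routing policy.
def evaluate_router(queries, router, scores, costs):
    """scores[model][q] in [0, 1]; costs[model][q] in dollars (assumed layout)."""
    total_quality, total_cost = 0.0, 0.0
    for q in queries:
        model = router(q)                 # router picks a model per query
        total_quality += scores[model][q]
        total_cost += costs[model][q]
    n = len(queries)
    return total_quality / n, total_cost  # average quality and total spend
```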
Ken Goldberg(@Ken_Goldberg) 's Twitter Profile Photo

'Why don't we have better robots yet?': just posted on the @TEDTalks home page under Newest Talks (3rd row from the top) with links to @PieRobotics and @Forbes article on art by Ben Wolff @creativecellist @Berkeley_AI @UCBerkeley @Cal_Engineer TED.com
Karl Pertsch(@KarlPertsch) 's Twitter Profile Photo

Access to *diverse* training data is a major bottleneck in robot learning. We're releasing DROID, a large-scale in-the-wild manipulation dataset. 76k trajectories, 500+ scenes, multi-view stereo, language annotations etc
Check it out & download today!

💻: droid-dataset.github.io

Carlo Sferrazza(@carlo_sferrazza) 's Twitter Profile Photo

Humanoids 🤖 will do anything humans can do. But are state-of-the-art algorithms up to the challenge?

Introducing HumanoidBench, the first-of-its-kind simulated humanoid benchmark with 27 distinct whole-body tasks requiring intricate long-horizon planning and coordination.

🧵👇

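For context, simulated humanoid tasks like these are typically consumed through the standard Gymnasium reset/step loop; the sketch below uses Gymnasium's built-in Humanoid-v4 environment as a stand-in rather than an actual HumanoidBench task.

```python
# Standard reset/step loop over a simulated humanoid environment.
# Humanoid-v4 is Gymnasium's built-in MuJoCo humanoid, used here only as a
# stand-in; it is not one of the 27 HumanoidBench tasks.
import gymnasium as gym

env = gym.make("Humanoid-v4")
obs, info = env.reset(seed=0)
for _ in range(1000):
    action = env.action_space.sample()   # replace with a trained whole-body policy
    obs, reward, terminated, truncated, info = env.step(action)
    if terminated or truncated:
        obs, info = env.reset()
env.close()
```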
Anastasios Nikolas Angelopoulos(@ml_angelopoulos) 's Twitter Profile Photo

U give me: a bunch of unlabeled data.

I give u: AI-generated labels.

Result: a massive, but biased, val set.

We use PPI to correct the bias, giving unbiased evaluations with better precision 🚀

arxiv.org/abs/2403.07008

Experiments on GPT-4 and ResNets, using lmsys.org :)

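The core PPI idea can be written in a few lines: average the AI-generated labels on the big unlabeled set, then correct the bias with a small human-labeled set. The sketch below shows only the point estimate for a mean (the paper's confidence intervals are omitted), with synthetic data standing in for real labels.

```python
# Hedged sketch of the prediction-powered inference (PPI) point estimate for a
# mean: model predictions on a large unlabeled set, bias-corrected by a small
# labeled set. Interval construction, the main contribution, is not shown.
import numpy as np

def ppi_mean(preds_unlabeled, preds_labeled, labels):
    """preds_*: model-generated labels; labels: human ground truth."""
    naive = np.mean(preds_unlabeled)        # biased: model labels only
    rectifier = np.mean(np.asarray(labels) - np.asarray(preds_labeled))
    return naive + rectifier                # bias-corrected PPI estimate

# Synthetic example: a model whose labels are systematically shifted upward.
rng = np.random.default_rng(0)
truth = rng.binomial(1, 0.6, size=10_000).astype(float)
model_labels = np.clip(truth + rng.normal(0.1, 0.2, size=10_000), 0, 1)
print(ppi_mean(model_labels[1000:], model_labels[:1000], truth[:1000]))
```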
Berkeley AI Research(@berkeley_ai) 's Twitter Profile Photo

Looking to hire top AI talent?

We've compiled a list of the brilliant Berkeley AI Research Ph.D. Graduates of 2024 who are currently on the academic and industry job markets. (Thanks to our friends at Stanford AI Lab for the idea!)

Check it out here:
bair.berkeley.edu/blog/2024/03/1…

Katie Kang(@katie_kang_) 's Twitter Profile Photo

We know LLMs hallucinate, but what governs what they dream up? Turns out it’s all about the “unfamiliar” examples they see during finetuning

Our new paper shows that manipulating the supervision on these special examples can steer how LLMs hallucinate

arxiv.org/abs/2403.05612
🧵

Catherine Chen(@cathychen23) 's Twitter Profile Photo

Do brain representations of language depend on whether the inputs are pixels or sounds?

Our Communications Biology paper studies this question from the perspective of language timescales. We find that representations are highly similar between modalities! rdcu.be/dACh5

1/8

Boyi Li(@Boyiliee) 's Twitter Profile Photo

🚀 Thrilled to share our CVPR 2024 paper: Self-correcting LLM-controlled Diffusion Models (SLD)!

SLD can automatically edit any image or fix text-to-image misalignments across generative models - no extra training is needed.

youtube.com/watch?v=PxoOl9…

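A hedged sketch of the self-correction loop described above: generate, have an LLM plus a detector check the image against the prompt, and apply corrective edits until they agree. Every function name here is a hypothetical placeholder, not the SLD codebase.

```python
# Illustrative generate / check / correct loop in the spirit of SLD.
# generate, detect_objects, llm_check, and edit are hypothetical callables.
def self_correcting_generation(prompt, generate, detect_objects, llm_check,
                               edit, max_rounds=3):
    image = generate(prompt)                          # any off-the-shelf T2I model
    for _ in range(max_rounds):
        detections = detect_objects(image)            # open-vocabulary detector
        ok, edit_ops = llm_check(prompt, detections)  # LLM compares prompt vs. image
        if ok:
            break
        image = edit(image, edit_ops)                 # training-free corrective edits
    return image
```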
Ritwik Gupta 🇺🇦(@Ritwik_G) 's Twitter Profile Photo

From your cell phone to your TV, images and videos are now captured in 4K resolution or better. Vision methods, however, opt to downsize or crop them, losing information. We introduce xT, our framework to model large images end-to-end on contemporary GPUs! ai-climate.berkeley.edu/xt-website/

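As a rough sketch of the tile-then-fuse idea, the code below splits a large image into regions, encodes each region independently, and fuses the region features with a small transformer; module choices and shapes are illustrative, not xT's architecture.

```python
# Illustrative tile-then-fuse pipeline: per-region encoding followed by
# cross-region attention. The region encoder is assumed to map a
# (1, C, r, r) tile to a (1, d_model) feature vector.
import torch
import torch.nn as nn

def tile(image, region=256):
    """Split a (C, H, W) image into non-overlapping (C, region, region) tiles."""
    c, h, w = image.shape
    tiles = image.unfold(1, region, region).unfold(2, region, region)
    return tiles.permute(1, 2, 0, 3, 4).reshape(-1, c, region, region)

class RegionThenSequence(nn.Module):
    def __init__(self, region_encoder: nn.Module, d_model: int):
        super().__init__()
        self.region_encoder = region_encoder          # e.g., a small ViT per tile
        layer = nn.TransformerEncoderLayer(d_model, nhead=8, batch_first=True)
        self.context = nn.TransformerEncoder(layer, num_layers=2)  # cross-region fusion

    def forward(self, image):
        feats = torch.stack([self.region_encoder(t.unsqueeze(0)).squeeze(0)
                             for t in tile(image)])           # one feature per region
        return self.context(feats.unsqueeze(0)).squeeze(0)    # fuse across regions
```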
Toru(@ToruO_O) 's Twitter Profile Photo

Achieving bimanual dexterity with RL + Sim2Real!

toruowo.github.io/bimanual-twist/

TLDR - We train two robot hands to twist bottle lids using deep RL followed by sim-to-real. A single policy trained with simple simulated bottles can generalize to drastically different real-world objects.

Nika Haghtalab(@nhaghtal) 's Twitter Profile Photo

I'm honored to be included in this amazing cohort of Schmidt Sciences AI2050 fellows.

Grateful to all the students and collaborators; this award also recognizes their efforts toward comprehensive foundations of AI and ML that account for social and strategic considerations.

Ilija Radosavovic(@ir413) 's Twitter Profile Photo

we cast real-world humanoid control as next token prediction; our approach enables joint training with youtube videos and walks in sf

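A minimal sketch of "control as next-token prediction" in general: discretize observation/action streams into one token sequence and train a causal transformer on it. Vocabulary size, model size, and tokenization are illustrative assumptions, not the paper's configuration.

```python
# Illustrative next-token model over discretized observation/action streams.
import torch
import torch.nn as nn

def discretize(x, low=-1.0, high=1.0, bins=256):
    """Map continuous values in [low, high] to integer tokens in [0, bins)."""
    x = torch.clamp((x - low) / (high - low), 0, 1 - 1e-6)
    return (x * bins).long()

class CausalTokenModel(nn.Module):
    def __init__(self, vocab=256, d_model=256, n_layers=4, n_heads=8, max_len=1024):
        super().__init__()
        self.embed = nn.Embedding(vocab, d_model)
        self.pos = nn.Embedding(max_len, d_model)
        layer = nn.TransformerEncoderLayer(d_model, n_heads, batch_first=True)
        self.blocks = nn.TransformerEncoder(layer, num_layers=n_layers)
        self.head = nn.Linear(d_model, vocab)

    def forward(self, tokens):             # tokens: (B, T) interleaved obs/action tokens
        T = tokens.shape[1]
        h = self.embed(tokens) + self.pos(torch.arange(T, device=tokens.device))
        mask = nn.Transformer.generate_square_subsequent_mask(T).to(tokens.device)
        h = self.blocks(h, mask=mask)      # causal attention over the trajectory
        return self.head(h)                # next-token logits at every step
```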
Lawrence Yunliang Chen(@Lawrence_Y_Chen) 's Twitter Profile Photo

Introducing Mirage: Zero-shot transfer of visuomotor policies to unseen robot embodiments 🤖

With Mirage, you can train a policy on one robot and deploy it on a different one that it has never seen, with no additional data or training! 🧵👇 (1/8)

🌐 robot-mirage.github.io
