Beidi Chen (@beidichen) 's Twitter Profile
Beidi Chen

@beidichen

Asst. Prof @CarnegieMellon, Visiting Researcher @Meta, Postdoc @Stanford, Ph.D. @RiceUniversity, Large-Scale ML, a fan of Dota2.

ID: 424387623

linkhttps://www.andrew.cmu.edu/user/beidic/ calendar_today29-11-2011 18:22:36

387 Tweet

6,6K Followers

355 Following

Paul Liang (@pliang279) 's Twitter Profile Photo

📣 I'm thrilled to share that I’ll be joining MIT as an assistant professor this fall, joint between MIT Media Lab & MIT EECS. My group will advance the foundations of multisensory AI to enhance the human experience. I look forward to tackling exciting challenges in multimodal AI

📣 I'm thrilled to share that I’ll be joining MIT as an assistant professor this fall, joint between <a href="/medialab/">MIT Media Lab</a> &amp; <a href="/MITEECS/">MIT EECS</a>.

My group will advance the foundations of multisensory AI to enhance the human experience.

I look forward to tackling exciting challenges in multimodal AI
Horace He (@chhillee) 's Twitter Profile Photo

For too long, users have lived under the software lottery tyranny of fused attention implementations. No longer. Introducing FlexAttention, a new PyTorch API allowing for many attention variants to enjoy fused kernels in a few lines of PyTorch. pytorch.org/blog/flexatten… 1/10

For too long, users have lived under the software lottery tyranny of fused attention implementations. 

No longer. 

Introducing FlexAttention, a new PyTorch API allowing for many attention variants to enjoy fused kernels in a few lines of PyTorch.
pytorch.org/blog/flexatten…
1/10
Dan Fu (@realdanfu) 's Twitter Profile Photo

Excited to share that I will be joining UCSD CSE as an assistant professor in January 2026! I'll be recruiting PhD students from the 2024 application pool - if you're interested in anything ML Sys/efficiency/etc please reach out & put my name on your application! Until then

Chunting Zhou (@violet_zct) 's Twitter Profile Photo

Introducing *Transfusion* - a unified approach for training models that can generate both text and images. arxiv.org/pdf/2408.11039 Transfusion combines language modeling (next token prediction) with diffusion to train a single transformer over mixed-modality sequences. This

Introducing *Transfusion* - a unified approach for training models that can generate both text and images. arxiv.org/pdf/2408.11039

Transfusion combines language modeling (next token prediction) with diffusion to train a single transformer over mixed-modality sequences. This
Beidi Chen (@beidichen) 's Twitter Profile Photo

🤯This study explains my year-long confusion on why #GPT4 leak says OpenAI deployed speculative decoding in their serving last June by Dylan Patel SemiAnalysis because I thought SD is only useful for small batches... Surprisingly speculative decoding can bring more benefits when

Prof. Anima Anandkumar (@animaanandkumar) 's Twitter Profile Photo

Congratulations Jiawei Zhao on an excellent PhD defense! Jiawei has been a pioneer in hardware-efficient training. When he started his PhD, everyone was focusing on inference efficiency, and training runs were small, Jiawei took the bold step to pursue training efficiency. Slides:

Congratulations <a href="/jiawzhao/">Jiawei Zhao</a> on an excellent PhD defense! Jiawei has been a pioneer in hardware-efficient training. When he started his PhD, everyone was focusing on inference efficiency, and training runs were small, Jiawei took the bold step to pursue training efficiency. Slides:
Tianqi Chen (@tqchenml) 's Twitter Profile Photo

#MLSys2025 call for papers is out! The conference will be led by the general chair Matei Zaharia , PC chairs Yingyan (Celine) Lin, and Gauri Joshi. Consider submitting and bringing your latest works in AI and systems—more details at mlsys.org.

#MLSys2025 call for papers is out! The conference will be led by the general chair  <a href="/matei_zaharia/">Matei Zaharia</a> , PC chairs <a href="/CelineLinatGT/">Yingyan (Celine) Lin</a>, and Gauri Joshi. Consider submitting and bringing your latest works in AI and systems—more details at mlsys.org.
Together AI (@togethercompute) 's Twitter Profile Photo

We are excited to share our latest work on speculative decoding for high-throughput inference! Before this work, we thought speculative decoding was useless at large batch sizes since the GPUs would go brrrr from processing all the different inputs. Much to our surprise, we