Sebastian Raschka
@rasbt
AI & ML researcher. Author of the "Build a Large Language Model From Scratch" book (mng.bz/n1O4). LLM research engineer @LightningAI.
ID: 865622395
https://sebastianraschka.com/books/ 07-10-2012 02:06:16
16,16K Tweet
285,285K Followers
907 Following
Here's my full interview with Sebastian Raschka, one of the great AI educators today. We cover many details from DPO training failure modes, ChatGPT vs Claude, Llama 3.1, moderating Arxiv, avoiding hype, writing, getting started in AI, and other topics. Chapters: 00:00:00 Introduction &
While there are 100s of LLMs papers published each month proposing new techniques. The best way to see what truly works in practice is by looking at the pre-training and post-training pipelines of the latest state-of-the-art models. Here's what I found: magazine.sebastianraschka.com/p/new-llm-pre-…