Chris Alexiuk (@llm_wizard) 's Twitter Profile
Chris Alexiuk

@llm_wizard

Lover of Machine Learning, D&D, and Physics

ID: 1401910925288955913

http://youtube.com/@chrisalexiuk · 07-06-2021 14:36:48

671 Tweets

520 Followers

353 Following

Chris Alexiuk (@llm_wizard) 's Twitter Profile Photo

This depends a lot on what you want out of your career - and isn't great general advice. Right now, LLMs are great systems to work on and with. There's a tonne of unexplored space, and a tonne of cool ideas that need more thought. Finding a career in the part of the field won't

Chris Alexiuk (@llm_wizard) 's Twitter Profile Photo

Where Yann LeCun's definition of "science" is too strict for me; Bojan Tunguz's definition is too loose. There are too many definitions of "the scientific method" - and so the phrase "if you're using the scientific method in your work, you are doing science" means different things to

NVIDIA (@nvidia) 's Twitter Profile Photo

We're honored to be named among TIME’s Most Influential Companies of 2024. Thanks to our partners and customers, NVIDIA continues to shape the future of AI. #TIME100 nvda.ws/4e1d9D4

Chris Alexiuk (@llm_wizard) 's Twitter Profile Photo

"Getting out of your bubble" is the best advice in this list! It's crazy how much knowledge you take for granted when you're stuck inside highly specialized communities.

JFPuget 🇺🇦 (@jfpuget) 's Twitter Profile Photo

It should be mandatory to read Turing's paper on Machine Intelligence before having the right to use the phrase "Turing test", whatever the context and purpose. It is not too late to read it, here it is: academic.oup.com/mind/article/L…

Chris Alexiuk (@llm_wizard) 's Twitter Profile Photo

Hang on, you're saying if I want to be good at Twitter - all I need to do is say: RAG is just improving LLM outputs by dynamically adding relevant context?

Philipp Schmid (@_philschmid) 's Twitter Profile Photo

Not Llama 3 405B, but Nemotron 4 340B! NVIDIA just released 340B dense LLM matching the original OpenAI GPT-4 performance for chat applications and synthetic data generation. 🤯 NVIDIA does not claim ownership of any outputs generated. 💚 TL;DR: 🧮 340B Parameters with 4k

Chris Alexiuk (@llm_wizard) 's Twitter Profile Photo

Today, NVIDIA announced a brand new set of models, Nemotron-4 340B, tailor-made for synthetic data generation! Using a custom permissive license called NVIDIA Open Model License Agreement, which: - Allows commercial use - NVIDIA does not own outputs - Is only 2 pages The

lmsys.org (@lmsysorg) 's Twitter Profile Photo

Chatbot Arena update! NVIDIA AI's Nemotron-4-340B has just edged past Llama-3-70B to become the new best open model on Arena leaderboard! Key highlights: - Impressive performance in longer queries - Balanced multilingual capabilities - Robust performance in "Hard Prompts"

Chris Alexiuk (@llm_wizard) 's Twitter Profile Photo

This feels like saying "we shouldn't work on noise-reduction during NISQ because eventually we won't need it". It feels like there are loads of innovations left that can be applied to LLMs which are also applicable to deep learning in general, as well as LLMs being a potential

Chris Alexiuk (@llm_wizard) 's Twitter Profile Photo

Aw man, Qdrant - very disappointing to see. I would expect this more from smaller, less established teams. Thanks to Nils for running this.

harpreet (@datascienceharp) 's Twitter Profile Photo

I recently got a chance to hack around with NVIDIA's NIM API (that's a lot of capital letters in a row), and I gotta say...it's actually pretty dope. (And I've got a starter notebook for you to hack around with the API). 🤔WTF is a NIM? It's basically a Docker container with

NVIDIA AI Developer (@nvidiaaidev) 's Twitter Profile Photo

Explore our advanced pipeline for creating a preference dataset using Nemotron-4 340B Instruct; we generate tailored questions and responses, which are then assessed by Nemotron-4 340B Reward to ensure precise training with NeMo Aligner. nvda.ws/4d315zW