Juan 🇸🇻 Martínez (@just1n14n) 's Twitter Profile
Juan 🇸🇻 Martínez

@just1n14n

Salvadoran - Full Stack Economist - Keio University Econ PhD (Labor and Education) + Web Dev - Opinions always my own

ID: 119258721

calendar_today03-03-2010 03:31:46

6,6K Tweet

441 Takipçi

1,1K Takip Edilen

OpenAI (@openai) 's Twitter Profile Photo

We're releasing a preview of OpenAI o1—a new series of AI models designed to spend more time thinking before they respond. These models can reason through complex tasks and solve harder problems than previous models in science, coding, and math. openai.com/index/introduc…

Noam Brown (@polynoamial) 's Twitter Profile Photo

Today, I’m excited to share with you all the fruit of our effort at OpenAI to create AI models capable of truly general reasoning: OpenAI's new o1 model series! (aka 🍓) Let me explain 🧵 1/

Today, I’m excited to share with you all the fruit of our effort at <a href="/OpenAI/">OpenAI</a> to create AI models capable of truly general reasoning: OpenAI's new o1 model series! (aka 🍓) Let me explain 🧵 1/
Nando de Freitas (@nandodf) 's Twitter Profile Photo

It’s time to say thank you and goodbye to Google DeepMind. I had the immense fortune of working there for 10 years. They were undoubtedly the most exciting years in the history of AI, and I feel that I grew beyond all my expectations thanks to my uniquely smart, generous and

ARC Prize (@arcprize) 's Twitter Profile Photo

We put OpenAI o1 to the test against ARC Prize. Results: both o1 models beat GPT-4o. And o1-preview is on par with Claude 3.5 Sonnet. Can chain-of-thought scale to AGI? What explains o1's modest scores on ARC-AGI? Our notes: arcprize.org/blog/openai-o1…

We put OpenAI o1 to the test against ARC Prize.

Results: both o1 models beat GPT-4o. And o1-preview is on par with Claude 3.5 Sonnet.

Can chain-of-thought scale to AGI? What explains o1's modest scores on ARC-AGI?

Our notes:
arcprize.org/blog/openai-o1…