Scale AI (@scale_ai) Twitter Tweets • TwiDoom

Alexandr Wang

2 months ago

1/Meta just released Llama3.1 405B!

Scale AI partnered deeply with Meta on this release:

🥇 SEAL Evaluations: Based on our evals
🥇 on IF
🥈 on Math
#4 on Coding
💼 Enterprise partnership for custom Llama models
🤖 Data Foundry partnership on RLHF & SFT

👇

Likes: 554 · Replies: 24 · Reposts: 66

TechNet

@technetupdate

2 months ago

AI has been utilized in the financial services and housing sectors for decades. But as Scale AI testified, we must properly deploy AI in a safe, responsible, and thoughtful manner to grow the U.S. economy. #AIforAmerica

Likes: 5 · Replies: 2 · Reposts: 1

Ed Ludlow

@edludlow

2 months ago

!!! ITS FINALLY HERE !!!

Bloomberg’s 2024 List of Top AI Startups:

With so much going on, it was difficult to decide which companies should make the cut. Rachel Metz, Shirin Ghaffary and Dina Bass had a tough job.

bloomberg.com/features/2024-… (gift link)

Including: xAI/Elon Musk,

Likes: 83 · Replies: 7 · Reposts: 19

The Cognitive Revolution Podcast

@cogrev_podcast

2 months ago

New episode out! Nathan Labenz hosts Riley Goodside, the world's first staff prompt engineer at Scale AI, to discuss the evolution of prompt engineering. Check out the full episode here: cognitiverevolution.ai/from-poetry-to…

Likes: 21 · Replies: 4 · Reposts: 7

Scale AI

@scale_ai

2 months ago

Introducing the latest addition to the SEAL Leaderboards: Adversarial Robustness. scl.ai/ar-leaderboard Adversarial Robustness evaluates top models against 1,000 adversarial prompts, covering critical areas like illegal activities, harm, and hate speech. Why it matters 👇 ✅

Likes: 12 · Replies: 3 · Reposts: 0

Demis Hassabis

@demishassabis

2 months ago

Great to see Gemini 1.5 Pro top the new @scale_ai leaderboard for adversarial robustness! Congrats to the entire Gemini team, and special thanks to Anca Dragan & the AI safety team for leading the charge on building in robustness to our models as a core capability.

Likes: 406 · Replies: 20 · Reposts: 50

OPTO

@optothemes

2 months ago

⚔️ AI Wars: Who's Leading the New Global Arms Race? 🎙️ Vijay Karunamurthy, Field CTO at Scale AI, discusses global competition for AI talent and regulatory challenges faced by companies like Scale AI working across the US and UK, in the wider context of the new arms race against China.

Likes: 5 · Replies: 1 · Reposts: 2

Google DeepMind

@googledeepmind

2 months ago

Gemini 1.5 Pro is the safest model on @Scale_AI's leaderboard for adversarial robustness. Evaluations look at how it performed when tested by harmful prompts compared to others. As we continue to develop advanced AI, we're committed to ensuring safety is built in from scratch.

Likes: 299 · Replies: 29 · Reposts: 42

Nathan Labenz

@labenz

2 months ago

People tell me they listen to The Cognitive Revolution Podcast for the "nuggets".

If you're automating routine work with LLMs, my episode with Riley Goodside, the world's first Staff Prompt Engineer at Scale AI, is full of them.

Here's Riley on task decomposition & reasoning demonstrations.

Recommended!

Likes: 34 · Replies: 6 · Reposts: 4

Scale AI

@scale_ai

a month ago

Scale is on Forbes’ 2024 Cloud 100 list! The list recognizes the world’s top 100 cloud computing companies.

forbes.com/lists/cloud100/

Join us on our mission to accelerate the development of AI applications 👉 scale.com/careers

Likes: 19 · Replies: 3 · Reposts: 3

You Might Be Right Podcast

@ymbrpodcast

a month ago

#ListenNow: It's been nearly a year since our episode about Artificial Intelligence (AI) technology in Season 3, which seems like a lifetime ago in this quickly developing industry. In this week's episode, Govs. Phil Bredesen and Bill Haslam spoke with Michael Kratsios, former

Likes: 6 · Replies: 0 · Reposts: 1

Scale AI

@scale_ai

a month ago

How do you know if a model is truly solving problems or if it’s just repeating answers from its training?

Scale's Hugh Zhang is giving a tech talk on how his team developed GSM1k to expose potential data contamination in leading reasoning benchmarks

👉 scl.ai/unmasking-llm-…

Likes: 23 · Replies: 2 · Reposts: 3

Nathaniel Li

@natliml

23 days ago

Who's better at LLM mischief — humans or AIs? Spoiler: It's us. Human red teamers achieve 70%+ attack success rates against LLM defenses that stump automated adversarial attacks. Why? We’re better at adversarial yapping.🧵

Likes: 89 · Replies: 8 · Reposts: 18

Scale AI

@scale_ai

16 days ago

📢 Happening tomorrow! Can’t make it? Register to receive the recording: scl.ai/unmasking-llm-… How do you know if a model is truly solving problems or if it’s just repeating answers from its training? Join Hugh Zhang's tech talk tomorrow to learn about what his team found

Likes: 8 · Replies: 0 · Reposts: 0

Scale AI

@scale_ai

15 days ago

We’ve added Mistral Large 2, GPT-4o (August 2024), and Gemini 1.5 Pro (August 27, 2024) to the SEAL LLM Leaderboards. See how they rank compared to leading LLMs across Coding, Instruction Following, Math, and Spanish domains: scl.ai/leaderboard

Likes: 19 · Replies: 2 · Reposts: 3