Saleema Amershi (@saleemaamershi) 's Twitter Profile
Saleema Amershi

@saleemaamershi

Partner Research Manager of the Human-AI eXperiences (HAX) team @MSFTResearch. #AI #Agents #HCI #UX. Mom of two little ones. She/Her. Opinions my own.

ID: 1250849048308793345

Link: http://saleemaamershi.com/

Joined: 16-04-2020 18:11:04

249 Tweets

2.2K Followers

550 Following

Saleema Amershi (@saleemaamershi) 's Twitter Profile Photo

[Update] Our team will be accepting PhD Internship applications until **January 5th, 2024**. Please apply by then if you're interested in working with us! See 👇 for details.

Besmira Nushi 💙💛 (@besanushi) 's Twitter Profile Photo

New blog by Adam Fourney (hci.social/@adam) and Qingyun Wu on measurement tools for complex multi-agent workflows in AutoGen. AutoGenBench is a command-line tool on PyPI which handles downloading, configuring, running, and reporting supported benchmarks in AutoGen. ➡️ microsoft.github.io/autogen/blog/2…

Victor Dibia (@vykthur) 's Twitter Profile Photo

#internopportunity, summer 2024 … if you are a HCI PhD student interested in helping define the future of tools for creating multi-agent applications (AutoGen, AutoGen Studio, AutoGen), and a builder, please reach out to me, Gagan Bansal or Adam Fourney (hci.social/@adam) (DM or email). A

Sebastien Bubeck (@sebastienbubeck) 's Twitter Profile Photo

phi-3 is here, and it's ... good :-). I made a quick short demo to give you a feel of what phi-3-mini (3.8B) can do. Stay tuned for the open weights release and more announcements tomorrow morning! (And ofc this wouldn't be complete without the usual table of benchmarks!)

Microsoft Research (@msftresearch) 's Twitter Profile Photo

Researchers at Microsoft emphasize the importance of globally equitable AI. Join us at Research Forum on June 4 to learn more about data representation, the latest foundational advances, and how inclusive AI creates impact. Register now.

Microsoft Research (@msftresearch) 's Twitter Profile Photo

Don't miss Adam Fourney's talk on June 4 at Microsoft Research Forum, where he will discuss the effectiveness of using multiple agents, working together, to complete complex multi-step tasks. Register now. msft.it/6010YkH4E

Victor Dibia (@vykthur) 's Twitter Profile Photo

Over the last few months, I have been working on a low-code developer tool (AutoGen Studio) to enable rapid prototyping of multi-agent systems using AutoGen. Really excited to see the work highlighted on the Microsoft Research blog today! In the blog, we cover tasks

John Langford (@johnclangford) 's Twitter Profile Photo

As part of the Microsoft Research forum (x.com/MSFTResearch/s… ), I'm running a panel on multimodal models (x.com/msftresearch/s… ) which seems like a fun topic. Register/join if that's your jam :-)

AutoGen (@pyautogen) 's Twitter Profile Photo

This new paper from Massachusetts Institute of Technology (MIT) uses multiple #AutoGen agents to generate, refine, and validate research hypotheses! SciAgents: Automating Scientific Discovery Through Multi-Agent Intelligent Graph Reasoning by Ghafarollahi & Markus J. Buehler "We present SciAgents, an approach that

Besmira Nushi 💙💛 (@besanushi) 's Twitter Profile Photo

Excited to announce the release of Eureka, an open-source framework for evaluating and understanding large foundation models! 🌟 Eureka offers: 🔍 In-depth analysis of 12 cutting-edge models 🧠 Multimodal & language capability testing beyond single-score reporting and rankings 📈

Saleema Amershi (@saleemaamershi) 's Twitter Profile Photo

🚀 #AI advances are accelerating, with new models emerging regularly. Benchmark scores only reveal so much. For #HumanCenteredAI, we must ask: How will this model work in my app and for my users? Eureka standardizes LLM evaluation for deeper insights beyond single-score metrics 👇