Andy Zhou (@andyz245) 's Twitter Profile
Andy Zhou

@andyz245

undergrad student at UIUC | intern @virtueai_co | @lapisrocks

ID: 765646771498340352

linkhttp://www.andyzhou.ai calendar_today16-08-2016 20:29:50

673 Tweet

416 Takipçi

406 Takip Edilen

Haohan Wang (@haohanwang) 's Twitter Profile Photo

New Preprint. 🔒 In an era where AI's influence is surging, government compliance isn't optional—it's imperative. Our research introduces GUARD, a novel system ensuring LLMs and VLMs are tested for adherence to these critical standards. #AICompliance #GovernmentGuidelines

Dan Hendrycks (@danhendrycks) 's Twitter Profile Photo

Can hazardous knowledge be unlearned from LLMs without harming other capabilities? We’re releasing the Weapons of Mass Destruction Proxy (WMDP), a dataset about weaponization, and we create a way to unlearn this knowledge. 📝arxiv.org/abs/2403.03218 🔗wmdp.ai

Can hazardous knowledge be unlearned from LLMs without harming other capabilities?

We’re releasing the Weapons of Mass Destruction Proxy (WMDP), a dataset about weaponization, and we create a way to unlearn this knowledge.

📝arxiv.org/abs/2403.03218
🔗wmdp.ai
Haohan Wang (@haohanwang) 's Twitter Profile Photo

🚀 Thanks for hosting! Excited to share our latest work on jailbreaking LLMs: 1️⃣ Compliance testing with jailbreak 🧐 arxiv.org/abs/2402.03299 2️⃣ systematic approach to defense 💪 arxiv.org/abs/2401.17263 with Haibo, Andy Zhou, Lapis Labs, and Bo Li; Trustworthy ML Initiative (TrustML)

LlamaIndex 🦙 (@llama_index) 's Twitter Profile Photo

Language Agent Tree Search 🤖🌲 As LLMs get faster, better, cheaper, developers will be able to compose agentic systems that are able to plan out an entire tree of possible futures, instead of just sequentially planning the next state (e.g. in ReAct). This is crucial for higher

Language Agent Tree Search 🤖🌲

As LLMs get faster, better, cheaper, developers will be able to compose agentic systems that are able to plan out an entire tree of possible futures, instead of just sequentially planning the next state (e.g. in ReAct). This is crucial for higher
Andy Zhou (@andyz245) 's Twitter Profile Photo

Pleased to announce Language Agent Tree Search was accepted to #ICML2024 !! We propose a general search algorithm for LM agents that effectively navigates the prompt space for agent tasks Check it out here arxiv.org/abs/2310.04406 Great LangChain implementation here

Andy Zhou (@andyz245) 's Twitter Profile Photo

I got an email bringing this paper to my attention that mentioned something was concerning, and the methodology is almost exactly the same as our ICML 2024 work (released Oct 2023), Language Agent Tree Search (arxiv.org/abs/2310.04406), but did not cite us...

Revanth Gangi Reddy (@gangi_official) 's Twitter Profile Photo

Introducing FIRST: Faster Improved Listwise Reranking with Single Token Decoding arxiv.org/pdf/2406.15657 Listwise LLM reranking typically outputs the ranking order as a generation sequence. Instead, we use output logits of the first generated identifier to obtain the ranking.

Introducing FIRST: Faster Improved Listwise Reranking with Single Token Decoding

arxiv.org/pdf/2406.15657

Listwise LLM reranking typically outputs the ranking order as a generation sequence. Instead, we use output logits of the first generated identifier to obtain the ranking.
Virtue AI (@virtueai_co) 's Twitter Profile Photo

We present AIR 2024, a unified AI Risk Taxonomy for AI regulation and company policy-guided risk assessment and compliance, jointly with Stanford University's HELM. 📜Blog: virtueai.com/2024/07/27/dec…

Andy Zhou (@andyz245) 's Twitter Profile Photo

Excited to release this work on AI policy! We map out company/government policies into a taxonomy with 314 potentially harmful categories arxiv.org/abs/2406.17864

alphaXiv (@askalphaxiv) 's Twitter Profile Photo

Excited to feature Tamper-Resistant Safeguards for Open-Weight LLMs from Lapis Labs! Introducing the first safeguards for LLMs that resist fine-tuning attacks, showing the power of tamper-resistance to make open-weight LLMs safer. Rishub Tamirisa is here to answer your questions!

Excited to feature Tamper-Resistant Safeguards for Open-Weight LLMs from <a href="/lapisrocks/">Lapis Labs</a>!

Introducing the first safeguards for LLMs that resist  fine-tuning attacks, showing the power of tamper-resistance to make open-weight LLMs safer.

<a href="/rishub_t/">Rishub Tamirisa</a> is here to answer your questions!
Chi Wang (@chi_wang_) 's Twitter Profile Photo

Join us for the invited talk Language Agent Tree Search in the AutoGen community meetup (Do not miss!) Time: August 26, Monday 9am PT. Event link: discord.com/events/1153072… Abstract: ⬇️