Jesse Mu (@jayelmnop)'s Twitter Profile
Jesse Mu

@jayelmnop

Computational linguistics @AnthropicAI

ID:146312174

Link: http://jesse.mu
Joined: 21-05-2010 03:42:20

584 Tweets

4.9K Followers

584 Following

John Hewitt (@johnhewtt)

Our paper on Backpacks has won an Outstanding Paper Award at ACL 2023!

If you're excited about both fascinating learned structure in language models, and designing architectures to enable interpretability while maintaining expressivity, take a read!

backpackmodels.science

Joy Hsu (@joycjhsu)

How can we build a modular and compositional system that understands 3D scenes? Excited to introduce our @CVPR paper, NS3D: Neuro-Symbolic Grounding of 3D Objects and Relations, w/ Jiayuan Mao and Jiajun Wu. Check out our poster next week at Tue-AM-249.

Jessy Lin (@realJessyLin)

How can agents like LLMs become decision-making partners for humans?

💬 Excited to share a new paper + suite of envs for decision-oriented dialogues, where agents + humans collab to solve hard everyday problems. [1/n]

Site: collaborative-dialogue.github.io

Tatsunori Hashimoto (@tatsu_hashimoto)

We are releasing AlpacaFarm, a simulator enabling everyone to run and study the full RLHF pipeline at a fraction of the time (<24h) and cost (<$200) w/ LLM-simulated annotators. Starting w/ Alpaca, we show RLHF gives big 10+% winrate gains vs davinci003 (crfm.stanford.edu/2023/05/22/alp…)

Rosanne Liu (@savvyRL)

Happy to share that our paper on teaching generative models how to spell has been accepted at ACL 2023.

Simply by making models (both language-only and text-to-image) character-aware, the notorious spelling errors are fixed!
arxiv.org/abs/2212.10562

Anthropic (@AnthropicAI)

Introducing 100K Context Windows! We've expanded Claude's context window to 100,000 tokens of text, corresponding to around 75K words. Submit hundreds of pages of materials for Claude to digest and analyze. Conversations with Claude can go on for hours or days.

Jesse Mu (@jayelmnop)

Gist model checkpoints are now up on Hugging Face. Give it a try and see what prompts you can (or can't) compress!

LLaMA-7B (weight diff only): huggingface.co/jayelm/llama-7…
FLAN-T5-XXL: huggingface.co/jayelm/flan-t5…
Code: github.com/jayelm/gisting
