Maitrix.org (@maitrixorg)'s Twitter Profile
Maitrix.org

@maitrixorg

Open Organization to Build AI-powered Realities with LLMs, World Models, Agent Models.

ID: 1771028347486969856

Link: https://maitrix.org · Joined: 22-03-2024 04:17:31

56 Tweets

386 Followers

81 Following

LLM360 (@llm360)'s Twitter Profile Photo

Please welcome K2-65B🏔️, the most performant fully-open LLM released to date. As a blueprint for open-source AGI, we release all model checkpoints, code, logs, and data.

About K2:
🧠 65 billion parameters
🪟 Fully transparent & reproducible
🔓 Apache 2.0
📈 Outperforms Llama 2 70B

Theoretically Media (@theomediaai)'s Twitter Profile Photo

No shade to Runway obviously, but in today's video I'm taking a look at what the future of AI Video Motion Control might look like via Maitrix.org's Pandora. PLUS...

Zhiting Hu (@zhitinghu)'s Twitter Profile Photo

Check out #K2, a fully-open 65B LLM released by LLM360. Matching the performance of #Llama2 70B, #K2 is among the most powerful LLMs made fully transparent! Over the past 6 months, LLM360 has released a series of fully open LLMs across different tiers, all with open weights,

Zhiting Hu (@zhitinghu)'s Twitter Profile Photo

🔥Excited that Redcoast won the #NAACL2024 best demo runner-up!

Redcoast is a super easy-to-use tool ☕️ for automated distributed training of #LLMs, diffusion, reinforcement learning, meta-learning, etc.

Users just write three functions (collate, loss, predict), and Redcoast

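The "three functions" workflow described in the tweet can be sketched as follows. This is a purely illustrative, single-process stand-in, not Redcoast's actual API: the function names match the tweet, but the signatures, the `params` dict, and the toy `train` loop are assumptions.

```python
# Hypothetical three-function interface in the spirit of the tweet:
# the user writes collate, loss, and predict, and the framework owns
# the (distributed) training loop. Illustrative only.

def collate(examples):
    # Batch a list of (x, y) pairs into two parallel lists.
    xs = [x for x, _ in examples]
    ys = [y for _, y in examples]
    return xs, ys

def loss(params, batch):
    # Mean squared error for a 1-D linear model y = w * x + b.
    xs, ys = batch
    preds = [params["w"] * x + params["b"] for x in xs]
    return sum((p - y) ** 2 for p, y in zip(preds, ys)) / len(ys)

def predict(params, xs):
    return [params["w"] * x + params["b"] for x in xs]

def train(params, data, lr=0.05, steps=500):
    # Toy single-process stand-in for the framework's training loop,
    # doing plain gradient descent on the MSE above.
    xs, ys = collate(data)
    n = len(ys)
    for _ in range(steps):
        errs = [params["w"] * x + params["b"] - y for x, y in zip(xs, ys)]
        params["w"] -= lr * 2 * sum(e * x for e, x in zip(errs, xs)) / n
        params["b"] -= lr * 2 * sum(errs) / n
    return params
```

The point of the design is that the user code stays this small regardless of how the framework parallelizes the loop.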
Lianhui Qin (@lianhuiq)'s Twitter Profile Photo

💡Divergent thinking💡 is a hallmark of human creativity and problem-solving.

🤖Can LLMs also do divergent reasoning to generate diverse solutions🤔?

Introducing Flow-of-Reasoning (FoR) 🌊, a data-efficient way of training an LLM policy to generate diverse, high-quality reasoning

Maitrix.org (@maitrixorg)'s Twitter Profile Photo

"With long context LLMs comes long prompts"👇

People typically just write 1- or 2-sentence quick prompts when using an LLM for a task.

How to create 1- or 2-page long prompts to boost performance?

🔥PromptAgent automatically writes long prompts for you!🔥

Without need of the

Zhiting Hu (@zhitinghu)'s Twitter Profile Photo

Optimizing pages-long expert-level prompts automatically 👇

It's fascinating that _prompt optimization_ can be formulated as a _planning_ problem:
- Treat the LLM as a world model 🌎
- We want a prompt, as a plan trajectory, that thrives in this world
- So we do strategic
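The planning framing can be sketched as search over prompt edits: candidate prompts are trajectories, a scorer plays the role of the world that tells us how each fares, and we keep expanding the best candidates. This is an illustrative beam search with stand-in `expand`/`score` functions, not PromptAgent's actual algorithm (which the thread describes as strategic planning).

```python
# Illustrative sketch of prompt optimization as search (not PromptAgent's
# actual code). expand(prompt) proposes edited prompts; score(prompt)
# evaluates a prompt, e.g. accuracy on a small dev set.

def beam_search_prompts(seed_prompt, expand, score, beam_width=3, depth=3):
    # Start the beam from the seed prompt.
    beam = [(score(seed_prompt), seed_prompt)]
    for _ in range(depth):
        candidates = list(beam)
        for _, prompt in beam:
            for child in expand(prompt):
                candidates.append((score(child), child))
        # Keep only the top-scoring prompts for the next round.
        candidates.sort(key=lambda sp: sp[0], reverse=True)
        beam = candidates[:beam_width]
    # Return the best prompt found.
    return beam[0][1]
```

With a real dev-set scorer, repeated expansion is how a 1-sentence seed grows into a pages-long, expert-level prompt.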

Zhiting Hu (@zhitinghu)'s Twitter Profile Photo

Sasha Rush: We have new work at #ICML2024 to learn latent auto-encoding for text. It improves VAEs and other DGMs quite a bit: arxiv.org/abs/2402.19009

The idea is to augment diffusion with a parameterized encoder-decoder (e.g., pretrained LLMs). Or, in an alternative view, it replaces the

Shibo Hao (@ber18791531)'s Twitter Profile Photo

Excited to share that our paper “LLM Reasoners: New Evaluation, Library, and Analysis of Step-by-Step Reasoning with Large Language Models” is accepted to Conference on Language Modeling 2024! arxiv.org/abs/2404.05221 #LLMReasoners #COLM #COLM24

Zhoujun (Jorge) Cheng (@chengzhoujun)'s Twitter Profile Photo

Happy to share that our agent engineering framework OpenAgents (github.com/xlang-ai/OpenA…) and LM tool-using survey (arxiv.org/2403.15452) are accepted by #COLM! See you at UPenn. And on a personal update, I feel excited and lucky that I will start my PhD at UCSD this fall

Han Guo (@hanguo97)'s Twitter Profile Photo

Introducing FLUTE, a CUDA kernel for non-uniformly quantized (via a lookup table) LLM Inference. It accelerates QLoRA's NormalFloat (NF) out of the box and more. As an application, we extended NF4 and are releasing quantized models for LLaMA-3 (8B/70B) and Gemma-2 (9B/27B).

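The lookup-table idea behind a kernel like FLUTE can be sketched in a few lines: weights are stored as small integer indices into a codebook of non-uniformly spaced values, and dequantization is a table lookup. This is plain Python for illustration, not the CUDA kernel, and the codebook below is arbitrary rather than NormalFloat.

```python
# Illustrative sketch of non-uniform (lookup-table) quantization.
# Each weight is replaced by the index of its nearest codebook entry;
# a fused kernel would perform the lookup inside the matmul.

def quantize(weights, codebook):
    # Map each weight to the index of the nearest codebook value.
    return [min(range(len(codebook)), key=lambda i: abs(codebook[i] - w))
            for w in weights]

def dequantize(indices, codebook):
    # The hot path: a pure table lookup, no arithmetic on the indices.
    return [codebook[i] for i in indices]
```

Because the codebook values need not be evenly spaced, the table can concentrate precision where weight values actually cluster, which uniform integer quantization cannot.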
Maitrix.org (@maitrixorg)'s Twitter Profile Photo

🥳Our work on Multimodal Theory of Mind evaluation won the #ACL2024 Outstanding Paper Award!

How well can machines 🤖 form a coherent mental picture 🧠 of humans from vision-language observations? Can machines understand humans' goals and beliefs?

Our MMToM-QA shows models

Samuel Albanie (@samuelalbanie)'s Twitter Profile Photo

Enjoyed this paper on LMs, world models and agent models by Zhiting Hu and Tianmin Shu

TLDR: for reasoning tasks, it’s a useful abstraction to treat LMs as simulators (“backends”) that simulate agent models and world models

arxiv.org/abs/2312.05230

Zhiting Hu (@zhitinghu)'s Twitter Profile Photo

Very interesting work simulating a digital game with a DOOM "world model"!

🌴 Pandora, which we created earlier, is a general-domain world model aiming to simulate diverse worlds, including digital games, interactively controlled by natural language. A larger and better version is