Xiuyu Li (@xiuyu_l) Twitter Tweets • TwiCopy

11 months ago

I have been waiting for this plot for a long time -- such a great reference to holistically assess the capabilities and limitations of those 'llama family' open-source models. Most importantly, it provides insights into the potential applications a self-hosted model is capable of

thumb_up_off_alt13

repeat2

account_circle

Baifeng

@baifeng_shi

11 months ago

Thanks AK for sharing! Our new work, RPT, can help your robot learn better by masked pre-training on sensorimotor sequences.

thumb_up_off_alt26

repeat9

account_circle

Zhuohan Li

@zhuohan123

11 months ago

🌟 Thrilled to introduce vLLM with Woosuk Kwon!

🚀 vLLM is an open-source LLM inference and serving library that accelerates HuggingFace Transformers by 24x and powers lmsys.org Vicuna and Chatbot Arena.

Github: github.com/vllm-project/v…
Blog: vllm.ai

account_circle

Forrest Iandola

@fiandola

11 months ago

If you'll be #CVPR2023 on Sunday morning, come check out the LOVEU workshop! We have competitions for video understanding and generative AI video editing.
sites.google.com/view/loveucvpr…

Thanks for reading, now enjoy a video of a horse on mars by the winning team:

thumb_up_off_alt6

repeat4

account_circle

Xiuyu Li

11 months ago

Check out breakthroughs in LLM quantization with our new work, which presents a sensitivity-aware non-uniform quantization scheme that outperforms existing methods at 3 & 4 bits, and our 4-bit quantized Vicuna matches FP baseline performance as evaluated by GPT-4! 🔥

thumb_up_off_alt12

repeat0

account_circle

Ji Lin

@jilin_14

1 year ago

SmoothQuant is good for W8A8 LLM quantization, what about low-bit weight-only quantization (e.g., W4A16)? We present Activation-aware Weight Quantization (AWQ) for LLM compression and acceleration: github.com/mit-han-lab/ll… 🧵

account_circle

Baifeng

@baifeng_shi

1 year ago

Humans pay attention to different objects when performing different tasks. Can vision transformer (ViT) do that as well?

In our recent work, we build a ViT with task-guided attention! 1/n

Visit our website to learn more: sites.google.com/view/absvit

account_circle

lmsys.org

@lmsysorg

1 year ago

Introducing Vicuna, an open-source chatbot impressing GPT-4!

🚀 Vicuna reaches 90%* quality of ChatGPT/Bard while significantly outperforming other baselines, according to GPT-4's assessment.

Blog: vicuna.lmsys.org
Demo: chat.lmsys.org

account_circle

Xiuyu Li

1 year ago

Excited to share our latest research GARNET, which improves GNN robustness on large-scale graphs with millions of nodes, and can serve as a plug-and-play module for various graphs and GNN backbones.

thumb_up_off_alt11

repeat0

account_circle

Xiuyu Li

1 year ago

Excited to meet with friends and have a Cornell University Artificial Intelligence reunion at my first in-person conference! Horace He

Excited to meet with friends and have a @cuai_cornell reunion at my first in-person conference! @cHHillee

thumb_up_off_alt21

repeat1

account_circle

Xiuyu Li

1 year ago

TorchSparse is really a great project to work on — if you are interested in learning more, please come and chat with us at MLSys 2022 :)

thumb_up_off_alt18

repeat0