Junyang Lin (@justinlin610)'s Twitter Profile
Junyang Lin

@justinlin610

Chief Evangelist Officer of Qwen Team ❤️ 🍵 ☕️ 🍷 🥃

ID: 4473952878

Link: https://www.linkedin.com/in/junyang-lin-0b2b38151/

Joined: 06-12-2015 10:28:42

1.1K Tweets

6.6K Followers

1.1K Following

Junyang Lin (@justinlin610)

Woooh finally finally finally it is here!!!!!!! Qwen2-VL! 2B and 7B openweight under Apache 2.0 and 72B for API temporarily! Check the SOTA performance on vision understanding benchmarks and check the new capabilities of understanding videos and playing as a visual agent! Have

Vaibhav (VB) Srivastav (@reach_vb)

Qwen 2VL 7B & 2B are here - Apache 2.0 licensed smol Vision Language Models competitive with GPT 4o mini - w/ video understanding, function calling and more! 🔥

> 72B (to be released later) beats 3.5 Sonnet & GPT 4o
> Can understand up to 20 min of video
> Handles arbitrary

Junyang Lin (@justinlin610)

Some notes. We failed at supporting llama.cpp (damn it is a bit too hard but we’ll try again) and thus temporarily no ollama or lmstudio for now

lmsys.org (@lmsysorg)

Does style matter over substance in Arena? Can models "game" human preference through lengthy and well-formatted responses?

Today, we're launching style control in our regression model for Chatbot Arena — our first step in separating the impact of style from substance in

Maziyar PANAHI (@maziyarpanahi)

Just made a Space on Hugging Face to demo the great Qwen2-VL-2B model just released by Qwen! 🔥 This thing is fast!!! Thanks to ZeroGPU and the fact that this model is only 2B in size! 🚀 PS: merve your Baklava test has passed! 🤗

Dylan Freedman (@dylfreed)

The new Qwen2-VL-7B Instruct model gets *100%* accuracy extracting text from this handwritten document. This is the first open weights model (Apache 2.0) that I've seen OCR this accurately. (Thank you Florent Daudens for the tip!) huggingface.co/spaces/Ganymed…

Junyang Lin (@justinlin610)

I just don't know what happened when we guys are still working on GitHub but last night suddenly QwenLM github org was flagged and since then contents have not been shown in the public. I advise you to turn to huggingface or our discord for contact temporarily.

Junyang Lin (@justinlin610)

Guys, we are still suffering from the problem, and feel free to turn to HF / ModelScope or our Discord server for discussion!

Xiang Yue (@xiangyue96)

I personally think the key factor may not be the "reflection" technique itself, but rather the additional tokens it forces the model to generate. This concept could be connected to [PAUSE] tokens, as described in arxiv.org/pdf/2310.02226. By generating more tokens, the model