Minjoon Seo (@seo_minjoon)'s Twitter Profile
Minjoon Seo

@seo_minjoon

Assistant Professor @kaist_ai

ID: 715563582

Link: https://seominjoon.github.io
Joined: 25-07-2012 05:56:10

168 Tweets

1.1K Followers

562 Following

hyunji amy lee (@hyunji_amy_lee):

When training dense retrieval models on large collections, what strategies yield strong out-of-domain (OOD) retrieval? Resource-intensive methods like data augmentation & pretraining help... But what about the training strategy itself 🤔

In our work, we show 3️⃣ ingredients 🧑‍🍳 for a great retrieval recipe 🍝
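
For readers unfamiliar with the setup the thread refers to, the sketch below shows the standard contrastive training step for a dense retriever with in-batch negatives. The toy encoder, batch shapes, and temperature are illustrative placeholders, not the recipe from the paper.

```python
# Minimal sketch of dense-retriever training with in-batch negatives.
# The toy encoder stands in for a real transformer; all sizes are placeholders.
import torch
import torch.nn as nn
import torch.nn.functional as F

class ToyEncoder(nn.Module):
    def __init__(self, vocab_size=1000, dim=128):
        super().__init__()
        self.emb = nn.EmbeddingBag(vocab_size, dim)  # bag-of-tokens stand-in

    def forward(self, token_ids):
        return F.normalize(self.emb(token_ids), dim=-1)

encoder = ToyEncoder()
optimizer = torch.optim.AdamW(encoder.parameters(), lr=1e-4)

# Fake batch: query i is paired with passage i; every other passage in the
# batch acts as a negative.
queries = torch.randint(0, 1000, (8, 32))
passages = torch.randint(0, 1000, (8, 64))

q, p = encoder(queries), encoder(passages)
scores = q @ p.T / 0.05                       # similarity matrix with temperature
loss = F.cross_entropy(scores, torch.arange(len(q)))
loss.backward()
optimizer.step()
```
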
hyunji amy lee (@hyunji_amy_lee):

🤕 Have you ever wondered if LMs generate responses based on your provided input context?

🤨 How can I ensure that the LM is giving me a response based on my context?

We introduce Strict Grounding, along with a dataset and metric, to evaluate whether an LM truly grounds its responses in the input context.
Seungone Kim (@seungonekim):

🤔How can you evaluate whether your Vision-Language Model (VLM) is approaching the capabilities of GPT-4V?

We’re excited to present 🔥Prometheus-Vision, the first open-source VLM specialized for evaluating other VLMs based on fine-grained scoring criteria, with co-lead…
Dongkeun Yoon (@dongkeun_yoon):

❗New multilingual paper❗

🤔LMs that are good at reasoning are mostly English-centric (MetaMath, Orca 2, etc.).
😃Let’s adapt them to solve multilingual tasks. BUT without using multilingual data!

We present LangBridge, a zero-shot approach to adapt LMs for multilingual reasoning.
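
As I understand LangBridge (the mechanism is not spelled out in the tweet itself), a frozen multilingual encoder is attached to the English-centric reasoning LM through a small trained projection, so the adaptation needs no multilingual training data. The toy modules below only illustrate that bridging idea; they are not the released implementation.

```python
# Hedged sketch of the "bridge" idea: frozen multilingual encoder -> trained
# linear projection -> soft prompts prepended to the reasoning LM's inputs.
import torch
import torch.nn as nn

enc_dim, lm_dim, vocab = 512, 1024, 32000

multilingual_encoder = nn.TransformerEncoder(            # stand-in for e.g. an mT5 encoder
    nn.TransformerEncoderLayer(enc_dim, nhead=8, batch_first=True), num_layers=2)
bridge = nn.Linear(enc_dim, lm_dim)                      # the only trained component here
lm_embeddings = nn.Embedding(vocab, lm_dim)              # stand-in for the English-centric LM

for p in multilingual_encoder.parameters():              # encoder stays frozen
    p.requires_grad_(False)

src = torch.randn(1, 20, enc_dim)                        # placeholder multilingual encoder input
soft_prompts = bridge(multilingual_encoder(src))         # (1, 20, lm_dim)
prompt_ids = torch.randint(0, vocab, (1, 10))            # placeholder English prompt tokens
lm_inputs = torch.cat([soft_prompts, lm_embeddings(prompt_ids)], dim=1)
# `lm_inputs` would then be fed to the LM via its `inputs_embeds` interface.
```
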
Sungdong Kim (@sungdongkim4):

🤔 Do we always need human preference data for effective LLM alignment after the SFT stage? Our answer is NO 🙅‍♂️

We present a ✨preference-free alignment approach✨, leveraging an off-the-shelf retriever with effective regularizer functions: Regularized Relevance Reward (R^3). [1/n]
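
To make "retriever as reward" concrete, here is an illustrative-only sketch: the relevance score of an off-the-shelf retriever serves as the reward, combined with a simple regularizer. The actual regularizer functions in R^3 are not described in the tweet and may well differ.

```python
# Illustrative reward in the spirit of a retriever-based, preference-free signal.
# The length regularizer is a placeholder assumption, not the paper's definition.
import torch
import torch.nn.functional as F

def relevance_reward(query_emb, response_emb, response_len,
                     max_len=256, length_penalty=0.1):
    relevance = F.cosine_similarity(query_emb, response_emb, dim=-1)  # retriever relevance
    reg = length_penalty * max(0, response_len - max_len) / max_len   # discourage bloated responses
    return relevance - reg

# Placeholder embeddings from any off-the-shelf dense retriever.
q_emb, r_emb = torch.randn(768), torch.randn(768)
reward = relevance_reward(q_emb, r_emb, response_len=300)
```
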
hyunji amy lee (@hyunji_amy_lee):

New preprint "Semiparametric Token-Sequence Co-Supervision" We introduce semiparametric token-sequence co-supervision, which trains LM by simultaneously leveraging supervision from a parametric token and a nonparametric sequence embedding space. arxiv.org/abs/2403.09024

New preprint "Semiparametric Token-Sequence Co-Supervision"   

We introduce semiparametric token-sequence co-supervision, which trains an LM by simultaneously leveraging supervision from both a parametric token embedding space and a nonparametric sequence embedding space.

arxiv.org/abs/2403.09024
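
A rough sketch of the objective as described above: a standard next-token loss over the parametric token space plus a contrastive loss in a nonparametric sequence-embedding space. The loss weighting and how the sequence embedding is obtained are assumptions for illustration, not the paper's exact setup.

```python
# Sketch of token-level + sequence-level co-supervision; shapes are placeholders.
import torch
import torch.nn.functional as F

def co_supervision_loss(token_logits, token_targets,
                        seq_emb, positive_emb, negative_embs, alpha=1.0):
    # Parametric supervision: next-token cross-entropy.
    lm_loss = F.cross_entropy(token_logits.flatten(0, 1), token_targets.flatten())
    # Nonparametric supervision: pull the sequence embedding toward the gold
    # passage embedding and away from other candidates.
    candidates = torch.cat([positive_emb.unsqueeze(0), negative_embs], dim=0)
    scores = F.cosine_similarity(seq_emb.unsqueeze(0), candidates, dim=-1) / 0.05
    seq_loss = F.cross_entropy(scores.unsqueeze(0), torch.zeros(1, dtype=torch.long))
    return lm_loss + alpha * seq_loss

# Placeholder shapes: (batch=2, seq=16, vocab=100) logits; 768-dim embeddings.
loss = co_supervision_loss(torch.randn(2, 16, 100), torch.randint(0, 100, (2, 16)),
                           torch.randn(768), torch.randn(768), torch.randn(4, 768))
```
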
Wenhao Yu (@wyu_nd):

📢 Excited to share that we will organize the 3rd workshop on Knowledge-Augmented NLP at ACL 2024. We will have six amazing speakers! We welcome your submissions and invite you to discuss with our speakers and organizers at the workshop. Looking forward to seeing you in Thailand!
Hyeonbin Hwang (@ronalhwang):

🚨 New LLM Reasoning Paper 🚨

Q. How can LLMs self-improve their reasoning ability?

⇒ Introducing Self-Explore⛰️🧭, a training method specifically designed to help LLMs avoid reasoning pits by learning from their own outputs! [1/N]
Ai2 (@allen_ai):

Announcing our latest addition to the OLMo family, OLMo 1.7! 🎉 Our team's efforts to improve data quality, training procedures, and model architecture have led to a leap in performance. See how OLMo 1.7 stacks up against its peers and peek into the technical details on the blog:
Twelve Labs (twelvelabs.io) (@twelve_labs):

🚀 We're excited to share the technical report of Pegasus-1, our 17B-parameter VLM, setting new benchmarks in video understanding.

It surpasses larger models like Gemini Pro and Ultra in video conversation, QA, summarization, and temporal understanding.

bit.ly/pegasus-1-tech…
Seungone Kim (@seungonekim):

#NLProc
Introducing 🔥Prometheus 2, an open-source LM specialized in evaluating other language models.

✅Supports both direct assessment & pairwise ranking.
✅ Improved evaluation capabilities compared to its predecessor.
✅Can assess based on user-defined evaluation criteria.
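
To show what "user-defined evaluation criteria" can look like in practice, here is a generic direct-assessment prompt for an evaluator LM. The layout below is a hypothetical illustration, not the official Prometheus 2 prompt format.

```python
# Generic rubric-based direct-assessment prompt for an evaluator LM.
# Template and field names are assumptions; see the official repo for the real format.
TEMPLATE = """You are an impartial evaluator.

### Instruction:
{instruction}

### Response to evaluate:
{response}

### Score rubric (user-defined criteria):
{rubric}

Give feedback, then output a score from 1 to 5 as "[RESULT] <score>"."""

prompt = TEMPLATE.format(
    instruction="Explain why the sky is blue.",
    response="Because of Rayleigh scattering of sunlight by air molecules.",
    rubric="Is the explanation scientifically accurate and concise?",
)
# `prompt` would then be sent to the evaluator LM for feedback and a score.
```
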
Seongyun Lee (@sylee_ai):

🚨 New LLM personalization/alignment paper 🚨

🤔 How can we obtain personalizable LLMs without explicitly re-training reward models/LLMs for each user?

✔ We introduce a new zero-shot alignment method to control LLM responses via the system message 🚀
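
As a minimal illustration of the control channel the tweet refers to, the snippet below steers a chat LM through its system message alone, with no per-user retraining. The model name and chat-template call are generic Hugging Face usage, not code from the paper.

```python
# Per-user preferences live only in the system message; nothing is retrained.
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("HuggingFaceH4/zephyr-7b-beta")  # placeholder chat model

messages = [
    {"role": "system",
     "content": "Answer concisely, avoid jargon, and prefer bullet points."},  # this user's preferences
    {"role": "user",
     "content": "How does dense retrieval differ from BM25?"},
]
prompt = tokenizer.apply_chat_template(messages, tokenize=False, add_generation_prompt=True)
# `prompt` is then passed to the model's generate() call as usual.
```
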
Seungone Kim (@seungonekim):

🤔How can we systematically assess an LM's proficiency in a specific capability without using summary measures like helpfulness or simple proxy tasks like multiple-choice QA?

Introducing the ✨BiGGen Bench, a benchmark that directly evaluates nine core capabilities of LMs.
Hoyeon Chang (@hoyeon_chang):

🚨 New paper 🚨
How Do Large Language Models Acquire Factual Knowledge During Pretraining?

I’m thrilled to announce the release of my new paper! 🎉

This research explores how LLMs acquire and retain factual knowledge during pretraining. Here are some key insights:
Doyoung Kim (@doyoungkim_ml):

🤔 Humans excel at generalizing plans to extrapolated settings and at adapting rapidly from limited training data. Can language models do the same?
Introducing 🧠Cognitive Map for Language Models, a framework achieving Optimal Planning via Verbally Representing the World Model🌍
Alice Oh (@aliceoh):

We are hosting wonderful NLP colleagues at KAIST on their way to ACL Bangkok! 🤩 On-site registration is closed, but the talks will be broadcast on Zoom. Please join us!

Date/Time: Aug 10, 2024, 10:05-12:30 KST (UTC+9)
Parallel Session 1: Advanced Language Models and AI