Soham Deshmukh (@sohamdesh_) Twitter Tweets • TwiDoom

Soham Deshmukh

@sohamdesh_

+ Follow

speech and audio @Microsoft

ID: 3284204886

linkhttps://soham97.github.io calendar_today19-07-2015 11:11:24

80 Tweet

167 Followers

236 Following

Soham Deshmukh

@sohamdesh_

10 months ago

Nice work and brings us one step closer to unifying speech and audio models!

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Soham Deshmukh

@sohamdesh_

10 months ago

I'll be in New Orleans next week for NeurIPS. Happy to chat about audio & speech processing, or applied research in the industry. Feel free to DM me to chat! #NeurIPS2023

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Soham Deshmukh

@sohamdesh_

9 months ago

Spotted Yann LeCun in the wild at NeurIPS! Amazing ideas and discussions around self-supervised learning #NeurIPS2023

Spotted <a href="/ylecun/">Yann LeCun</a> in the wild at NeurIPS! Amazing ideas and discussions around self-supervised learning #NeurIPS2023

thumb_up_off_alt12

chat_bubble_outline0

repeat0

shareShare

The 6 pages of responsible NLP checklist is a lot. Though some of the questions were a helpful reminder to add details, however, I would have preferred a smaller targeted checklist where the intended audience and the checklist’s impact was clearly specified.

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Zachary Lipton

@zacharylipton

9 months ago

Just learned despite everyone voting down *CL's 🤡-y arxiv embargo policy, it's still firmly in place for ACL 2024. If *CL were a company, the board & leadership wd be fired, the talent wd've left 5 years ago, the common stock wd be worth $0, & WSB wd be taking an interest.

thumb_up_off_alt126

chat_bubble_outline10

repeat12

shareShare

Joan Serrà

@serrjoa

8 months ago

Paper proposing to leverage audio-language models to measure audio quality in multiple tasks (TTA, TTM, speech denoising, etc.), in a reference-free way. One of the tricks is to compare against multiple *and opposite* quality-related text prompts. arxiv.org/abs/2402.00282

thumb_up_off_alt50

chat_bubble_outline1

repeat8

shareShare

arXiv Sound

@arxivsound

8 months ago

``PAM: Prompting Audio-Language Models for Audio Quality Assessment,'' Soham Deshmukh Dareen Alharthi Benjamin Elizalde Hannes Gamper Mahmoud Al Ismail Rita Singh Bhiksha Raj Huaming Wang … ift.tt/6HIcVpn

thumb_up_off_alt20

chat_bubble_outline0

repeat6

shareShare

Microsoft Research

@msftresearch

6 months ago

In this issue: LLMs in the Imaginarium: Tool Learning through simulated trial and error; Benchmarking LLMs across languages, modalities, models and tasks; Training audio captioning models without audio. msft.it/6016cIfvY

thumb_up_off_alt38

chat_bubble_outline2

repeat10

shareShare

arXiv Sound

@arxivsound

2 months ago

``Audio Entailment: Assessing Deductive Reasoning for Audio Understanding,'' Soham Deshmukh, Shuo Han, Hazim Bukhari, Benjamin Elizalde, Hannes Gamper, Rita Singh, Bhiksha Raj, ift.tt/mMbigfn

thumb_up_off_alt4

chat_bubble_outline0

repeat1

shareShare

Anurag Kumar

@acouintel

4 days ago

We have started hiring research interns for 2025. I am looking for a PhD student with background in multimodal generation/understanding. metacareers.com/jobs/374372872…. Flexible with the timing but summers are usually the best :) #internship2025 #multimodal #audio #speech AI at Meta

thumb_up_off_alt147

chat_bubble_outline2

repeat32

shareShare