Soham Deshmukh (@sohamdesh_) 's Twitter Profile
Soham Deshmukh

@sohamdesh_

speech and audio @Microsoft

ID: 3284204886

linkhttps://soham97.github.io calendar_today19-07-2015 11:11:24

80 Tweet

167 Followers

236 Following

Soham Deshmukh (@sohamdesh_) 's Twitter Profile Photo

I'll be in New Orleans next week for NeurIPS. Happy to chat about audio & speech processing, or applied research in the industry. Feel free to DM me to chat! #NeurIPS2023

Soham Deshmukh (@sohamdesh_) 's Twitter Profile Photo

The 6 pages of responsible NLP checklist is a lot. Though some of the questions were a helpful reminder to add details, however, I would have preferred a smaller targeted checklist where the intended audience and the checklist’s impact was clearly specified.

Zachary Lipton (@zacharylipton) 's Twitter Profile Photo

Just learned despite everyone voting down *CL's 🤡-y arxiv embargo policy, it's still firmly in place for ACL 2024. If *CL were a company, the board & leadership wd be fired, the talent wd've left 5 years ago, the common stock wd be worth $0, & WSB wd be taking an interest.

Joan SerrĂ  (@serrjoa) 's Twitter Profile Photo

Paper proposing to leverage audio-language models to measure audio quality in multiple tasks (TTA, TTM, speech denoising, etc.), in a reference-free way. One of the tricks is to compare against multiple *and opposite* quality-related text prompts. arxiv.org/abs/2402.00282

Paper proposing to leverage audio-language models to measure audio quality in multiple tasks (TTA, TTM, speech denoising, etc.), in a reference-free way. One of the tricks is to compare against multiple *and opposite* quality-related text prompts.

arxiv.org/abs/2402.00282
arXiv Sound (@arxivsound) 's Twitter Profile Photo

``PAM: Prompting Audio-Language Models for Audio Quality Assessment,'' Soham Deshmukh Dareen Alharthi Benjamin Elizalde Hannes Gamper Mahmoud Al Ismail Rita Singh Bhiksha Raj Huaming Wang … ift.tt/6HIcVpn

Microsoft Research (@msftresearch) 's Twitter Profile Photo

In this issue: LLMs in the Imaginarium: Tool Learning through simulated trial and error; Benchmarking LLMs across languages, modalities, models and tasks; Training audio captioning models without audio. msft.it/6016cIfvY

In this issue: LLMs in the Imaginarium: Tool Learning through simulated trial and error; Benchmarking LLMs across languages, modalities, models and tasks; Training audio captioning models without audio. msft.it/6016cIfvY
arXiv Sound (@arxivsound) 's Twitter Profile Photo

``Audio Entailment: Assessing Deductive Reasoning for Audio Understanding,'' Soham Deshmukh, Shuo Han, Hazim Bukhari, Benjamin Elizalde, Hannes Gamper, Rita Singh, Bhiksha Raj, ift.tt/mMbigfn

Anurag Kumar (@acouintel) 's Twitter Profile Photo

We have started hiring research interns for 2025. I am looking for a PhD student with background in multimodal generation/understanding. metacareers.com/jobs/374372872…. Flexible with the timing but summers are usually the best :) #internship2025 #multimodal #audio #speech AI at Meta