Peymàn M. Kiasari (@pkiasari) 's Twitter Profile
Peymàn M. Kiasari

@pkiasari

ML researcher and engineer

ID: 1432193514226556930

calendar_today30-08-2021 04:09:21

60 Tweet

11 Followers

62 Following

William Merrill (@lambdaviking) 's Twitter Profile Photo

✨Excited to finally drop our new paper: SSMs “look like” RNNs, but we show their statefulness is an illusion🪄🐇 Current SSMs cannot express basic state tracking, but a minimal change fixes this! 👀 w/ jackson petty, Ashish Sabharwal arxiv.org/abs/2404.08819

✨Excited to finally drop our new paper: SSMs “look like” RNNs, but we show their statefulness is an illusion🪄🐇

Current SSMs cannot express basic state tracking, but a minimal change fixes this! 👀

w/ <a href="/jowenpetty/">jackson petty</a>, <a href="/Ashish_S_AI/">Ashish Sabharwal</a>
arxiv.org/abs/2404.08819
PyTorch (@pytorch) 's Twitter Profile Photo

Announcing the alpha release of torchtune! torchtune is a PyTorch-native library for fine-tuning LLMs. It combines hackable memory-efficient fine-tuning recipes with integrations into your favorite tools. Get started fine-tuning today! Details: hubs.la/Q02t214F0

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

Google announces Med-Gemini, a family of Gemini models fine-tuned for medical tasks! 🔬 Achieves SOTA on 10 of the 14 benchmarks, spanning text, multimodal & long-context applications. Surpasses GPT-4 on all benchmarks! This paper is super exciting, let's dive in ↓

Google announces Med-Gemini, a family of Gemini models fine-tuned for medical tasks! 🔬

Achieves SOTA on 10 of the 14 benchmarks, spanning text, multimodal &amp;  long-context applications. 

Surpasses GPT-4 on all benchmarks!

This paper is super exciting, let's dive in ↓
Peymàn M. Kiasari (@pkiasari) 's Twitter Profile Photo

We'll be presenting our paper (arxiv.org/abs/2401.14469) at #ICLR2024 tomorrow! Check us out at Hall B, poster #16 (10:45 a.m. – 12:45 p.m). If you are presenting tomorrow too let me know.

We'll be presenting our paper (arxiv.org/abs/2401.14469) at #ICLR2024 tomorrow!

Check us out at Hall B, poster #16 (10:45 a.m. – 12:45 p.m).

If you are presenting tomorrow too let me know.
Sepp Hochreiter (@hochreitersepp) 's Twitter Profile Photo

I am so excited that xLSTM is out. LSTM is close to my heart - for more than 30 years now. With xLSTM we close the gap to existing state-of-the-art LLMs. With NXAI we have started to build our own European LLMs. I am very proud of my team. arxiv.org/abs/2405.04517

Peymàn M. Kiasari (@pkiasari) 's Twitter Profile Photo

Excited to announce the launch of our new dataset website! 🚀 Check it out Visual Graph Arena at: vga.csail.mit.edu Our paper is currently under review and will be available after publication. Stay tuned! #MachineLearning

Peymàn M. Kiasari (@pkiasari) 's Twitter Profile Photo

Attended #ICML2024 poster session today. Noticed just a few papers on CNNs! it seems everyone is shifting focus to Transformers.

Valeriy M., PhD, MBA, CQF (@predict_addict) 's Twitter Profile Photo

I was highly sceptical of the paper “Large language models are zero-shot time series forecasters.“ from NYU when it came out. Somehow the paper was published at NeurIPS, but as often the reality and the claims in transformer and LLM papers for time series are vastly different.

I was highly sceptical of the paper “Large language models are zero-shot time series forecasters.“ from NYU when it came out. Somehow the paper was published at NeurIPS, but as often the reality and the claims in transformer and LLM papers for time series are vastly different.
Peymàn M. Kiasari (@pkiasari) 's Twitter Profile Photo

Trying my best to create a vintage effect for a chart in Matplotlib! 😄 I want it to clearly look like it's from an old paper, and it is not our result. I hope the reviewers wouldn't think it's unprofessional!

Trying my best to create a vintage effect for a chart in Matplotlib! 😄 I want it to clearly look like it's from an old paper, and it is not our result.

I hope the reviewers wouldn't think it's unprofessional!