Peymàn M. Kiasari (@pkiasari) Twitter Tweets • TwiDoom

Dimitris Papailiopoulos

@dimitrispapail

5 months ago

This is kinda interesting. ChatGPT gets tricked by some of the same color illusions as we do.

thumb_up_off_alt1,1K

chat_bubble_outline71

repeat80

shareShare

✨Excited to finally drop our new paper: SSMs “look like” RNNs, but we show their statefulness is an illusion🪄🐇 Current SSMs cannot express basic state tracking, but a minimal change fixes this! 👀 w/ jackson petty, Ashish Sabharwal arxiv.org/abs/2404.08819

thumb_up_off_alt1,1K

chat_bubble_outline24

repeat206

shareShare

PyTorch

@pytorch

5 months ago

Announcing the alpha release of torchtune! torchtune is a PyTorch-native library for fine-tuning LLMs. It combines hackable memory-efficient fine-tuning recipes with integrations into your favorite tools. Get started fine-tuning today! Details: hubs.la/Q02t214F0

thumb_up_off_alt1,1K

chat_bubble_outline20

repeat298

shareShare

Peymàn M. Kiasari

@pkiasari

5 months ago

I wish @pytorch supported complex neural networks too.

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Peymàn M. Kiasari

@pkiasari

5 months ago

It's 45TB!

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

5 months ago

Google announces Med-Gemini, a family of Gemini models fine-tuned for medical tasks! 🔬 Achieves SOTA on 10 of the 14 benchmarks, spanning text, multimodal & long-context applications. Surpasses GPT-4 on all benchmarks! This paper is super exciting, let's dive in ↓

thumb_up_off_alt1,1K

chat_bubble_outline21

repeat208

shareShare

Peymàn M. Kiasari

@pkiasari

5 months ago

We'll be presenting our paper (arxiv.org/abs/2401.14469) at #ICLR2024 tomorrow! Check us out at Hall B, poster #16 (10:45 a.m. – 12:45 p.m). If you are presenting tomorrow too let me know.

thumb_up_off_alt4

chat_bubble_outline0

repeat0

shareShare

Sepp Hochreiter

@hochreitersepp

5 months ago

I am so excited that xLSTM is out. LSTM is close to my heart - for more than 30 years now. With xLSTM we close the gap to existing state-of-the-art LLMs. With NXAI we have started to build our own European LLMs. I am very proud of my team. arxiv.org/abs/2405.04517

thumb_up_off_alt1,1K

chat_bubble_outline48

repeat382

shareShare

Peymàn M. Kiasari

@pkiasari

2 months ago

Excited to announce the launch of our new dataset website! 🚀 Check it out Visual Graph Arena at: vga.csail.mit.edu Our paper is currently under review and will be available after publication. Stay tuned! #MachineLearning

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Peymàn M. Kiasari

@pkiasari

2 months ago

Attended #ICML2024 poster session today. Noticed just a few papers on CNNs! it seems everyone is shifting focus to Transformers.

thumb_up_off_alt6

chat_bubble_outline0

repeat0

shareShare

Peymàn M. Kiasari

@pkiasari

2 months ago

Just received #NeurIPS2024 reviews. They are so brutal I don't even understand why! 🫠

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Valeriy M., PhD, MBA, CQF

@predict_addict

a month ago

I was highly sceptical of the paper “Large language models are zero-shot time series forecasters.“ from NYU when it came out. Somehow the paper was published at NeurIPS, but as often the reality and the claims in transformer and LLM papers for time series are vastly different.

thumb_up_off_alt30

chat_bubble_outline2

repeat13

shareShare

Peymàn M. Kiasari

@pkiasari

a month ago

Trying my best to create a vintage effect for a chart in Matplotlib! 😄 I want it to clearly look like it's from an old paper, and it is not our result. I hope the reviewers wouldn't think it's unprofessional!

thumb_up_off_alt3

chat_bubble_outline0

repeat0

shareShare

Peymàn M. Kiasari

Dimitris Papailiopoulos

William Merrill

PyTorch

Peymàn M. Kiasari

Peymàn M. Kiasari

Tanishq Mathew Abraham, Ph.D.

Peymàn M. Kiasari

Sepp Hochreiter

Peymàn M. Kiasari

Peymàn M. Kiasari

Peymàn M. Kiasari

Valeriy M., PhD, MBA, CQF

Peymàn M. Kiasari