Lucio Dery Jnr Mwinm (@derylucio) Twitter Tweets • TwiCopy

Lucio Dery Jnr Mwinm

@derylucio

ID:2998763504

Website: https://ldery.github.io/

Joined: 27-01-2015 22:31:54

243 Tweets

474 Followers

956 Following

Simran Khanuja

1 month ago

Ever noticed how Pixar adapts movies for international markets? The beloved newscaster in Zootopia is a jaguar in Brazil, a panda in China, a koala in Australia …

While machine translation (MT) has only dealt with language in speech/text thus far, we extend the scope of MT to…

222 likes

Lucio Dery Jnr Mwinm

1 month ago

Check out our work on adapting multitask learning as a tool against worst-case group error.

Our modified MTL approach (main task + pre-training auxiliary objective + L1 embedding regularization) is competitive with bespoke DRO (Distributionally Robust Optimization) methods.

17 likes

Arthur Douillard

1 month ago

I'm super excited to release DiPaCo, a new kind of mixture of experts, that can scale engineering-wise to data centers across the entire world!

A few words about it in this thread 🧵

185 likes

Junhong Shen

1 month ago

Introducing Unified PDE Solvers (UPS), a step towards efficiently building foundation models for PDE solvers (arxiv.org/abs/2403.07187)! Starting from a pretrained LM, UPS tackles diverse spatiotemporal PDEs with SOTA accuracy, using ~20x less data and a single A6000! 🧵[1/x]

105 likes

Victor Akinwande

2 months ago

For large-scale causal discovery, there's no need to use NOTEARS for its speed. Consider using LiNGAM. We've parallelized it, achieving a 32x speed-up on GPUs.

NOTEARS:
Scalable: ✅
Identifiability guarantees: ❌

AcceleratedLiNGAM:
Scalable: ✅
Identifiability guarantees: ✅

41 likes

AIMS South Africa

3 months ago

Applications are now open for our AI for Science Master's program 🧪!

Thanks to Google DeepMind there are 40 full scholarships available to students from across Africa 🎓. The cherry on top - 1:1 mentorship by Google DeepMind researchers 🍒

More details: ai.aims.ac.za

215 likes

Lucio Dery Jnr Mwinm

4 months ago

Apply to work with Patrick Emedom-Nnamdi!
His startup is up to some cool stuff 🤫 -- so if you're into ML and healthcare, please reach out to him!

4 likes

Steven Kolawole

@_stevenkolawole

5 months ago

I've read ~50 SOPs (via STEM for development, my personal network, random email requests, and my recent Twitter form) over the past 2 months, and these are COMMON MISTAKES I've noticed graduate school applicants making quite often:

[A thread]

164 likes

Lucio Dery Jnr Mwinm

5 months ago

Graham is amazing!
Come work with him.

16 likes

Lucio Dery Jnr Mwinm

5 months ago

Check out this amazing effort by Simran Khanuja! Labeled data for MT isn't magicked into existence: you must choose which points to label (under annotation constraints) to help your target language. Simran's work provides insights into which strategies work and when!

11 likes

Graham Neubig

6 months ago

I am running (unopposed?!) for chair of the NAACL board: naacl.org/elections/2024…

My goals:
1. Maintain the high quality of NAACL's content
2. Make sure NAACL welcomes researchers of all backgrounds

If you are a NAACL member, I would appreciate your support; check your email!

182 likes

Victor Akinwande

7 months ago

The dominant paradigm in deep learning is to build larger-sized models. In recent work, we show that the statistical guarantees of such models do not necessarily suffer as a result of their sheer size.

arxiv.org/abs/2310.03957

🧵

116 likes

Amanda Bertsch

7 months ago

Now accepted to NeurIPS'23! Looking forward to talking to folks in New Orleans 🎉

67 likes

Alexis Ross

8 months ago

*⃣ Resource alert for people applying to CS PhD programs this cycle *⃣ cs-sop.org contains >60 example statements of purpose! It's made possible by the many generous submissions from new applicants, and new ones are always welcome! 😊

91 likes

Asher Trockman

9 months ago

How much of the value of pre-training comes from merely serving as a *good initialization*? 🤔

We propose a simple, effective, and compute-free initialization method for Transformers (esp. vision) that *mimics* pre-training.

See our #ICML oral at 3pm today (313)! w/ Zico Kolter

122 likes

Lucio Dery Jnr Mwinm

9 months ago

Amazing, insightful thread by Delip Rao e/σ on understanding the motivations of the parties of interest behind the different framings of LLM access and capabilities.

0 likes

Lucio Dery Jnr Mwinm

10 months ago

Congrats Kayo Yin 🇦🇹 and Patrick Fernandes!!!!

6 likes

Accepted papers at TMLR

10 months ago

The Vendi Score: A Diversity Evaluation Metric for Machine Learning

Dan Friedman, Adji Bousso Dieng.

Action editor: Antonio Vergari.

openreview.net/forum?id=g97OH…

#diversity #diverse #similarity

31 likes