Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile
Tanishq Mathew Abraham, Ph.D.

@iscienceluvr

PhD at 19 |
Founder and CEO at @MedARC_AI |
Research Director at @StabilityAI |
@kaggle Notebooks GM |
Biomed. engineer @ 14 |
TEDx talk➡bit.ly/3tpAuan

ID: 441465751

linkhttps://tanishq.ai calendar_today20-12-2011 03:45:50

14,14K Tweet

60,60K Followers

1,1K Following

Tanishq Mathew Abraham, Ph.D. (@iscienceluvr) 's Twitter Profile Photo

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens abs: arxiv.org/abs/2406.11271 A new interleaved multimodal pretraining dataset, consists of one trillion text tokens and three billion images, a 10x scale-up from existing

MINT-1T: Scaling Open-Source Multimodal Data by 10x: A Multimodal Dataset with One Trillion Tokens

abs: arxiv.org/abs/2406.11271

A new interleaved multimodal pretraining dataset, consists of one trillion text tokens and three billion images, a 10x scale-up from existing