Michael Tschannen (@mtschannen)'s Twitter Profile
Michael Tschannen

@mtschannen

Machine learning researcher @GoogleDeepMind. Past: @Apple, @awscloud AI, @ETH_en. Multimodal/representation learning.

ID: 597511633

Website: https://mitscha.github.io · Joined: 02-06-2012 15:38:44

165 Tweets

1.1K Followers

622 Following

Michael Tschannen (@mtschannen):

By popular demand, we updated the GIVT arxiv with a derivation of the loss function and additional details: arxiv.org/abs/2312.02116. We're also working on the code release - stay tuned!

Daniel Keysers (@keysers):

Very interesting work by some colleagues in Google DeepMind: Oliver and Hartmut present a new JAX library for easy and fast machine learning under E(3)-equivariance, e.g., for point clouds in 3D space. paper: arxiv.org/abs/2401.07595 code: github.com/google-researc…
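
Not the library's API, but a minimal JAX sketch of what E(3)-invariance means for point-cloud features: a descriptor built from pairwise distances is unchanged by rotations/reflections and translations, which can be checked numerically.

```python
# Illustration only (not the library's API): a point-cloud descriptor built from
# pairwise distances is E(3)-invariant, i.e. unchanged by rotations/reflections
# and translations. We verify this numerically with a random orthogonal matrix.
import jax
import jax.numpy as jnp

def pairwise_distance_feature(points):
    # points: (N, 3) array -> sorted vector of pairwise distances.
    diffs = points[:, None, :] - points[None, :, :]
    dists = jnp.sqrt(jnp.sum(diffs ** 2, axis=-1) + 1e-12)
    return jnp.sort(dists[jnp.triu_indices(points.shape[0], k=1)])

def random_orthogonal(key):
    # QR decomposition of a Gaussian matrix yields an orthogonal 3x3 matrix.
    q, _ = jnp.linalg.qr(jax.random.normal(key, (3, 3)))
    return q

k1, k2, k3 = jax.random.split(jax.random.PRNGKey(0), 3)
pts = jax.random.normal(k1, (8, 3))
R, t = random_orthogonal(k2), jax.random.normal(k3, (3,))

f_original = pairwise_distance_feature(pts)
f_transformed = pairwise_distance_feature(pts @ R.T + t)
print(jnp.max(jnp.abs(f_original - f_transformed)))  # ~0: invariant under E(3)
```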

Ibrahim Alabdulmohsin | إبراهيم العبدالمحسن (@ibomohsin):

How is next-token prediction capable of such intelligent behavior? I’m very excited to share our work, where we study the fractal structure of language. TLDR: thinking of next-token prediction in language as “word statistics” is a big oversimplification! arxiv.org/abs/2402.01825

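For intuition about the kind of self-similarity analysis involved, here is a minimal rescaled-range (R/S) Hurst-exponent estimator in numpy applied to a synthetic surprisal (per-token negative log-likelihood) series; everything below is illustrative, and the paper's estimators and data differ.

```python
# Minimal rescaled-range (R/S) Hurst estimator applied to a synthetic
# "surprisal" (per-token negative log-likelihood) series. Illustrative only;
# the paper's estimators and data differ.
import numpy as np

def hurst_rs(x, window_sizes=(8, 16, 32, 64, 128, 256)):
    x = np.asarray(x, dtype=float)
    log_n, log_rs = [], []
    for n in window_sizes:
        ratios = []
        for start in range(0, len(x) - n + 1, n):
            w = x[start:start + n]
            z = np.cumsum(w - w.mean())   # mean-adjusted cumulative sum
            if w.std() > 0:
                ratios.append((z.max() - z.min()) / w.std())
        log_n.append(np.log(n))
        log_rs.append(np.log(np.mean(ratios)))
    # Slope of log(R/S) against log(window size) estimates the Hurst exponent H.
    return np.polyfit(log_n, log_rs, 1)[0]

rng = np.random.default_rng(0)
iid_series = rng.normal(size=4096)      # memoryless baseline: H close to 0.5
print(round(hurst_rs(iid_series), 2))   # long-range dependence would push H toward 1
```
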
Olivier Bachem (@olivierbachem):

Super excited about this work where we improve the MusicLM text-to-music model with RL and user feedback and see substantial gains in performance.

mechcoder (@mechcoder):

Training a simple classifier on top of "frozen features" from large vision models is now common and powerful. In our CVPR 2024 paper (arxiv.org/abs/2403.10519), we show that just applying simple augmentations on such frozen features can improve few-shot classification.

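As a rough illustration of the recipe (not the paper's exact augmentations), here is a sketch of a few-shot linear probe on frozen features where the support set is enlarged with simple Gaussian jitter in feature space; the synthetic features stand in for outputs of a frozen vision backbone.

```python
# Sketch of a few-shot linear probe on frozen features, enlarging the support
# set with simple Gaussian jitter in feature space (a stand-in for the paper's
# augmentations). The synthetic features stand in for a frozen vision backbone.
import numpy as np
from sklearn.linear_model import LogisticRegression

rng = np.random.default_rng(0)
num_classes, shots, dim = 10, 5, 512

centers = rng.normal(size=(num_classes, dim))                     # class prototypes
feats = np.repeat(centers, shots, axis=0) + 0.3 * rng.normal(size=(num_classes * shots, dim))
labels = np.repeat(np.arange(num_classes), shots)

def augment(feats, labels, copies=4, sigma=0.1):
    # Jitter each frozen feature several times to enlarge the few-shot training set.
    jittered = [feats] + [feats + sigma * rng.normal(size=feats.shape) for _ in range(copies)]
    return np.concatenate(jittered), np.tile(labels, copies + 1)

probe = LogisticRegression(max_iter=1000).fit(*augment(feats, labels))
test = centers + 0.3 * rng.normal(size=centers.shape)             # one query per class
print("accuracy:", (probe.predict(test) == np.arange(num_classes)).mean())
```
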
Fabian Mentzer (@mentzer_f):

The GIVT that keeps giving. We scaled up our soft token generation transformer and improved the GMM. Also there is 👨🏽‍💻 CODE 🧑‍💻. Check it out ✨

Neil Houlsby (@neilhoulsby):

GIVT improvements and checkpoints! As a reminder, GIVT performs autoregressive image generation with continuous latents, (surprisingly) outperforming the standard approach of generating discrete tokens.
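
A toy sketch of the core idea, with an illustrative parameterization rather than GIVT's actual implementation: a hidden state is mapped to the parameters of a Gaussian mixture, which is used both to score a ground-truth continuous latent (the NLL training loss) and to sample the next soft token.

```python
# Toy sketch of a GMM output head for continuous ("soft") tokens: a hidden state
# is mapped to mixture weights, means, and scales, used both to score a target
# latent (NLL loss) and to sample the next token. Parameterization and shapes
# are illustrative, not GIVT's actual implementation.
import numpy as np

rng = np.random.default_rng(0)
d_model, d_token, k = 64, 16, 8            # hidden size, latent dim, mixture components
W = rng.normal(scale=0.02, size=(d_model, k * (1 + 2 * d_token)))

def gmm_params(h):
    out = h @ W
    logits = out[:k]
    means = out[k:k + k * d_token].reshape(k, d_token)
    scales = np.exp(out[k + k * d_token:].reshape(k, d_token))
    weights = np.exp(logits - logits.max())
    return weights / weights.sum(), means, scales

def negative_log_likelihood(x, weights, means, scales):
    # -log sum_j w_j * N(x; mu_j, diag(sigma_j^2))
    log_comp = -0.5 * np.sum(((x - means) / scales) ** 2 + 2 * np.log(scales) + np.log(2 * np.pi), axis=1)
    return -np.log(np.sum(weights * np.exp(log_comp)) + 1e-30)

def sample_token(weights, means, scales):
    j = rng.choice(k, p=weights)                       # pick a mixture component
    return means[j] + scales[j] * rng.normal(size=d_token)

h = rng.normal(size=d_model)                           # stand-in transformer hidden state
w, mu, sigma = gmm_params(h)
print(negative_log_likelihood(rng.normal(size=d_token), w, mu, sigma), sample_token(w, mu, sigma).shape)
```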

Lucas Beyer (bl16) (@giffmana):

Slowly but surely freeing us from the discretization in tokenization! Now the code and checkpoints of GIVT are available, so you can play around with them:

Lucas Beyer (bl16) (@giffmana):

We release PaliGemma. I'll keep it short, still on vacation:

- sota open base VLM designed to transfer quickly, easily, and strongly to a wide range of tasks
- Also does detection and segmentation
- We provide lots of examples
- Meaty tech report later!

ai.google.dev/gemma/docs/pal…
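
One possible way to try the released checkpoints is via the Hugging Face transformers integration; the model id, prompt prefix, and exact calls below are assumptions rather than details from the tweet, and the official examples use the big_vision JAX codebase instead.

```python
# One possible way to try the checkpoints via Hugging Face transformers; the
# model id, prompt prefix, and exact calls are assumptions, and the official
# examples use the big_vision JAX codebase instead.
import requests
from PIL import Image
from transformers import AutoProcessor, PaliGemmaForConditionalGeneration

model_id = "google/paligemma-3b-pt-224"            # assumed checkpoint name
model = PaliGemmaForConditionalGeneration.from_pretrained(model_id)
processor = AutoProcessor.from_pretrained(model_id)

url = "https://example.com/cat.jpg"                # placeholder image URL
image = Image.open(requests.get(url, stream=True).raw)

# PaliGemma is prompted with a short task prefix, e.g. captioning in English.
inputs = processor(text="caption en", images=image, return_tensors="pt")
out = model.generate(**inputs, max_new_tokens=30)
prompt_len = inputs["input_ids"].shape[1]
print(processor.decode(out[0][prompt_len:], skip_special_tokens=True))
```
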
Xiaohua Zhai (@xiaohuazhai):

We introduced PaliGemma at Google I/O. I would like to share a few links in this thread for developers and, most importantly, our new academic program to support PaliGemma research with Google Cloud credits 🧵

Michael Tschannen (@mtschannen):

Great work by Boris Dayma 🖍️ and Pedro Cuenca creating an open-source reproduction of CapPa! They present some interesting architecture tweaks, including the use of registers.
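
For context, registers are a few extra learnable tokens appended to the patch-token sequence and discarded after the encoder (following "Vision Transformers Need Registers"); the shape-level numpy sketch below shows the wiring only and is not the reproduction's code.

```python
# Shape-level sketch of register tokens: a few extra learnable tokens are
# prepended to the patch sequence before the encoder and dropped afterwards
# (cf. "Vision Transformers Need Registers"). Not the reproduction's code.
import numpy as np

rng = np.random.default_rng(0)
batch, num_patches, dim, num_registers = 2, 196, 768, 4

patch_tokens = rng.normal(size=(batch, num_patches, dim))
registers = rng.normal(scale=0.02, size=(1, num_registers, dim))   # learned parameters

x = np.concatenate([np.repeat(registers, batch, axis=0), patch_tokens], axis=1)
encoded = x                                    # stand-in for the transformer encoder
patch_out = encoded[:, num_registers:, :]      # registers are discarded at the output
print(x.shape, patch_out.shape)                # (2, 200, 768) (2, 196, 768)
```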

Lucas Beyer (bl16) (@giffmana):

✨PaliGemma report will hit arxiv tonight.

We tried hard to make it interesting, and not "here model. sota results. kthxbye."

So here are some of the many interesting ablations we did; check the paper tomorrow for more!

🧶