Utkarsh (@utkarsh1352002) Twitter Tweets • TwiDoom

Birchlabs

a year ago

SDXL's VAE can be modified to use neighbourhood attention, accelerated by NATTEN. in fact, you can remove attention entirely and images look pretty similar. left = global self-attention right = no attention! thanks Rivers Have Wings for the idea! github.com/Birch-san/sdxl…

thumb_up_off_alt177

chat_bubble_outline5

repeat13

shareShare

SkalskiP

@skalskip92

a year ago

What papers should I read to expand my knowledge of Transformers? Please send links in the comments and write why this paper is worth reading. Thanks for your help!

thumb_up_off_alt680

chat_bubble_outline32

repeat101

shareShare

Elizabeth Laraki

@elizlaraki

a year ago

In 2008, Google Maps launched in India. But we quickly ran into a problem: Nobody used street names. And street names were the foundation of Google Maps. The team had to make some big adaptations. 15 years later, the changes have stood the test of time. Here's how the team

thumb_up_off_alt4,4K

chat_bubble_outline112

repeat1,1K

shareShare

Dreaming Tulpa 🥓👑

@dreamingtulpa

10 months ago

Readout Heads look like the next evolution of guided image generation! Like ControlNet, readout heads can be used for pose, depth, or edge-guided generations. But compared to ControlNet models, they're much more lightweight. Let's take a closer look:

thumb_up_off_alt256

chat_bubble_outline6

repeat38

shareShare

Agrim Gupta

@agrimgupta92

9 months ago

We introduce W.A.L.T, a diffusion model for photorealistic video generation. Our model is a transformer trained on image and video generation in a shared latent space. 🧵👇

thumb_up_off_alt1,1K

chat_bubble_outline55

repeat263

shareShare

Utkarsh

@utkarsh1352002

9 months ago

I have just published an article explaining Segment Anything Model on medium link.medium.com/V6Yift8UPFb . Hope y'all like it.

thumb_up_off_alt2

chat_bubble_outline0

repeat0

shareShare

Utkarsh

@utkarsh1352002

8 months ago

Looking for deep learning project ideas. Topics I am interested in includes Domain Translation and Explainability. Reference papers would be appreciated😅. I am a final year undergrad so do show some mercy 😂😅.

thumb_up_off_alt3

chat_bubble_outline1

repeat1

shareShare

Marc Lou

@marc_louvion

8 months ago

In 2022, I used to launch startups in the void. - Post daily on 𝕏 - Make a product demo video - Share the Product Hunt link for support But I got 0 sales...

thumb_up_off_alt541

chat_bubble_outline30

repeat16

shareShare

SkalskiP

@skalskip92

7 months ago

If you are curious about how non-max suppression works, some time ago, I made an infographic explaining this algorithm. 7/n

thumb_up_off_alt117

chat_bubble_outline3

repeat7

shareShare

Utkarsh

@utkarsh1352002

6 months ago

Working at really great efficiency, wrote two articles last week😅😱 I-JEPA: medium.com/@utkarsh135/i-… and MAE enable efficient knowledge distillers: medium.com/@utkarsh135/ma…

thumb_up_off_alt1

chat_bubble_outline0

repeat0

shareShare

Gradio

@gradio

6 months ago

🎨 Introducing Follow-Your-Click: Transforming the way we animate images with just a click and a short prompt! New ImageToVideo model. Say goodbye to moving entire scenes and hello to precision and creativity.

thumb_up_off_alt437

chat_bubble_outline3

repeat107

shareShare

philz1337x

@philz1337x

6 months ago

✨ For the last few months I have been reverse engineering Magnific AI's famous upscaler It uses MultiDiffusion, ControlNet tiles and details LoRas In true AI spirit, I am open sourcing it for everyone to use for free in your apps It's designed to easily control the amount of

thumb_up_off_alt2,2K

chat_bubble_outline165

repeat377

shareShare

Martin Nebelong

@martinnebelong

6 months ago

Apples, oranges, and the future of 3d ;-) Realtime, instant feedback, real-time lighting, and effects in a more painterly, sculptural, intuitive workflow. An underlying game engine lets you make full games, interactive movies, or art installations. AI takes care of the final

thumb_up_off_alt706

chat_bubble_outline25

repeat107

shareShare

Dreaming Tulpa 🥓👑

@dreamingtulpa

6 months ago

FRESCO combines ControlNet with Ebsynth! The result: Improved spatial and temporal consistency for video-to-video 🔥 Links and 5 cool examples:

thumb_up_off_alt114

chat_bubble_outline2

repeat16

shareShare

Sandeep Jain Tradeswift

@sandeepkrjaints

5 months ago

A very Good Saturday Morning🙏 Sharing a beautiful video received on WhatsApp👌💎💪Enjoy the Life in full swing🙂💪 Love Life, Live

thumb_up_off_alt1,1K

chat_bubble_outline49

repeat311

shareShare

Kody Nordquist

@kodynordquist

5 months ago

Entrepreneurs face pressures most people won’t understand. 20 sentences to read when you’re feeling stuck: 1. You've survived 100% of your worst days so far. This too shall pass.

thumb_up_off_alt114

chat_bubble_outline8

repeat25

shareShare

Emmett Shear

@eshear

2 months ago

Gall's Law is one of those statements that just keeps hitting harder and more deeply the more time I spend with it. At first it seemed false, then a truism, then an interesting insight, and now a foundational belief about the world.

thumb_up_off_alt5,5K

chat_bubble_outline90

repeat829

shareShare

Yasser

@yasser_elsaid_

a month ago

I rewatch this once a month for motivation

thumb_up_off_alt8,8K

chat_bubble_outline89

repeat1,1K

shareShare