Utkarsh (@utkarsh1352002) 's Twitter Profile
Utkarsh

@utkarsh1352002

Research Scholar @UPMC | GenAI?๐Ÿค” Stocks?๐Ÿค” utkd135.github.io/utkarsh/

ID: 824319874168188928

calendar_today25-01-2017 18:15:47

117 Tweet

77 Followers

447 Following

Birchlabs (@birchlabs) 's Twitter Profile Photo

SDXL's VAE can be modified to use neighbourhood attention, accelerated by NATTEN. in fact, you can remove attention entirely and images look pretty similar. left = global self-attention right = no attention! thanks Rivers Have Wings for the idea! github.com/Birch-san/sdxlโ€ฆ

SDXL's VAE can be modified to use neighbourhood attention, accelerated by NATTEN.
in fact, you can remove attention entirely and images look pretty similar.
left = global self-attention
right = no attention!
thanks <a href="/RiversHaveWings/">Rivers Have Wings</a> for the idea!
github.com/Birch-san/sdxlโ€ฆ
SkalskiP (@skalskip92) 's Twitter Profile Photo

What papers should I read to expand my knowledge of Transformers? Please send links in the comments and write why this paper is worth reading. Thanks for your help!

What papers should I read to expand my knowledge of Transformers?

Please send links in the comments and write why this paper is worth reading. Thanks for your help!
Elizabeth Laraki (@elizlaraki) 's Twitter Profile Photo

In 2008, Google Maps launched in India. But we quickly ran into a problem: Nobody used street names. And street names were the foundation of Google Maps. The team had to make some big adaptations. 15 years later, the changes have stood the test of time. Here's how the team

In 2008, Google Maps launched in India.

But we quickly ran into a problem:

Nobody used street names.

And street names were the foundation of Google Maps.

The team had to make some big adaptations.

15 years later, the changes have stood the test of time.

Here's how the team
Dreaming Tulpa ๐Ÿฅ“๐Ÿ‘‘ (@dreamingtulpa) 's Twitter Profile Photo

Readout Heads look like the next evolution of guided image generation! Like ControlNet, readout heads can be used for pose, depth, or edge-guided generations. But compared to ControlNet models, they're much more lightweight. Let's take a closer look:

Agrim Gupta (@agrimgupta92) 's Twitter Profile Photo

We introduce W.A.L.T, a diffusion model for photorealistic video generation. Our model is a transformer trained on image and video generation in a shared latent space. ๐Ÿงต๐Ÿ‘‡

Utkarsh (@utkarsh1352002) 's Twitter Profile Photo

I have just published an article explaining Segment Anything Model on medium link.medium.com/V6Yift8UPFb . Hope y'all like it.

Utkarsh (@utkarsh1352002) 's Twitter Profile Photo

Looking for deep learning project ideas. Topics I am interested in includes Domain Translation and Explainability. Reference papers would be appreciated๐Ÿ˜…. I am a final year undergrad so do show some mercy ๐Ÿ˜‚๐Ÿ˜….

Marc Lou (@marc_louvion) 's Twitter Profile Photo

In 2022, I used to launch startups in the void. - Post daily on ๐• - Make a product demo video - Share the Product Hunt link for support But I got 0 sales...

In 2022, I used to launch startups in the void.

- Post daily on ๐•
- Make a product demo video
- Share the Product Hunt link for support

But I got 0 sales...
SkalskiP (@skalskip92) 's Twitter Profile Photo

If you are curious about how non-max suppression works, some time ago, I made an infographic explaining this algorithm. 7/n

If you are curious about how non-max suppression works, some time ago, I made an infographic explaining this algorithm.

7/n
Utkarsh (@utkarsh1352002) 's Twitter Profile Photo

Working at really great efficiency, wrote two articles last week๐Ÿ˜…๐Ÿ˜ฑ I-JEPA: medium.com/@utkarsh135/i-โ€ฆ and MAE enable efficient knowledge distillers: medium.com/@utkarsh135/maโ€ฆ

Gradio (@gradio) 's Twitter Profile Photo

๐ŸŽจ Introducing Follow-Your-Click: Transforming the way we animate images with just a click and a short prompt! New ImageToVideo model. Say goodbye to moving entire scenes and hello to precision and creativity.

philz1337x (@philz1337x) 's Twitter Profile Photo

โœจ For the last few months I have been reverse engineering Magnific AI's famous upscaler It uses MultiDiffusion, ControlNet tiles and details LoRas In true AI spirit, I am open sourcing it for everyone to use for free in your apps It's designed to easily control the amount of

Martin Nebelong (@martinnebelong) 's Twitter Profile Photo

Apples, oranges, and the future of 3d ;-) Realtime, instant feedback, real-time lighting, and effects in a more painterly, sculptural, intuitive workflow. An underlying game engine lets you make full games, interactive movies, or art installations. AI takes care of the final

Dreaming Tulpa ๐Ÿฅ“๐Ÿ‘‘ (@dreamingtulpa) 's Twitter Profile Photo

FRESCO combines ControlNet with Ebsynth! The result: Improved spatial and temporal consistency for video-to-video ๐Ÿ”ฅ Links and 5 cool examples:

Sandeep Jain Tradeswift (@sandeepkrjaints) 's Twitter Profile Photo

A very Good Saturday Morning๐Ÿ™ Sharing a beautiful video received on WhatsApp๐Ÿ‘Œ๐Ÿ’Ž๐Ÿ’ชEnjoy the Life in full swing๐Ÿ™‚๐Ÿ’ช Love Life, Live

Kody Nordquist (@kodynordquist) 's Twitter Profile Photo

Entrepreneurs face pressures most people wonโ€™t understand. 20 sentences to read when youโ€™re feeling stuck: 1. You've survived 100% of your worst days so far. This too shall pass.

Emmett Shear (@eshear) 's Twitter Profile Photo

Gall's Law is one of those statements that just keeps hitting harder and more deeply the more time I spend with it. At first it seemed false, then a truism, then an interesting insight, and now a foundational belief about the world.

Gall's Law is one of those statements that just keeps hitting harder and more deeply the more time I spend with it. At first it seemed false, then a truism, then an interesting insight, and now a foundational belief about the world.