Sean Kirmani
@seankirmani
Researcher at @GoogleDeepMind. Technology optimist.
ID: 1151711025093206017
https://kirmani.ai 18-07-2019 04:31:20
87 Tweets
478 Followers
505 Following
For robotics and AR applications, there are a lot of benefits to having VLMs that are spatially grounded in 3D. This recent work led by Boyuan Chen adds 3D reasoning capabilities to VLMs. One cool result is that we are able to answer *quantitative* distance questions as a reward signal.
Very excited about the huge potential of applying foundation models to robotics, & Gemini is perfect for this bc it’s natively multimodal. Some cool recent experiments below. If you're interested in working at the frontier of robotics, the Google DeepMind robotics team is hiring!
This is a very nice article by Hans Peter Brondmo about our work at Everyday Robots. My time there was one of the most formative parts of my career. My major takeaway is that robots will be “boring” soon. The recent energy in Silicon Valley makes me optimistic. wired.com/story/inside-g…