Job Description

Snapshot
Help us build generative models of the 3D world. World models power numerous domains, such as media generation, visual reasoning, simulation, planning for embodied agents, and real-time interactive experiences. Work with us to build better versions of Gemini, Genie, and Veo, while also exploring new spatial modalities beyond images and videos.
The Role
Key responsibilities: Conduct research to build generative multimodal models of the 3D world, and solve the essential problems of training world models at massive scale:

+ Build and train large-scale systems for data annotation
+ Curate and annotate training datasets
+ Build and maintain large model training infrastructure
+ Develop scaling ladders and training recipes
+ Develop metrics for spatial intelligence
+ Enable real-time interactive experiences
+ Study the integration of spatial modalities with multimodal language models
+ Train massive-scale models

Areas of focus:


+ 3D computer vi...
