Zoox is transforming mobility-as-a-service by developing a fully autonomous, purpose-built fleet designed for AI to drive and humans to enjoy.
The Perception team at Zoox is fundamental to our autonomous vehicle technology, creating the understanding of the world for our self-driving robots. We enable safe and efficient navigation in complex environments through sophisticated detection, classification, and tracking systems.
As an engineer in the Semantics and Foundation Model team, you will develop advanced multimodal large language models that enhance our vehicles' environmental understanding. You'll develop and fine-tune these models for off-vehicle analysis while also optimizing them for real-time performance on our robotaxi platform, ensuring they can efficiently identify hazards and interpret driving restrictions with minimal latency. Working alongside world-class engineers and researchers, you'll leverage premium sensor data and cutting-edge infrastructure to validate your algorithms in real-world conditions, directly impacting the safety and capability of Zoox's autonomous system.
About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.
Accommodations
If you need an accommodation to participate in the application or interview process please reach out to [email protected] or your assigned recruiter.
A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.
In this role, you will:
Lead the development of multimodal large language models that enhance our robotaxis' understanding of complex urban environmentsDesign and implement efficient model architectures, balancing performance and computational constraints through fine-tuning and distillation techniquesDrive end-to-end ML solutions from research to production, utilizing Zoox's extensive data pipelines and infrastructure to improve autonomous driving capabilitiesCollaborate with perception, planning, and systems teams to integrate your models into the vehicle's decision-making pipelineValidate and optimize your solutions using real-world driving scenarios, directly contributing to the safety and reliability of Zoox's autonomous systemQualifications
MS or PhD in Computer Science, Machine Learning, or related technical fieldDemonstrated experience training and deploying perception and computer vision based modelsExperience building and maintaining ML training pipelines, including data preprocessing, model training, and evaluationProficiency in Python and ML libraries (PyTorch, NumPy) demonstrated through professional or research projectsExperience with model optimization techniques such as quantization, pruning, or knowledge distillationBonus Qualifications
Proficiency in C++Publications in top-tier conferences (CVPR, ICCV, RSS, ICRA)Experience with autonomous robotics systems