Zoox is transforming mobility-as-a-service by developing a fully autonomous, purpose-built fleet designed for AI to drive and humans to enjoy.
Zoox’s internship program provides hands-on experiences with state of the art technology, mentorship from some of the industry's brightest minds, and the opportunity to play a part in our success. Internships at Zoox are reserved for those who demonstrate outstanding academic performance, activities outside their course work, aptitude, curiosity, and a passion for Zoox's mission.
Perception at Zoox is the "Retina of Zoox" — the system responsible for understanding the world around the autonomous vehicle.]
As an MLE intern working on Perception, you may be assigned to one of the following teams:
On the Offline Driving Intelligence team, you will develop advanced multimodal large language models that enhance scenario understanding and driving. You'll develop and fine-tune models with driving data, ensuring models can efficiently identify hazards, interpret driving restrictions, drive and answer questions about the scenario. Working alongside world-class engineers and researchers, you'll leverage premium sensor data and cutting-edge infrastructure to validate your algorithms in real-world conditions, directly impacting productivity, safety and the capability of Zoox's autonomous system.
On the Perception Attributes team, you will collect and generate datasets for specialized vehicle classification and semantic enrichment, design and frame machine learning problems for real-world autonomous driving scenarios and train and evaluate state-of-the-art machine learning models with a focus on computer vision. You will also collaborate with engineers to deploy models for real-time inference on our vehicles, and contribute to improving our vehicle's ability to recognize and respond to emergency vehicles, school buses, construction vehicles, and other specialized road actors.
On the Perception Scene Understanding team, you will develop advanced ML models that perceive our vehicle's surroundings to identify hazards and driving restrictions. You will utilize vision-language models for detecting rare events and ensuring safe driving in these situations. You'll work with state-of-the-art machine learning models that operate in real-time on our robotaxi platform with minimal latency. Collaborating with world-class engineers and researchers across sensors, planning, and other teams, you'll have access to premium sensor data and cutting-edge infrastructure to validate your algorithms in real-world conditions.
On the Occupancy and Rare Events team, you will develop multimodal foundation models that serve as the common backbone for on-vehicle perception, enhancing the system's ability to detect long-tail events and generalize to new geofences. In this role, you will develop effective tokenization techniques for Vision, Lidar, and Radar modalities, leverage LLM techniques to align token embeddings across modalities into a common feature space supporting various 3D tasks (detection, segmentation, tracking, feature matching, dense depth), You'll collaborate with top-notch engineers across PCP, MLInfra, and Offboard Driving Intelligence teams, utilizing Zoox's large-scale dataset to train and evaluate models that directly impact the autonomous system's real-world performance.
On the perception optimization team, you will build optimized inference pipelines for on-bot algorithms. A major focus of optimization is ML models, with techniques such as quantization, pruning, and advanced transformer optimizations such as token pruning, merging and layer pruning being used to deploy large models into the bot to operate at real time. In this role, you will experiment with optimizing SOTA large ML models to make them fit into on-bot compute, including both post-training optimization (e.g. quantization) as well as architectural approaches (e.g. token merging).
Compensation:
The monthly salary range for this position is $5,500 to $9,500. Compensation will vary based on geographic location and level of education. Additional benefits may include medical insurance, and a housing stipend (relocation assistance will be offered based on eligibility).
About Zoox
Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.
Accommodations
If you need an accommodation to participate in the application or interview process please reach out to
[email protected] or your assigned recruiter.
A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.
Requirements:
Currently working towards a B.S., M.S., Ph.D., or advanced degree in a relevant engineering programMust be returning to school to continue your education upon completing this internshipGood academic standingAble to commit to a 12-week internship beginning in May or June of 2026.At least one previous industry internship, co-op, or project completed in a relevant areaAbility to relocate to the Bay Area, California or Boston for the duration of the internshipInterns at Zoox may not use any proprietary information they are working on as part of their thesis, any published work with their university, or to be distributed to anyone outside of ZooxQualifications (It’s helpful if you meet a majority of the following qualifications, but it isn’t a requirement):
Advanced understanding of Python or C++ (C++ preferred)Experience with production ML pipelines: dataset creation, labeling, training, metricsExperience training/finetuning MLLMs or at least MLLms (SFT/RL)Experience with Vision-Language ModelsExperience with model deployment with TensorRT Experience with Neural Network design and implementationExperience working with LiDAR, Camera and Radar dataExperience with building and processing large scale datasetGPU/CUDA programming experienceBonus Qualifications:
Experience with multimodal foundation model optimization techniquesExperience in algorithm development for Autonomous Driving software