Zoox runs huge volumes of simulation to help rapidly iterate on our driving software and hardware, and validate our vehicles’ safety. We create infinitely varied virtual worlds to challenge our robots, from real world data, entirely novel scenarios, or a combination of both.

Across many teams at Zoox, engineers depend on running simulations. We have many different modes of execution, from single simulations to huge and varied pipelines. Zoox’s velocity is in a great part tied to the end-to-end time for someone kicking off a simulation job to them being able to draw conclusions from its results.

In this role you will be responsible for speeding up the execution pipeline, making it more usable, and enabling us to run at orders of magnitude larger scale. You’ll be improving and/or replacing our work scheduler, diving into resource contention, including CPU, GPU, file and network access contention, from a single node to multiple clusters, in our on-premises datacenter or the cloud. You’ll be helping our varied customer teams with their custom pipelines, from their design through to debugging transient issues.

Responsibilities

  • Make orders of magnitude increases in the scale and capacity of our systems capable of simulating millions of scenarios of driving
  • Monitor and improve efficiency, utilization, and scheduling of simulation jobs, over a large on-prem and cloud compute infrastructure.
  • Build metrics, dashboards, and monitoring systems to ensure the services are up and running optimally
  • Support users by improving usability, troubleshooting urgent issues, contributing to their system design as well as incorporating their requests and feedback into ours

Qualifications

  • Experience building scalable job orchestration frameworks and similar scalable distributed systems
  • Experience building and monitoring system health dashboards and diagnostics frameworks
  • Ability to work with users to understand the key issues and solve the right problems
  • 4+ years of industry experience with Python and C++
  • A BS in Computer Science or equivalent experience

Bonus Qualifications

  • AWS or other cloud infrastructure experience, including EC2, SQS, Kafka etc
  • Database experience 
  • Interest in autonomous vehicles and its benefits in the world
Vaccine Mandate

Employees working in this position will be required to be fully vaccinated against the COVID-19 virus. An applicant is considered fully vaccinated two weeks after their second dose in a 2-dose series, such as the Pfizer or Moderna vaccines, or two weeks after a single-dose vaccine, such as Johnson & Johnson’s Janssen vaccine. Applicants will be required to show proof of vaccination status upon receipt of a conditional offer of employment. That offer of employment will be conditioned upon, among other things, an Applicant’s ability to show proof of vaccination status. Please note the Company provides reasonable accommodations in accordance with applicable state, federal and local laws.

About Zoox

Zoox is developing the first ground-up, fully autonomous vehicle fleet and the supporting ecosystem required to bring this technology to market. Sitting at the intersection of robotics, machine learning, and design, Zoox aims to provide the next generation of mobility-as-a-service in urban environments. We’re looking for top talent that shares our passion and wants to be part of a fast-moving and highly execution-oriented team.


A Final Note:
You do not need to match every listed expectation to apply for this position. Here at Zoox, we know that diverse perspectives foster the innovation we need to be successful, and we are committed to building a team that encompasses a variety of backgrounds, experiences, and skills.