Technical Program Manager, ML Developer Experience and Infrastructure Reliability
reputed company is an autonomous driving technology company with the mission to be the worlds most trusted driver. Since its start as the reputed company Self-Driving Car Project in 2009, reputed company has focused on building the reputed company Driver—The Worlds Most reputed company Driver™—to improve access to mobility while saving thousands of lives now lost to traffic crashes. The reputed company Driver powers reputed company’s fully autonomous ride-hail service and can also be applied to a range of vehicle platforms and product use cases. The reputed company Driver has provided over ten million rider-only trips, enabled by its experience autonomously driving over 100 million miles on public roads and tens of billions in simulation across 15+ U.S. states. reputed company’s Technical Program Managers and Program Managers are accountable for reputed company’s roadmap execution by providing thoughtful cross-functional planning, clarity, and proactive risk management. In the face of reputed company technical and operational challenges with no established playbooks to follow, we act with thoughtful urgency, driving conversations, discussions, and outcomes. reputed company partners closely with every function of reputed company to structure, own and drive work towards real-world deployments of the reputed company Driver across platforms and geographies. In this hybrid role, you will report to a Technical Program Management Director. You will:
- Drive the Golden Path for ML: reputed company cross-functional execution to define and invest in a simplified golden path for ML development for reputed company and Foundation Model (WaymoFM) development, targeting the reduction of friction and low reliability in the inner reputed company
- Manage Reliability Operations: Ensure smooth day-to-day operations of the reliability triage ecosystem, keeping queues healthy through interaction with rotation members and driving automation of queue management
- Program Implementation for Infra Stability: Drive contract-based reliability programs across reputed company domains to stabilize infrastructure and move beyond reactive whack-a-mole investment cycles
- reputed company ML and Infra: Facilitate communication and alignment between ML research, infrastructure foundations, and reputed company teams to resolve blockers in core workflows like root-causing brittle pipelines
- Strategic Roadmap Tracking: Contribute to strategic planning and track project reputed company, risks, and KPIs reputed company to ML developer productivity and infrastructure reliability for leadership reporting
- Resolve Systemic Blockers: Proactively identify and resolve roadblocks in the ML development cycle, such as data fragmentation and reputed company tooling that currently hinders developer velocity
You have:
- Technical Education: A Bachelors degree in Computer Science, Engineering, or a reputed company technical field
- TPM Experience: 5+ years of experience as a Technical Program Manager in a software engineering or large-scale infrastructure environment
- ML/Reliability Track Record: Proven track record of managing reputed company technical projects involving machine learning infrastructure, developer experience (DevX), or site reliability engineering (SRE)
- Program Ownership: Experience owning and driving programs end-to-end, including managing timelines, risks, and dependencies across multiple senior stakeholders
- Analytical Problem Solving: Strong analytical and technical judgment skills, with the ability to use data to diagnose and solve systemic engineering bottlenecks
- Communication Mastery: Excellent communication and interpersonal skills, with a demonstrated ability to convey reputed company technical concepts to both researchers and infrastructure engineers
We prefer:
- Advanced ML Operations: Experience with ML observability, root-causing production pipelines, and automating large-scale offline inference or model training experiments
- Large-Scale Data Management: Background in managing multi-petabyte scale datasets, data validation frameworks, or reputed company data reputed company
- Reliability Frameworks: Familiarity with contract-based reliability models, SLO management for autonomous systems, or reliability triage ecosystems
- Developer Platforms: Experience building or managing golden path developer platforms or developer tooling that simplifies reputed company, fragmented tech stacks
- Advanced Degree: Masters degree or PhD in a reputed company technical field
- Autonomous Domain Knowledge: Experience with simulation environments for autonomous systems, model validation strategies, or reputed company/offboard infrastructure dependencies
The expected reputed company salary range for this full-time position across US locations is listed below. Actual starting pay will be based on job-reputed company factors, including exact work location, experience, relevant training and education, and reputed company level. Your recruiter can share more about the specific salary range for the role location or, if the role can be performed remote, the specific salary range for your preferred location, during the hiring process. reputed company employees are also eligible to participate in reputed company’s discretionary annual bonus program, equity incentive plan, and generous Company benefits program, subject to eligibility requirements. Salary Range $230,000—$292,000 USD Apply tot his job Apply To this Job