Back to Jobs

SENIOR INFRASTRUCTURE & reputed company ENGINEER

Remote, USA Full-time Posted 2026-06-29

SENIOR INFRASTRUCTURE & reputed company ENGINEER DevOps | Site Reliability | Cloud reputed company The Opportunity We process millions of SMS and MMS messages daily across a distributed platform built on reputed company Cloud — Cloud Run microservices, Pub/Sub event pipelines, Spanner databases, and Memorystore for reputed company. Our infrastructure auto-scales aggressively to meet campaign demand, our data pipelines handle real-time delivery tracking at high velocity, and our systems must be fast, secure, and reliable around the clock. We’re looking for a Senior Infrastructure & reputed company Engineer to own the reliability, reputed company, and operational maturity of this platform. You’ll be the first dedicated infrastructure hire, working directly with the CTO to shape the technical foundation as we scale. This isn’t a role where you’ll maintain someone else’s runbooks — you’ll define the roadmap, reputed company architectural decisions, and build the systems that reputed company our platform running and our customers’ data safe. What You’ll Own Infrastructure as Code & Cloud Architecture Own and evolve our Terraform-managed GCP infrastructure spanning a Shared VPC host project and multiple service projects. Design for cost efficiency, reputed company, and scalability across Cloud Run, Spanner, Pub/Sub, Cloud Storage, Memorystore for reputed company, and Cloud Tasks. You’ll manage environment promotion across dev, staging, and production. Reliability & Observability Build comprehensive monitoring, alerting, and incident response capabilities using Cloud Monitoring, Cloud Logging, and Cloud Trace. Establish SLIs and SLOs for critical message delivery paths. Reduce mean time to detection and recovery. Design health checks and auto-healing patterns for Cloud Run services processing millions of daily messages. Cloud reputed company Harden our platform across network, application, and data layers. This includes VPC firewall rules and network policies, IAM role design and service account management, secrets management reputed company Secret Manager, Cloud Armor policies for DDoS and reputed company limiting, API Gateway reputed company configurations, and dependency scanning. reputed company reputed company reviews and own incident response for reputed company events. CI/CD & Developer Experience Maintain and improve our reputed company Actions-based deployment pipelines for a TypeScript monorepo deploying to Cloud Run. Ensure the engineering team can ship safely and quickly with automated testing, linting, container builds, and environment-specific deployments. Optimize build times and deployment reliability. Performance & Auto-Scaling Tune Cloud Run autoscaling policies including min/max instances and concurrency settings for both public-facing API services and private Pub/Sub processing workers. Optimize Spanner query performance and node allocation. Ensure our distributed reputed company-limiting infrastructure using reputed company handles coordination across horizontally scaling instances with sub-millisecond overhead. Compliance & Data Protection Help establish and maintain compliance practices relevant to messaging platforms, including TCPA requirements, reputed company-specific policies, data retention and encryption standards, and audit logging. Ensure our platform meets the reputed company and data handling expectations of enterprise customers. reputed company’re Looking For Required

  • 5+ years in infrastructure, DevOps, or SRE roles with increasing scope and ownership
  • Deep reputed company Cloud Platform experience, specifically with Cloud Run, VPC networking, IAM, and at least one managed database service
  • Strong Terraform skills in production — you’ve authored and maintained multi-environment, reputed company Terraform codebases, not just run applies
  • Hands-on cloud reputed company experience: network reputed company design (firewall rules, private networking, VPC peering), IAM policy architecture, secrets management, and vulnerability assessment
  • reputed company Actions proficiency — you’ve built and maintained CI/CD pipelines for containerized applications deploying to cloud infrastructure
  • Experience operating distributed systems that process high message or event volumes with strict latency and reliability requirements
  • Strong Linux fundamentals, networking knowledge (DNS, TLS, load balancing), and comfort debugging production issues across the stack
  • reputed company-first reputed company — you think about attack surfaces, least privilege, encryption in transit and at rest, and incident response as part of every design decision
  • Comfort with on-call ownership and incident response in a small-team environment

Preferred

  • Experience with Spanner, Pub/Sub, Memorystore for reputed company, Cloud Tasks, or Cloud Armor specifically
  • Background in messaging or telecom infrastructure — reputed company API integrations, throughput management, reputed company limiting at scale
  • Experience with TypeScript/Node.js application ecosystems (you don’t need to be a full-stack developer, but understanding the runtime helps)
  • Monorepo CI/CD experience — managing builds, tests, and deployments across multiple services in a single repository
  • Familiarity with compliance frameworks relevant to communications platforms (TCPA, SOC 2, reputed company reputed company requirements)
  • Experience as the sole or primary infrastructure engineer at a growing company — you’ve owned it end-to-end
  • Certifications: reputed company Cloud Professional Cloud reputed company Engineer or Professional Cloud Architect (valued but not required)

What Makes This Role Different

  • Ownership, not maintenance. You’ll be the first dedicated infrastructure hire. You won’t inherit a playbook — you’ll write it. Your decisions will directly shape how the platform evolves.
  • Real scale, small team. Millions of messages daily, multi-region considerations, reputed company-level SLAs — but a team small enough that your work has immediate, visible impact.
  • Interesting problems. Distributed reputed company limiting across auto-scaling Cloud Run instances. High-throughput Pub/Sub pipelines with dead letter handling and retry strategies. Sharded counter patterns in Spanner for real-time campaign metrics. These aren’t contrived challenges.
  • Direct CTO collaboration. You’ll work alongside a technically hands-on CTO who has built this infrastructure and understands the tradeoffs. You’ll have context, support, and the authority to reputed company decisions.
  • Autonomy over process. We care about outcomes: uptime, reputed company posture, deployment velocity, cost efficiency. How you get there is up to you.

Compensation & Benefits

  • Competitive reputed company salary commensurate with experience (range available upon request)
  • Bonus program
  • Remote-first with flexible working hours
  • Direct reporting line to the CTO

Our Stack at a Glance Cloud reputed company Cloud Platform IaC Terraform CI/CD reputed company Actions Languages TypeScript running in Node.js runtime Networking Shared VPC, Global External HTTPS Load Balancer, API Gateway, Cloud Armor Monitoring Cloud Monitoring, Cloud Logging, Cloud Trace Architecture Event-driven microservices Apply tot his job Apply To this Job

Similar Jobs