Role Overview

We are looking for a highly skilled Senior DevOps Engineer to lead and scale our infrastructure and DevOps practices. This role is a blend of hands-on technical contribution (IC) and strategic leadership, requiring deep expertise in AWS cloud infrastructure, automation, and system reliability.
The ideal candidate will have experience in high-performance, real-time environments, preferably within the gaming industry, and will play a critical role in building and maintaining systems that power engaging, large-scale gaming experiences.

Key Responsibilities

  • Lead the design and implementation of scalable, secure, and highly available infrastructure on AWS.
  • Architect and manage CI/CD pipelines to enable rapid, reliable releases.
  • Act as a hands-on contributor, actively writing infrastructure-as-code, automating workflows, and troubleshooting production issues.
  • Provide technical leadership and mentorship to DevOps and engineering teams, setting best practices and standards.
  • Collaborate closely with game engineers, backend teams, and product stakeholders to support real-time, low-latency systems.
  • Drive observability, monitoring, and incident response strategies to ensure high uptime and performance.
  • Optimize infrastructure for performance, scalability, and cost efficiency.
  • Establish and enforce security best practices, compliance, and governance across cloud environments.
  • Lead initiatives around containerization (Docker, Kubernetes) and microservices architecture.

Required Skills & Qualifications

  • 6+ years of experience in DevOps, SRE, or Infrastructure Engineering roles.
  • Strong expertise in AWS services (EC2, EKS, S3, Lambda, RDS, CloudFront, IAM, VPC, etc.).
  • Proven experience with Infrastructure as Code tools (Terraform, CloudFormation).
  • Hands-on experience with CI/CD tools (Jenkins, GitHub Actions, GitLab CI, etc.).
  • Strong understanding of containerization and orchestration (Docker, Kubernetes).
  • Experience with monitoring & observability tools (Prometheus, Grafana, ELK, Datadog).
  • Solid scripting/programming skills (Python, Bash, or Go).
  • Experience designing and operating highly available, distributed systems.

Preferred Qualifications

  • Prior experience in the gaming industry or real-time, high-traffic consumer applications.
  • Experience supporting multiplayer or live-service architectures.
  • Familiarity with scaling systems for millions of users.
  • Exposure to edge computing/CDN optimization (CloudFront, Fastly, etc.).
  • Strong understanding of low-latency system design.

Leadership Expectations

  • Act as a technical leader and decision-maker for DevOps strategy.
  • Balance hands-on IC work (~70-80%) with team leadership and mentorship (~30-20%).
  • Influence cross-functional teams and drive engineering excellence.
  • Lead by example in ownership, reliability, and operational excellence.