Posted Jun 1, 2026
Our client is a global technology company operating in a large-scale, high-traffic online services environment, focused on delivering reliable and innovative customer-facing platforms.
We are seeking an experienced Senior DevOps Engineer to support and grow a large-scale mobile media platform that delivers personalized advertising and user engagement experiences integrated with a points/rewards ecosystem.
In this role, you will lead infrastructure architecture initiatives, improve platform reliability and observability, optimize operational costs, and support high-traffic distributed systems running in cloud-native environments. You will collaborate with cross-functional and international teams to build scalable and resilient platform solutions.
Responsibilities
Design, build, and operate scalable cloud infrastructure using GCP and Kubernetes (GKE/EKS)
Improve platform reliability, availability, observability, and operational efficiency
Develop and maintain CI/CD pipelines and reusable DevOps tooling
Support distributed architectures including microservices, event-driven systems, caching, and messaging platforms
Lead infrastructure modernization, lifecycle management, and upgrade planning with minimal service impact
Drive incident response, root cause analysis, and production stability improvements
Monitor infrastructure utilization and implement cost optimization strategies
Collaborate closely with software engineers, product teams, and overseas stakeholders
Strengthen infrastructure security posture and vulnerability remediation processes
Support data pipeline operations using Airflow and BigQuery environments where applicable
Requirements:
- 7+ years in DevOps/SRE/Platform Engineering.
- 3+ years of deep expertise operating a cloud platform (GCP) and Kubernetes (GKE/EKS) at scale.
- Deep knowledge of supporting server-side architectures (monoliths, microservices, event-driven and other distributed systems).
- Mastery of scaling patterns for large-scale, high-traffic services, including scaling across layers (web, database, logging).
- Deep middleware knowledge: distributed caching, message queues, RPC frameworks; plus networking/IAM fundamentals.
- Commitment to production service support: ensuring service availability and driving recovery from system incidents as the highest priority.
- Ability to plan and monitor infrastructure resource usage to ensure cost-effectiveness, and to collaborate with the team to reduce unnecessary costs where possible.
- Passionate technologist with an ownership mindset; cost optimization and observability experience preferred.
Preferred Qualifications:
- Experience designing and developing reusable DevOps libraries and CI/CD pipelines that can be easily applied across projects.
- Sharp problem-solving and business-to-tech translation; strong cross-functional leadership.
- Monitoring and alert design that accounts for business KPIs and is tailored to the characteristics of the service.
- Knowledge of Airflow and DAGs in a GCP BigQuery environment, and experience building data pipelines.
Languages
English: Fluent
Japanese: Optional / a plus
Work Environment
Fast-paced, dynamic global environment with collaborative teams across multiple locations
Salary: 9M to 12M JPY per year
Location: Hybrid (4 days in the office, 1 day remote)
Office Location: Tokyo, Japan
Working Hours: Flexible schedule with core hours from 11:00 AM to 3:00 PM
Visa Sponsorship: Available
Japanese language certification (such as JLPT N2) is not required for this role, as the company is a large global organization with an international working environment.
Language Requirement: English only
Apply now or contact us for further information: