Senior DevOps Engineer - Mobile Media & Data Platform

TG Japan Inc. · Chiyoda-ku, Tokyo Prefecture, JP

PostJobFree Posted Jun 1, 2026 First seen Jun 1, 2026
Our client is a global technology company operating in a large-scale, high-traffic online services environment, focused on delivering reliable and innovative customer-facing platforms. We are seeking an experienced Senior DevOps Engineer to support and scale a large-scale mobile media platform that delivers personalized advertising and user engagement experiences integrated with a point/reward ecosystem. In this role, you will lead infrastructure architecture initiatives, improve platform reliability and observability, optimize operational costs, and support high-traffic distributed systems running in cloud-native environments. You will collaborate with cross-functional and international teams to build scalable and resilient platform solutions. Responsibilities Design, build, and operate scalable cloud infrastructure using GCP and Kubernetes (GKE/EKS) Improve platform reliability, availability, observability, and operational efficiency Develop and maintain CI/CD pipelines and reusable DevOps tooling Support distributed architectures including microservices, event-driven systems, caching, and messaging platforms Lead infrastructure modernization, lifecycle management, and upgrade planning with minimal service impact Drive incident response, root cause analysis, and production stability improvements Monitor infrastructure utilization and implement cost optimization strategies Collaborate closely with software engineers, product teams, and overseas stakeholders Strengthen infrastructure security posture and vulnerability remediation processes Support data pipeline operations using Airflow and BigQuery environments where applicable Requirements: - 7+ years in DevOps/SRE/Platform Engineering. - 3+ years of deep expertise operating cloud platform (GCP) and Kubernetes (GKE/EKS) at scale. - Deep knowledge in supporting server-side architectures (monoliths, microservices, event-driven and other distributed systems). - Mastery of scaling patterns for large-scale, high-traffic services, including scaling across layers (web, database, logging). - Deep middleware knowledge: distributed caching, message queues, RPC frameworks; plus networking/IAM fundamentals. - Commitment to production service support: ensure service availability and drive recovery from system troubles as the highest priority. - Plan and monitor infrastructure resource usage to ensure cost-effectiveness; collaborate with the team to reduce unnecessary costs where possible. - Passionate technologist with ownership mindset; cost optimization and observability experience preferred. Preferred Qualifications: - Design and develop reusable DevOps libraries and CI/CD pipelines that can be easily applied across projects. - Sharp problem-solving and business-to-tech translation; strong cross-functional leadership. - Monitoring and alert design that considers business KPIs considered to the characteristics of the service. - Knowledge of Airflow and DAG in a GCP BigQuery environment, and experience building data pipelines. Languages English: Fluent Japanese: Optional / a plus Work Environment Fast-paced, dynamic global environment with collaborative teams across multiple locations Salary: 9M 12M JPY per year Location: Hybrid (4 days in the office, 1 day remote) Office Location: Tokyo, Japan Working Hours: Flexible schedule with core hours from 11:00 AM to 3:00 PM Visa Sponsorship: Available Japanese language certification (such as JLPT N2) is not required for this role, as the company is a large global organization with an international working environment. Language Requirement: English only Apply now or contact us for further information: