DevOps Engineers
Job Title: DevOps-SRE
Location: Tokyo Hybrid/Remote
Salary: 8M - 10M
Ref: #1583
Role Description
We are seeking a highly skilled and experienced individual to join our team as a DevOps + Site Reliability Engineer (SRE). This role will be responsible for building and maintaining our infrastructure, optimizing system performance, and ensuring the reliability and scalability of our applications. The ideal candidate will have a strong background in software development, operations, and automation, with a focus on implementing DevOps best practices and Site Reliability Engineering principles. Initially the primary focus will be on DevOps. As our development process progresses and matures, the responsibilities will shift toward SRE.
Responsibilities
- Implement and maintain continuous integration and continuous deployment (CI/CD) pipelines to automate software delivery processes.
- Monitor system performance, troubleshoot issues, and implement solutions to improve reliability, performance, and scalability.
- Design and implement infrastructure as code (IaC) using tools such as Terraform, Ansible, or CloudFormation to automate provisioning and configuration management.
- Manage containerized environments using Docker and orchestration tools like Kubernetes for container deployment, scaling, and management.
- Implement and maintain monitoring, logging, and alerting systems to ensure proactive detection and resolution of issues.
- Collaborate with cross-functional teams to define and implement disaster recovery and business continuity plans.
- Ensure compliance with security best practices and industry standards for data protection and system security.
- Stay informed about emerging technologies and industry trends, and evaluate and recommend tools and technologies to improve efficiency and scalability.
Qualifications
Must-Have:
- Candidates should hold a Japanese residency or work visa.
- Bilingual with both fluent Japanese & English language capability, both written and verbal.
- Bachelor's degree in Computer Science, Engineering, or a related field, or equivalent experience.
- Minimum of 5 years of experience in a DevOps, SRE, or related role.
- An expert with hands-on experience with Google Cloud Platform (GCP) & Google Kubernetes Engine (GKE).
- Experience with infrastructure automation tools such as Terraform, Ansible, or CloudFormation.
- Proficiency in containerization technologies such as Docker and container orchestration tools like Kubernetes.
- Experience with CI/CD pipelines and tools such as Jenkins, GitHub Action, ArgoCD.
- Strong knowledge of Linux/Unix systems and shell scripting.
- Experience with monitoring, logging, and alerting tools such as Prometheus, Grafana, ELK Stack, or Splunk.
- Excellent problem-solving skills and ability to troubleshoot complex issues in a distributed, high-availability environment.
- Strong communication and collaboration skills, with the ability to work effectively in a fast-paced, dynamic environment.
Good-to-Have:
- Certification in GCP, Kubernetes or relevant technologies.
- Knowledge of database technologies such as Google Spanner, MySQL, PostgreSQL.
- Experience with infrastructure security and best practices tools
#LI-NK1