Telecommunication Allowance, Meal Allowance, Transportation Allowance, Housing Allowance, Medical Reimbursement
Description
Location
Japan | Full-time
Department
Operations Department
Job Responsibilities
System Stability and Performance Optimization
Responsible for the deployment, monitoring, incident handling, and capacity planning of the company's platform systems (website, app, APIs, databases, middleware, etc.), ensuring efficient and stable operation.
Establishment of Automated Operations System
Build and maintain CI/CD pipelines, automated deployment, container orchestration (Kubernetes), and configuration management and infrastructure as code (Ansible/Terraform) to achieve efficient operational automation.
High Availability Architecture and Disaster Recovery Design
Design and deploy high-availability system architectures across availability zones and regions, and establish comprehensive disaster recovery and backup mechanisms to ensure 24/7 service continuity.
Operational Security and Compliance Development
Assist the security team in implementing access control, data encryption, firewall policies, DDoS protection, and audit log management to build a comprehensive security system.
Monitoring and Emergency Response
Use tools such as Prometheus, Grafana, Zabbix, and the ELK stack to establish monitoring and alerting systems, and respond to and resolve incidents quickly.
Collaboration with R&D and Support for Rapid Delivery
Collaborate with the development team to support testing, gradual (canary) releases, and traffic switching, ensuring system stability and iteration efficiency under agile development.
Cloud Platform Resource Management
Manage resources on cloud platforms such as AWS, Alibaba Cloud, and GCP, including cost control and elastic scaling, and support multi-cloud or hybrid cloud deployments.
Operational Documentation and Process Standardization
Write and maintain operations manuals, emergency response plans, operation logs, and other documents to improve team collaboration efficiency and standardize processes.