Senior Manager – DevOps/Platform Engineering
Location
China
Posted
2 days ago
Salary
0
Seniority
Senior
Job Description
Senior Manager – DevOps/Platform Engineering
Siam Makro Public Company Limited
• Lead end-to-end software delivery processes, ensuring high performance in speed, quality, and reliability of deployment cycles. • Design and implement continuous integration, continuous delivery, and automated deployment pipelines across engineering teams. • Manage infrastructure-as-code solutions and multi-cloud environments, ensuring scalability, stability, and operational efficiency. • Mentor and coach engineering teams to strengthen coding practices, DevOps maturity, and problem-solving capability. • Define and enforce engineering standards, secure coding practices, and automated quality controls. • Identify and resolve technical bottlenecks in development, deployment, and system performance. • Drive automation initiatives to improve development efficiency, reduce manual effort, and optimize workflows. • Collaborate with product, engineering, and infrastructure teams to deliver integrated and consistent platform solutions.
Job Requirements
- Bachelor’s or Master’s degree in Computer Science, Software Engineering, Information Technology, or related field
- Minimum 8–12 years of experience in software engineering / DevOps / platform engineering roles
- At least 3–5 years in technical leadership or senior engineering management role
- Proven experience in CI/CD implementation, cloud infrastructure, and DevOps transformation
- Experience managing multi-cloud or large-scale distributed systems preferred
- Experience working in global or China-based technology environments is an advantage
- Strong English communication skills; Mandarin is an advantage
- Leadership & Behavioral Skills
- Strong technical leadership and hands-on engineering capability
- Coaching and mentoring engineering teams
- Problem-solving and critical thinking under complex environments
- Ability to challenge existing architecture and drive innovation
- Cross-functional collaboration with product and engineering teams
- Strong ownership mindset and accountability for delivery outcomes
- Change management and engineering transformation leadership
- Technical Skills**
- Deep knowledge of cloud infrastructure (Tencent, Alibaba Cloud, AWS) and cluster management tools like Kubernetes
- Experience creating automated continuous deployment pipelines using tools such as Github actions or Gitlab CI/CD
- Extensive coding experience with Terraform, Python, bash, and shell scripts
- Hands-on experience with observability tooling — Prometheus metrics collection, Grafana dashboards, alerting rules, and log aggregation (ELK / Loki)
- Deep experience managing core infra components — API gateways, CDN configuration, load balancer tuning
- VPC and network design proficiency — subnet segmentation, security groups, private/public tier separation, VPN/peering, and egress control
- Operational Management**
- Strong understanding of distributed systems design — service discovery, fault tolerance, eventual consistency, and horizontal scalability
- Cybersecurity best practices — secrets management (Vault / KMS), SAST/DAST pipeline integration, least-privilege IAM, and secure supply chain (image signing, SBOM)
- Cloud cost management expertise — resource tagging strategies, rightsizing, reserved/spot instance planning, and FinOps reporting
Benefits
- Full social insurance and housing fund
- Year-end bonus
- Phone allowance
- Transportation allowance
- Best Culture
- Clear focus.
- Diverse Workplace (Our members are from around the world!)
- Non-hierarchical and agile environment
- Growth opportunity and career path
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Maintain and support core infrastructure systems with deep knowledge of Linux (Debian/Ubuntu preferred) • Work close to the metal: BIOS, IPMI, RAID setups, and hardware-level diagnostics • Design and maintain scalable networks using VLANs, L2/L3 routing, VPNs • Automate infrastructure provisioning and operations with Ansible, Bash/Python • Set up and manage observability stacks, including Prometheus/Grafana and ELK/Graylog • Build tooling for server discovery, config auto-generation, automated OS deployments • Integrate and/or develop internal APIs for tracking compute and GPU resource allocation • Deploy and maintain virtualization and orchestration systems such as OpenStack and Proxmox VE • Support container-based workloads and isolate services efficiently
Database SRE Manager
CrowdStrikeCrowdStrike has redefined security with the world’s most advanced cloud-native platform that protects and enables the people, processes and technologies that drive modern enterprise. Tested and proven, the world's largest organizations trust CrowdStrike to stop breaches with unparalleled protection against the most sophisticated cyberattacks. The CrowdStrike culture has been built upon our Core Values since the day we began. We are Fanatical About the Customer, Relentlessly Focused on Innovation and believe that our Limitless Passion drives Unlimited Potential for every CrowdStriker. As a purpose-built remote-first company, we believe cultivating a connected culture for every employee, no matter where they are in the world, is a key ingredient in building a high-performing, diverse team. We don’t have a mission statement. We’re on a mission—to stop breaches. Ready to join a mission that matters?
• Lead and mentor a team of skilled engineers responsible for the deployment, operations, and scaling of critical data platforms including Apache Cassandra, Apache Kafka, OpenSearch, caching solutions (Memcached, Redis), relational databases (PostgreSQL, MySQL), Kubernetes, and Zookeeper. • Develop and execute long-term technical strategies to ensure the scalability, reliability, and performance of our data infrastructure. • Drive architectural decisions and innovations that align with CrowdStrike's business goals and technical roadmap. • Oversee operations in large-scale, business-critical Linux environments, balancing both cloud and bare metal infrastructures. • Collaborate with cross-functional teams to integrate data services seamlessly into CrowdStrike's broader technology ecosystem. • Implement and refine processes for continuous improvement, focusing on system reliability, performance optimization, and cost-effectiveness. • Provide technical leadership and guidance across the organization on best practices for data management and infrastructure.
Site Reliability Engineer
Your BourseTrade Execution Technology for MT4, MT5 and Crypto Brokers. Liquidity, Risk Management, Reporting Platform-as-a-Service
Role Description We are looking for a highly skilled and motivated Site Reliability Engineer to manage and scale our global infrastructure. This role involves hands-on administration of Linux servers, automation, network configuration, system hardening, and ensuring high availability and performance. You will play a key role in infrastructure planning, security, compliance, and supporting mission-critical environments. Responsibilities - Infrastructure & Network - Install, configure, and harden Ubuntu Server environments (LTS releases). - Coordinate cross-connect implementations (L2/L3) with network providers to ensure reliable connectivity and SLAs. - Design infrastructure solutions including complex network topologies. - Implement automated provisioning using Ansible and Terraform. - Weekly Maintenance & Patching - Apply OS patches and server upgrades with minimal downtime (weekend window). - Apply firmware updates and monitor global infrastructure health using Prometheus/Grafana. - Lead and execute client migrations with minimal service disruption. - Security, Backup & Support - Enforce security policies: SSH hardening, firewalls, user permissions. - Design and maintain backup strategies and disaster-recovery plans. - Provide L2/L3 support, diagnose and resolve network and server issues. - Any other duties and responsibilities relevant to the role. Qualifications - 5+ years of hands-on Linux administration (Ubuntu Server, advanced level). - Deep knowledge of BGP, networking fundamentals, and cross-connect configurations (L2/L3). - Real hands-on experience with Ansible and Terraform in production environments. - Strong scripting skills in bash and/or Python. - Solid understanding of TCP/IP, DNS, DHCP, firewalls, AppArmor/SELinux. - Experience with Docker, Prometheus/Grafana/ELK, and database administration (PostgreSQL, MySQL, ClickHouse). - Fluent in English — both written and spoken. Nice to Have - Certifications: Ubuntu Professional, RHCE, or LPIC. - Experience with cloud platforms (AWS, Azure, Google Cloud) and hybrid-cloud architectures. - Familiarity with CI/CD tools (GitHub Actions). Benefits - Competitive compensation package. - Full-time remote role. - Learning & Development support. - Paid annual leave and sick leave. - Company events and team celebrations (online and offline). - Anniversary and birthday gifts. - Clear career growth and professional development opportunities. - Supportive, inclusive, and collaborative work environment.
• Collaboration with product teams. • Participate in the launch of new projects and new features. • Participate in the design of complex information systems. • Automate infrastructure components. • Setup and maintain infrastructure. • Consult managers and company clients. • Create and maintain technical documentation.




