Job Closed

This listing is no longer active.

Binance

The World’s Leading Blockchain Ecosystem and Digital Asset Exchange

Frontend DevOps Engineer, EKS, K8s, Python, Terraform

DevOps EngineerDevOps EngineerFull Time Remote SeniorTeam 1,001-5,000Since 2017H1B No SponsorCompany Site LinkedIn

Location

Taiwan

Posted

124 days ago

Salary

Seniority

Senior

Bachelor DegreeEnglishChineseAWS Cloud Docker EC2 Google Cloud Platform Kubernetes Linux Python Terraform Go

Job Description

• Manage production incidents and conduct post-mortems to enhance system stability and reliability. • Partner with development teams to ensure seamless application and infrastructure deployments. • Maintain and optimize cloud infrastructure (AWS, AliCloud, TencentCloud) for performance, cost-efficiency, and high availability. • Administer and scale Kubernetes clusters (primarily EKS) to support scalable web service deployments. • Design, build, and maintain CI/CD pipelines using GitHub Actions, ArgoCD, and AI/LLM-powered automation. • Automate infrastructure provisioning and configuration management using Terraform and Python.

Job Requirements

Hands-on experience with **AWS services (EC2, ELB, EKS, ECS, VPC, IAM, S3, CloudFront),** AWS SDK and other cloud services (Cloudflare/GCP/AliCloud/TencentCloud).
Strong working knowledge of **Kubernetes**, particularly managed services such as **EKS**.
Proficiency in scripting languages including **Bash, Python, or Go.**
Experience with monitoring, logging, and system performance optimization.
Familiarity with DevOps toolchains including **Terraform, Docker, and Linux **administration.
Solid understanding of **networking protocols** and security best practices.
Strong analytical and problem-solving skills.
Fluent in English and Mandarin to coordinate effectively with global stakeholders and cross-functional teams.
Preferred**
Hands-on experience leveraging AI models or tools to address operational challenges and streamline deployment workflows.
Enthusiasm for exploring emerging technologies and integrating them into daily operations to drive innovation and continuous improvement.

Benefits

Competitive salary and company benefits
Work-from-home arrangement (the arrangement may vary depending on the work nature of the business team)

Related Categories

DevOps Engineer

Related Job Pages

Remote Full-time Jobs (US)Remote Python Jobs (US)More Remote Jobs

More DevOps Engineer Jobs

DevOps Engineer

Cosmote Global Solutions

DevOps Engineer124 days ago

Contract RemoteTeam 11-50H1B No Sponsor

Company Site LinkedIn

COSMOTE Global Solutions NV is seeking a skilled DevOps Engineer to join our growing team. You will be responsible for developing and maintaining our infrastructure as code, automating deployment processes, and ensuring system reliability and scalability across our cloud environments. Key Responsibilities: - Understand current policies on three Entra ID tenants - Develop terraform modules and Azure DevOps pipelines to ensure secure management of policies - Prepare Conditional Access Policy operations transition from current team to Cyber Security team - Maintenance of the Conditional Access Policies (troubleshooting, new policies implementation, improve existing policies)

View details: DevOps Engineer

Luxembourg

Apply

Job Closed

Site Reliability Engineer

Tyk Technologies

Tyk Technologies, or simply Tyk, is a computer software company that works as an open-source API gateway and management platform. Tyk is a high-performance SaaS

DevOps Engineer124 days ago

Full Time Remote

Company Site

• Maintaining global Tyk Cloud within SL(A/I/O)s you will help to define • Identifying reliability issues and working together with your squad to solve them • Identifying and introducing new metrics and building relevant dashboards • Participating in the on-call rotation • Working with your squad to expand multi-region and multi-cloud reach of the platform • Documenting operational knowledge • Conducting post-incident analysis • Automating common tasks • Be a key shaper and contributor to our continuous improvement agenda – be it the clarity of our user stories, how we estimate, communicate with other teams or customers – we expect this role to be advocate of continuous improvement • Reliability of our new global Tyk Cloud platform • Automation of operations and support • Writing and maintaining documentation on SRE processes and policies • Recommending and implementing ways of driving operational efficiency and driving down our cost to run, without impacting service • Assisting in penetration testing for Cloud through liaising with our provider, providing technical details, and environment setup • Incident management

AWS Cloud Grafana Kubernetes Linux MongoDB Prometheus Redis

View details: Site Reliability Engineer

Canada

Apply

Senior DevOps Engineer, Web3

Launch Legends

Launch Legends is a worldwide team of renegade artists, scientists, & engineers who bring back insights from the future.

DevOps Engineer124 days ago

Part Time RemoteTeam 51-200Since 2021H1B No Sponsor

Company Site LinkedIn

• Design, implement, and manage the infrastructure for our blockchain ecosystem • Take ownership of designing and managing the company's cloud infrastructure on AWS, GCP, and Cloudflare • Architect and optimize cloud solutions for scalability and security • Optimize deployment, monitoring, and scaling of blockchain nodes • Automate the deployment process using CI/CD pipelines • Ensure high availability, reliability, and security of the infrastructure • Collaborate with the development team to integrate DevOps practices • Maintain and manage code repositories and workflows on GitHub • Troubleshoot and resolve infrastructure-related issues • Lead and mentor a team of DevOps professionals

Ansible AWS Cloud Docker Google Cloud Platform Jenkins Kubernetes Oracle Python Terraform Web3

View details: Senior DevOps Engineer, Web3

United States

Apply

Senior Site Reliability Engineer – GCP

Devsu

Devsu is a technology agency that provides software development services, IT augmentation and staffing.

DevOps Engineer124 days ago

Full Time RemoteTeam 51-200H1B No Sponsor

Company Site LinkedIn

We are seeking a Site Reliability Engineer (SRE) with deep expertise in monitoring, observability, and reliability engineering to support systems running across on-premises infrastructure and Google Cloud Platform (GCP). This role is primarily responsible for designing, operating, and improving monitoring, alerting, and observability platforms, with a strong focus on Grafana and Kubernetes environments. As a secondary responsibility, this role provides backup coverage for the Application Support team during periods of resource constraints or major incidents, offering L2/L3 technical support when required. ResponsibilitiesMonitoring & Observability (Core Focus) - Own and operate the monitoring and observability stack across on-prem and GCP environments - Design, build, and maintain Grafana dashboards for infrastructure, Kubernetes, and applications - Define, tune, and maintain alerts to ensure high signal-to-noise ratio - Establish observability standards and best practices across teams - Improve visibility into system health, performance, and reliability Site Reliability Engineering - Apply SRE principles to improve availability, performance, and resilience - Define and track SLIs, SLOs, and error budgets - Participate in on-call rotations and SEV incident response - Lead or contribute to incident investigations and root cause analysis (RCA) - Drive preventative actions to reduce repeat incidents Kubernetes & Platform Reliability - Support and monitor Kubernetes environments (GKE and on-prem clusters) - Monitor cluster health, capacity, and resource utilization - Troubleshoot platform-level issues impacting application reliability - Collaborate with Platform and Engineering teams on reliability improvements Secondary Responsibilities (Backup Application Support) - These responsibilities are activated as needed, not part of day-to-day operations. - Provide L2/L3 application support coverage during: - Support team resource shortages - High-severity incidents (SEVs) - Peak support periods or escalations - Triage and troubleshoot application issues using existing runbooks and dashboards - Collaborate with Application Support and Engineering teams during incidents - Ensure all actions, findings, and resolutions are documented in ServiceNow (SNOW)

Cloud Google Cloud Platform Grafana Kubernetes Linux Prometheus Python ServiceNow

View details: Senior Site Reliability Engineer – GCP

Brazil

Apply

Frontend DevOps Engineer, EKS, K8s, Python, Terraform

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More DevOps Engineer Jobs

DevOps Engineer

Site Reliability Engineer

Senior DevOps Engineer, Web3

Senior Site Reliability Engineer – GCP