Empowering lean security operations teams of any skill to successfully secure their environments. WE ARE HIRING!
Staff SRE Engineer
Location
Taiwan
Posted
18 days ago
Seniority
Lead
Job Description
Stellar Cyber
• Administer and maintain container orchestration platforms and containerized workloads.
• Monitor and troubleshoot production systems, participating in on-call rotations to ensure reliability.
• Drive observability improvements by enhancing monitoring, logging, and alerting capabilities across systems and data platforms.
• Administer and optimize cloud-based environments across multiple providers.
• Manage and support distributed data platforms and real-time processing systems.
• Develop and maintain continuous integration and delivery pipelines for efficient and reliable deployments.
• Own and implement Infrastructure as Code (IaC) practices to ensure consistency and scalability.
• Automate and orchestrate infrastructure using programming and scripting languages.
• Perform system administration and networking tasks to support internal and external environments.
• Collaborate effectively with engineers and stakeholders across different time zones.
Job Requirements
- 5+ years of experience in Site Reliability Engineering, DevOps, or Platform Engineering roles.
- Proven success leading large-scale production systems in cloud environments (AWS, GCP, Azure, or OCI).
- Demonstrated leadership in driving incident response, on-call best practices, and reliability-focused culture.
- Strong experience with production on-call operations and incident management.
- Advanced proficiency in Kubernetes administration and troubleshooting.
- Hands-on experience with observability tools: Prometheus, Grafana, Loki, and Alertmanager.
- Knowledge of chat-based operations interfaces and/or auto-remediation controllers built on an AI agent framework.
- Understanding of AI agents for auto-triaging alerts, correlating signals, and suggesting root-cause hypotheses.
- Expertise in operating data platforms (Elasticsearch, MongoDB, Spark, Kafka, Redis).
- Proficiency with public cloud services (AWS, Azure, GCP, or OCI).
- Strong programming and automation skills in Python and Bash.
- Deep understanding of Infrastructure as Code (Terraform, Helm).
- Experience with CI/CD pipelines (GitHub Actions, Bitbucket, ArgoCD).
- Strong technical background in distributed systems, databases, networking, and Linux administration.
- Excellent problem-solving, communication, and leadership abilities.
- Bachelor's degree in Computer Science, Engineering, or a related technical field.
- Certifications in AWS, GCP, Observability, Linux or Kubernetes are a plus.
More DevOps Engineer Jobs
• Design, implement, and maintain robust and scalable automation solutions for physical and virtual server infrastructure provisioning using tools like Ansible, Python, or similar
• Develop and maintain automation scripts and workflows leveraging various APIs (e.g., cloud provider APIs, virtualization platform APIs, hardware management APIs, internally developed APIs)
• Architect, implement, and manage our Kubernetes infrastructure for deploying, scaling, and maintaining containerized applications
• Develop and maintain CI/CD pipelines to automate the build, test, and deployment of applications to Kubernetes
• Implement and maintain infrastructure monitoring, alerting, and logging solutions to ensure system health and performance
• Collaborate closely with development teams to integrate automation and DevOps practices into their workflows
• Drive the adoption of DevOps best practices, including infrastructure as code, continuous integration, continuous delivery, and automated testing
• Troubleshoot and resolve infrastructure and deployment-related issues in a timely and efficient manner
• Contribute to the development and maintenance of internal tooling and documentation
• Mentor and guide junior members of the team
• Stay up-to-date with the latest trends and technologies in DevOps, automation, and cloud computing
• Participate in on-call rotations as needed to ensure the availability of critical systems.
Senior Engineering Manager – DevOps, Infrastructure, Release Engineering
Abacus Insights
Abacus Insights is a technology company working to improve people's lives by "harnessing the healthcare data explosion" through intelligent integration software. Founded in 2017.
• Lead, coach, and grow DevOps, Infrastructure, and Release Engineering teams across geographies
• Establish clear ownership and accountability across CI/CD, environments, and production readiness
• Conduct regular 1:1s, performance reviews, goal setting, and career development planning
• Hire, onboard, and retain senior DevOps and release engineering talent
• Foster a culture of operational ownership, blameless postmortems, and high availability
• Own the release engineering strategy for a SaaS platform supporting frequent, reliable deployments
• Design and continuously improve CI/CD pipelines that support:
  - Trunk-based or hybrid branching strategies
  - Automated testing gates (unit, integration, security, performance)
  - Progressive delivery (feature flags, canaries, blue-green deployments)
• Establish deployment standards, change classification, and risk levels across teams
• Reduce release toil, manual steps, and human error through automation
• Drive improvements in deployment frequency, change failure rate, and mean time to recovery (DORA metrics)
• Ensure consistent release processes across services, platforms, and cloud environments
• Provide technical leadership across AWS, Azure, and GCP environments
• Ensure infrastructure and delivery pipelines follow security, compliance, and cloud best practices
• Partner with Architecture and Security to enforce:
  - Infrastructure as Code (Terraform, CloudFormation, ARM/Bicep)
  - Secure-by-default deployments
  - Policy-as-code and guardrails
• Influence architecture decisions to improve operability, scalability, and release safety
• Own production readiness standards and release go/no-go criteria
• Ensure clear incident response processes, escalation paths, and on-call readiness
• Partner deeply with Product, QE, Security, and Engineering leaders to align on release expectations
Staff Engineer: DevOps
Dispel
Moving Target Defense-based remote access systems for people and machines.
Role Description
Dispel is redefining how the world’s most critical industries connect, protect, and operate. Built for both Operational Technology (OT) and security teams, our Zero Trust Engine delivers secure, scalable connectivity across every make, model, and generation of equipment, enabling fast, reliable remote access, industrial data streaming, and integrated threat monitoring in even the most complex environments.
We don’t just keep operations safe; we make them better. With OTFusion, Dispel unifies applications and systems across sites, streamlining operations, cutting complexity, and driving measurable efficiency gains.
Since 2015, we’ve been pioneering cybersecurity innovation:
- Inventing network-level Moving Target Defense (MTD)
- Securing 54 million utility users worldwide
- Protecting over $500B in manufactured goods annually
- Ensuring the everyday essentials people rely on, from 50% of the U.S. baby formula supply to 1 in 5 non-alcoholic beverages in America, are made and delivered safely.
If you're passionate about providing security for all, this is the place to be.
Qualifications
- 7+ years (IC5) of experience shipping production-ready software.
- Self-reliant in DevOps, Infrastructure-as-code, orchestration, automation, and redundancy methodologies and techniques.
- Familiarity with DevSecOps principles and integrating security into CI/CD pipelines is a plus.
- Professional experience with containers, AWS, AWS Well Architected Framework, and Terraform.
- Experience with delivering web applications to production in a reliable manner.
- Experience guiding teams through the software development lifecycle from local development to production.
- Experience crafting logging, metrics, and alerting systems that deliver value to teams.
- An ability to communicate complex technical topics clearly and succinctly in person, in written documentation, and over team chat.
- A willingness to accept failure and feedback, learn, and try again.
- A passion for learning new disciplines and gaining a deep understanding of how others on the team do their work.
- Bonus: SysAdmin experience. We want to automate anything that currently requires a SysAdmin.
- Bonus: Certifications to prove knowledge and experience.
Benefits
- At Dispel you’ll be rewarded and recognized for your performance in an environment that will challenge you and give you clear direction on what it takes to succeed in your role as well as provide development for other roles you may be interested in.
- Salary range for role: $138,000-154,000 USD annually
- Comprehensive health, dental, and vision insurance
- 401(k) with company match
- Opportunity for incentive units grant and performance bonus
- Generous paid time off and holidays
- Flexible work environment with opportunities for remote work
Our compensation reflects the cost of labor across several U.S. geographic markets, and we pay differently based on those defined markets. Your exact offer may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, and experience.
This is a career growth opportunity and an FLSA-exempt role. The position will require working more than 40 hours per week at times to meet business needs.
Beware of Hiring Scams: Dispel will never ask for payment or sensitive personal information such as social security numbers during the hiring process. All official communication will come from a verified company email address. If you receive suspicious requests or communications, please report them to people@dispel.com. All of our legitimate openings can be found on the Dispel Career Site at https://apply.workable.com/dispel/
• Provide software development, continuous integration, software delivery, systems administration, software quality, and systems documentation support to our federal government client’s digital assets
• Meet with the Application Development Team and content contributors to develop and discuss strategies and plans for our government client’s web products, to analyze web content needs, and to propose ways the client's web products can address those needs
• Assist with configuration, usage, and monitoring of cloud-based infrastructure services, including but not limited to Cloudbees, Ansible, Terraform, AWS EKS, EC2, RDS, Lambda, Elastic Container Service, and CloudFormation
• Assist in building a strong technical foundation in build, release, and production using continuous integration tools such as Jenkins and infrastructure provisioning tools such as Ansible
• Engage with various client personnel to understand requirements to develop better software for the Bureau and identify new ways in which the development team can easily solve client issues
• Collaborate with the client on the design, development, and data teams to build valuable tools that benefit the client's day-to-day operations and broader missions
• Provide training and expertise on a variety of DevOps methodologies, technologies, best practices, and tools, along with insight into new technologies and solutions that could help the Application Team and the client at large
• Assist in the development of Use Cases, Requirements Definition Documents, User and Administration Manuals, Detailed Design Specifications, and Training Manuals and Plans



