Job Closed
This listing is no longer active.
Cribl, the Data Engine for IT and Security, empowers organizations to transform their data strategy.
Staff Site Reliability Engineer
Location
Australia
Posted
83 days ago
Salary
0
Seniority
Lead
Job Description
Staff Site Reliability Engineer
Cribl
• Engage with teams and improve service delivery and reliability across their entire lifecycle
• Measure and monitor all production systems with an eye towards availability, latency, and overall system health
• Seek out the cause of errors and instability in our production cloud services and drive teams towards better operational excellence
• Engage with product and platform teams to improve and evolve systems by lobbying for changes that improve reliability, resilience, and observability
• Help identify and drive down toil through creative innovation and automation
• This position will require stand-by, on-call, or off-hours duties
Job Requirements
- Proven experience designing, implementing, and operating observability systems for complex cloud-based platforms, with deep knowledge of best practices and a strong drive to implement them using Cribl products.
- Experience with configuration management and Infrastructure as Code tools such as Terraform (preferred) or Ansible. Experience working with cloud SDKs is also a plus.
- Knowledge of cloud platforms (AWS and Azure preferred) and container and orchestration technologies.
- Experience with APM, observability, and related tools such as New Relic, Splunk, CloudWatch, Prometheus, Grafana/Kibana, and Sentry.
- Extensive experience with enterprise-scale continuous delivery environments.
- Development with JavaScript/Node.js/TypeScript in a Linux/Mac environment.
- Experience with sustainable incident response in a blameless environment.
- Background in Linux Systems Engineering.
- Experience with incident response tools such as PagerDuty, FireHydrant, and Blameless.
- Comfortable with a high level of autonomy and working with a distributed team.
- Knowledge of Cloud and application security best practices.
- Strong knowledge of cloud design patterns for scale, data management, resiliency, etc.
- A love for high quality and a knack for testing.
- Opinions about business metrics and SLOs.
Benefits
- Diversity drives innovation, enables better decisions to support our customers, and inspires change for the better.
- Culture where differences are valued and welcomed.
More DevOps Engineer Jobs
• CI/CD on GitHub Actions: Design, implement, maintain, and optimize CI/CD pipelines for multiple projects, ensuring good practices, scalability, and observability.
• Branching model governance: Define and oversee efficient branching models, ensuring consistency across teams and good practices for releases, hotfixes, and versioning.
• Automation and scripting: Create and maintain scripts in Bash, YAML, and Python to support integration, deployment, and internal automation processes.
• Deployment infrastructure: Manage deployments on Kubernetes (AKS / OpenShift) and use Helm to manage packages and reproducible configurations.
• Integration of additional tools: Work with related systems such as Kafka, SonarQube, Jira, Confluence, and other elements of the DevOps toolchain.
• Multidisciplinary collaboration: Work with development, QA, and operations teams to ensure agile, secure, and consistent delivery flows.
• CI/CD platform evolution: Be part of the team responsible for evolving the corporate platform based on GitHub Actions, proposing improvements, new tools, and standards.
Configuration Engineer – DevOps
Veeam Software
Your Single Backup and Data Management Platform for Cloud, Virtual and Physical
• Develop, maintain, and monitor build automation systems to ensure seamless software builds and deployments
• Write and maintain scripts for infrastructure deployment automation, monitoring, and configuration management
• Monitor and troubleshoot pipeline failures, optimize build performance, and document pipeline architecture and operational standards
• Implement and manage configuration management tools to ensure consistent environments across development, testing, and production
• Document processes, procedures, and configurations to ensure clear communication and continuity
• Work closely with software developers, QA engineers, and IT operations to ensure smooth delivery of the product
• Serve with high autonomy as DRI for build and release infrastructure, owning CI/CD reliability and security SLAs; lead incident response and postmortems; and drive preventive, end-to-end improvements
Senior DevOps Engineer
AssureSoft - Careers
AssureSoft is a multinational software development and information technology company providing strategic consulting, technology services, and outsourcing business processes. We work to innovate and create quality software with motivated, passionate, and qualified teams that develop in an environment of professional, stable growth and continuous learning.
Inclusive Opportunities for Every Talent. At AssureSoft, we believe that true innovation is born from diversity of ideas, experiences, and perspectives. That’s why our hiring practices are inclusive and reflect a firm commitment to equity and equal opportunity. Here, every person, regardless of origin, gender, orientation, or beliefs, finds a space to grow, contribute, and be valued not only for their talent, but also for who they are.
Role Description
The key to our success is simple: we deliver the highest quality, on time, with passion and commitment, and live every day by a set of great values. We've built a company that fosters long-term careers in an environment of continuous learning, with cutting-edge benefits, tools, and resources. We also hold certifications that endorse our company as a great place to work.
- Design, migrate, and operate cloud-native platforms across AWS, GCP, and OCI.
- Translate AWS architectures into equivalent or optimized GCP and OCI solutions.
- Map cloud services across providers and document architectural trade-offs.
- Build and maintain Infrastructure as Code using Terraform.
- Support and enhance CI/CD pipelines for multi-cloud environments.
- Collaborate with platform teams to establish networking, IAM, and security standards.
- Apply SRE practices to improve system reliability, availability, and performance.
- Support Kubernetes-based workloads and cloud-native applications.
- Partner with application teams to modernize workloads using containers and managed services.
- Contribute to cost optimization, resilience, and disaster recovery strategies.
Qualifications
- 5–7+ years of experience in DevOps, Cloud Engineering, or Platform Engineering.
- Strong hands-on experience with AWS (VPC, IAM, EC2, EKS, RDS, S3, Lambda, Load Balancers).
- Working experience with GCP (GKE, Compute Engine, VPC, IAM, Cloud Storage).
- Exposure to or experience with OCI (OKE, Compute, VCN, Object Storage).
- Proven ability to map AWS services to GCP and OCI equivalents.
- Strong Kubernetes and cloud-native architecture knowledge.
- Proficiency with Terraform across multiple cloud providers.
- Solid understanding of cloud networking concepts.
- Familiarity with DevOps and SRE practices.
- Strong communication skills and cross-team collaboration.
Requirements
- Cloud certifications (AWS, GCP, OCI) and experience with multi-cloud environments.
- Experience with GitOps tools (ArgoCD) or event-driven architectures.
Benefits
- Great Place To Work certification.
- A company with more than 15 years of experience.
- Work with world-class clients and long-term projects.
- English scholarships for an external institute.
- English classes with company teachers.
- State-of-the-art tools and resources.
- Certifications for your professional growth.
- Recreation and leisure activities.
- Compliance with the regulations and labor rights of your region.
• Design, build, and maintain AWS-based infrastructure for high-performance analytics platforms
• Use Terraform for Infrastructure as Code to ensure scalable and reproducible deployments
• Implement and optimize Kubernetes clusters for container orchestration
• Automate CI/CD pipelines, monitoring, and alerting processes
• Configure and manage databases including PostgreSQL, MongoDB, and Kafka
• Ensure cloud security, cost optimization, and disaster recovery practices
• Contribute to backend development using Scala and Python
• Develop and maintain APIs, microservices, and data pipelines
• Collaborate with engineers to ensure code quality and smooth CI/CD integration
• Support cross-functional teams with scalable technical solutions



