CrowdStrike logo
CrowdStrike

CrowdStrike has redefined security with the world’s most advanced cloud-native platform that protects and enables the people, processes and technologies that drive modern enterprise. Tested and proven, the world's largest organizations trust CrowdStrike to stop breaches with unparalleled protection against the most sophisticated cyberattacks. The CrowdStrike culture has been built upon our Core Values since the day we began. We are Fanatical About the Customer, Relentlessly Focused on Innovation and believe that our Limitless Passion drives Unlimited Potential for every CrowdStriker. As a purpose-built remote-first company, we believe cultivating a connected culture for every employee, no matter where they are in the world, is a key ingredient in building a high-performing, diverse team. We don’t have a mission statement. We’re on a mission—to stop breaches. Ready to join a mission that matters?

Database SRE Manager

DevOps EngineerDevOps EngineerFull TimeRemoteLeadTeam 5,001-10,000Since 2011H1B SponsorCompany SiteLinkedIn

Location

Australia

Posted

2 days ago

Salary

0

Seniority

Lead

Job Description

Database SRE Manager

CrowdStrike

• Lead and mentor a team of skilled engineers responsible for the deployment, operations, and scaling of critical data platforms including Apache Cassandra, Apache Kafka, OpenSearch, caching solutions (Memcached, Redis), relational databases (PostgreSQL, MySQL), Kubernetes, and Zookeeper. • Develop and execute long-term technical strategies to ensure the scalability, reliability, and performance of our data infrastructure. • Drive architectural decisions and innovations that align with CrowdStrike's business goals and technical roadmap. • Oversee operations in large-scale, business-critical Linux environments, balancing both cloud and bare metal infrastructures. • Collaborate with cross-functional teams to integrate data services seamlessly into CrowdStrike's broader technology ecosystem. • Implement and refine processes for continuous improvement, focusing on system reliability, performance optimization, and cost-effectiveness. • Provide technical leadership and guidance across the organization on best practices for data management and infrastructure.

Job Requirements

  • 8+ years of experience in software engineering, with at least 5 years in a leadership role managing teams of 10+ engineers
  • Deep technical expertise in distributed systems, cloud-native architectures, and data platforms
  • Proven experience in scaling and optimizing large-scale data services in high-growth environments
  • Proven experience in administrating one of these, Apache Cassandra Apache Kafka, OpenSearch, caching solutions (Memcached, Redis), relational databases (PostgreSQL, MySQL), Kubernetes, or Zookeeper
  • Strong understanding of cloud technologies, AWS, GCP, OCI and Azure
  • Experience in managing and optimizing hybrid cloud and bare metal infrastructures
  • Excellent communication skills, with the ability to articulate complex technical concepts to both technical and non-technical stakeholders
  • Track record of successful project delivery in fast-paced, rapidly evolving environments
  • Bachelor's degree in Computer Science, Engineering, or a related field; advanced degree preferred.

Benefits

  • Market leader in compensation and equity awards
  • Comprehensive physical and mental wellness programs
  • Competitive vacation and holidays for recharge
  • Paid parental and adoption leaves
  • Professional development opportunities for all employees regardless of level or role
  • Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
  • Vibrant office culture with world class amenities
  • Great Place to Work Certified™ across the globe

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Your Bourse logo

Site Reliability Engineer

Your Bourse

Trade Execution Technology for MT4, MT5 and Crypto Brokers. Liquidity, Risk Management, Reporting Platform-as-a-Service

DevOps Engineer2 days ago
Full TimeRemoteTeam 11-50Since 2017H1B No Sponsor

Role Description We are looking for a highly skilled and motivated Site Reliability Engineer to manage and scale our global infrastructure. This role involves hands-on administration of Linux servers, automation, network configuration, system hardening, and ensuring high availability and performance. You will play a key role in infrastructure planning, security, compliance, and supporting mission-critical environments. Responsibilities - Infrastructure & Network - Install, configure, and harden Ubuntu Server environments (LTS releases). - Coordinate cross-connect implementations (L2/L3) with network providers to ensure reliable connectivity and SLAs. - Design infrastructure solutions including complex network topologies. - Implement automated provisioning using Ansible and Terraform. - Weekly Maintenance & Patching - Apply OS patches and server upgrades with minimal downtime (weekend window). - Apply firmware updates and monitor global infrastructure health using Prometheus/Grafana. - Lead and execute client migrations with minimal service disruption. - Security, Backup & Support - Enforce security policies: SSH hardening, firewalls, user permissions. - Design and maintain backup strategies and disaster-recovery plans. - Provide L2/L3 support, diagnose and resolve network and server issues. - Any other duties and responsibilities relevant to the role. Qualifications - 5+ years of hands-on Linux administration (Ubuntu Server, advanced level). - Deep knowledge of BGP, networking fundamentals, and cross-connect configurations (L2/L3). - Real hands-on experience with Ansible and Terraform in production environments. - Strong scripting skills in bash and/or Python. - Solid understanding of TCP/IP, DNS, DHCP, firewalls, AppArmor/SELinux. - Experience with Docker, Prometheus/Grafana/ELK, and database administration (PostgreSQL, MySQL, ClickHouse). - Fluent in English — both written and spoken. Nice to Have - Certifications: Ubuntu Professional, RHCE, or LPIC. - Experience with cloud platforms (AWS, Azure, Google Cloud) and hybrid-cloud architectures. - Familiarity with CI/CD tools (GitHub Actions). Benefits - Competitive compensation package. - Full-time remote role. - Learning & Development support. - Paid annual leave and sick leave. - Company events and team celebrations (online and offline). - Anniversary and birthday gifts. - Clear career growth and professional development opportunities. - Supportive, inclusive, and collaborative work environment.

Worldwide
SOFTSWISS logo

Systems/DevOps Engineer

SOFTSWISS

Winning combination of software products for iGaming

DevOps Engineer2 days ago
Full TimeRemoteTeam 1,001-5,000H1B No Sponsor

• Collaboration with product teams. • Participate in the launch of new projects and new features. • Participate in the design of complex information systems. • Automate infrastructure components. • Setup and maintain infrastructure. • Consult managers and company clients. • Create and maintain technical documentation.

Poland
Poland and Eastern Europe logo

PHP Developer

Poland and Eastern Europe

Xebia is a global tech company with a journey in CEE that started with two Polish companies – PGS Software and GetInData. We are a team of 1,000+ experts delivering top-notch work across cloud, data, and software. We work on impactful projects across various sectors including fintech, e-commerce, aviation, logistics, media, and fashion, helping clients build scalable platforms and cutting-edge applications. Our clients include notable names like McLaren, Aviva, Deloitte, Spotify, Disney, ING, UPS, Tesco, Truecaller, AllSaints, Volotea, Schmitz Cargobull, Allegro, and InPost.

DevOps Engineer2 days ago
ContractRemoteTeam 1,001-5,000

Role Description We are looking for a freelance PHP Developer for an initial 3-month engagement (July–September), with the possibility of extension based on project needs and performance. - Developing and implementing a data integration between AWS-based services and an existing Yii2 application. - Working on a project that combines legacy PHP solutions with modern components built using Yii2 and Vue.js. - Building and maintaining backend functionalities and integration services. - Collaborating with the project team to ensure reliable data exchange and system stability. - Supporting application deployment and operation in Linux-based environments. Qualifications - 4+ years of commercial experience in PHP development. - Hands-on experience with Yii2. - Experience designing and integrating REST APIs. - Understanding of TCP/IP networking concepts. - Experience working with MariaDB or similar relational databases. - Familiarity with Linux environments and containerized applications. - Exposure to Kubernetes-based environments. - Knowledge of HTML5, CSS, JavaScript, and Vue.js. - Practical experience using AI-powered assistants (e.g. Claude Code, GitHub Copilot, Cursor) to improve productivity, quality, or decision-making in software delivery. - Ability to work independently and deliver solutions within defined timelines. - Good command of English (min. B2). - Immediate availability or availability to start in July is highly important. - Work from the European Union region and a work permit are required. Requirements - Knowledge of German (nice to have). - Experience working with AWS services and integrations (nice to have). - Experience applying GenAI in a more structured way within the SDLC, including defined workflows, prompt patterns, or tool integrations embedded into daily work (nice to have). - Interest in and familiarity with emerging AI-driven practices (e.g. agent-based workflows, automation patterns, AI-augmented development), with a willingness to explore and experiment beyond standard approaches (nice to have). Recruitment Process - CV review - HR call - Technical Interview - Client Interview - Decision

Romania
Dev.Pro logo

Junior Site Reliability Engineer

Dev.Pro

Software Development Partner. Result-driven. Quality-obsessed.

DevOps Engineer2 days ago
Full TimeRemoteTeam 501-1,000Since 2011H1B No Sponsor

Role Description We invite a Junior Site Reliability Engineer based in Chile to join our team and support the stability and performance of a modern mobile point-of-sale (POS) platform that offers features like mobile payments, reporting, inventory, and customer management. In this role, you’ll be responsible for providing first-line operational support, monitoring production systems, and helping resolve incidents. Qualifications - 1+ year of experience in operational support, technical support, or SRE/DevOps-related roles - Understanding of production support, incident response, and troubleshooting - Familiarity with at least one programming language and basic debugging skills - Scripting skills in Bash, PowerShell, or Python for automation - Exposure to AWS, Azure, or GCP - Experience working with APIs and troubleshooting integrations - Familiarity with monitoring and observability tools, including using logs and metrics for troubleshooting - Basic Git knowledge (cloning repositories, committing changes) - Basic Linux/Windows administration (user management, process monitoring, package installation, and troubleshooting) - Understanding of virtualization and containerization, including creating and running containers in a test environment - Exposure to MDM solutions and mobile POS platforms - Strong problem-solving, communication, and cross-functional collaboration skills - Intermediate English level Requirements - Desirable: Exposure to containerization tools such as Docker - Basic understanding of CI/CD concepts and deployment workflows - Basic understanding of microservices architecture - Hands-on experience with IaC tools such as Terraform or Ansible in a test environment - Basic networking knowledge, including TCP/UDP, HTTP, DNS, OSI model, routing, and subnetting Key Responsibilities - Provide first-line operational support and monitor production systems - Troubleshoot issues across cloud services, APIs, and integrations, and apply corrective actions - Manage incident escalations and collaborate with teams on resolutions - Participate in a 24/7 on-call rotation for incident response coverage - Support remote software deployments and assist with MDM administration - Manage system access and user permissions - Support basic internet security (VPN, firewall, and SSL certificates) - Write and maintain simple automation scripts (Bash, PowerShell, or Python) - Contribute to runbooks, documentation, and internal knowledge base updates - Participate in post-incident reviews and contribute to operational improvements - Work under the guidance of a mentor or team lead, following defined processes and escalating risks or issues as needed Benefits - 99.9% remote — you can work from anywhere in the world - 30 paid days off per year to use however you like — vacations, holidays, or personal time - 5 paid sick days, up to 60 days of medical leave, and up to 6 paid days off per year for major family events like weddings, funerals, or the birth of a child - Partially covered health insurance after the probation, plus a wellness bonus for gym memberships, sports nutrition, and similar needs after 6 months - We pay in U.S. dollars and cover all approved overtime - Join English lessons and Dev.Pro University programs, and take part in fun online activities and team-building events

Chile
Job Closed