Modern Data Orchestration
Customer Reliability Engineer, Airflow
Location
Worldwide
Posted
3 days ago
Salary
$125K - $130K / year
Seniority
Mid Level
No structured requirement data.
Job Description
Customer Reliability Engineer, Airflow
Astronomer
Role Description As an Airflow Reliability Engineer on the Customer Reliability Engineering (CRE) team at Astronomer, you will have the opportunity to become an Apache Airflow expert, learning directly from leaders of the Airflow project. You’ll provide Apache Airflow expertise directly to customers to help them make the best possible use of our managed Airflow service. CRE is Astronomer’s support team. Because our customers are sophisticated organizations who need and expect high levels of expertise to help them keep mission critical uses of Apache Airflow working consistently, we look a little different from most support teams. Nearly every ticket you will work requires an intersection of strong technical knowledge and customer empathy to understand what the customer needs and how to get them there. Every day is a new challenge and a new thing to learn. When you learn a new piece of technology, are you aiming not just to get started but to become the expert? Do you listen to the plumber when they tell you what is wrong with the pipes? Are you the kind of person who takes an MIT OpenCourseWare course and actually finishes it? Then this role could be for you. What you get to do: - Learn and build expertise across several software engineering disciplines, including: - Airflow and data engineering - Kubernetes - Cloud Engineering - Gain exposure to the big picture; learn about product, engineering, customer relationship management, and more. - Solve challenging Airflow problems for our customers. From optimizing configuration to identifying world-first Airflow bugs, you’ll see it all here. - Spend up to 20% of your time on side projects that contribute to Astronomer’s overall success, such as contributing to the open-source Airflow repository or developing Astronomer’s internal monitoring and alerting systems built on Airflow. - Work on a modern, sophisticated, cloud-native product that customers use to connect to dozens of other systems. Gain depth and breadth of learning! - Work directly with our customers’ data engineers, system admins, DevOps teams, and management. - Provide feedback from your experience that can shape the direction of the Airflow project. - Own the customer experience, working directly with customers to prioritize and solve issues, meet SLAs, and provide “white glove” guidance on the path to production. - Participate remotely within a fully distributed team. - Help maintain 24x7 coverage through a specified 6-hour pager period during your work day. - Participate in paid on-call rotation for weekend coverage. Qualifications - Data Engineering background - 4 years of experience with Python - 1 year of experience in Airflow administration and DAG creation - Experience with Kubernetes/Docker/Containers - Experience working with a distributed system with any major cloud provider (AWS, GCP, Azure) - Problem-solving and troubleshooting abilities - Ability to work well with autonomy and independence - Strong written and verbal communication for connecting with our customers over our ticketing system and through Zoom - Experience mentoring junior team members Requirements - Bonus points if you have: - Contributions to open-source projects - Customer Support experience - Familiarity with SQL and PostgreSQL - Experience with Databricks, Snowflake, Redshift, dbt, or other similar data engineering tools Benefits - The estimated total compensation for this role ranges from $125,000 - $130,000 based on leveling and geography, along with an equity component and a comprehensive benefits package. This range is merely an estimate; actual compensation may deviate from this range based on skills, experience, and qualifications. Company Description At Astronomer, we value diversity. We are an equal opportunity employer: we do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, or disability status.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
• Maintain and support existing AWS-based Kubernetes infrastructure and services. • Ensure platform reliability, availability, and security across cloud environments. • Develop and maintain GitLab CI/CD pipelines to support software delivery. • Build and maintain Terraform modules and infrastructure-as-code solutions. • Create and manage Helm charts for Kubernetes application deployments. • Troubleshoot infrastructure, platform, and application issues across AWS and Kubernetes environments. • Manage AWS IAM permissions and cross-account access configurations. • Partner with software and platform engineering teams to support operational needs and platform improvements. • Continuously improve system performance, automation, observability, and operational efficiency.
• Design, implement, and support CI/CD pipelines, build automation, and deployment workflows across development, test, and production environments. • Engineer secure and scalable cloud platform capabilities using infrastructure-as-code, container platforms, and automation tooling. • Support platform reliability, observability, access controls, secrets management, and operational readiness. • Partner with software engineers to improve developer experience, deployment efficiency, and release repeatability. • Implement and maintain DevSecOps controls across build, deploy, scanning, monitoring, and incident response workflows. • Manage platform components such as containers, orchestration platforms, artifact repositories, and shared runtime services. • Troubleshoot environment, pipeline, and deployment issues and support root cause analysis and service restoration. • Contribute to platform standards, runbooks, technical documentation, and continuous improvement initiatives. • Support change management, patching, upgrades, and operational governance for shared services. • Collaborate with architects and leadership to align platform evolution with future-state delivery needs.
Senior Site Reliability Engineer, Government
SentinelOneSecure your enterprise with the autonomous cybersecurity platform. Endpoint. Cloud. Identity. XDR. Now.
• Drive continuous software delivery, resolve incidents, run post mortems, and create automation strategies for deployment, self-testing, and alerting. • Lead and execute incident management for production issues, ensuring rapid recovery, root cause analysis, and preventative follow-up actions. • Improve and optimize the observability strategy by collaborating with application engineering teams to design monitoring solutions that enhance alerting capabilities and reduce noise. • Define, implement, and monitor SLOs, SLIs, and SLAs in collaboration with product and engineering teams to align with business objectives. • Design, develop, and maintain software solutions that address operational, compliance, and pipeline challenges. • Own and coordinate all government environment releases, driving process improvements to enhance the release pipeline's efficiency, reliability, and visibility. • Partner cross-functionally with engineering, product, SecOps, compliance, and leadership teams to align priorities, define testing strategies, and resolve challenges. • Ensure all infrastructure and deployments meet FedRAMP, government regulations, and industry standards, while maintaining required release documentation and risk assessments.
Senior DevOps Engineer
UnqorkUsing CaaS (Codeless-as-a-Service) to accelerate time-to-market & eliminate legacy code for the enterprise 🚀
• Reporting to the Director of DevOps • Build the next-generation control plane that provisions, configures, and manages Unqork's Kubernetes fleet across commercial, government, and edge customer environments, continuing to push toward an architecture that is automated, modular, and built to scale • Design and deliver self-service infrastructure tooling that enables Ops and Support teams to execute common operational workflows without engineering intervention, shifting operational work left and freeing engineers to build • Drive observability improvements across the fleet, establishing alerting and instrumentation that produces cleaner signal, enables faster mitigation, and supports deeper root cause analysis when incidents occur • Improve the internal engineering experience by building faster CI/CD pipelines, better tooling, and paved-road patterns that reduce cognitive load and make the right way to build and ship the obvious way • Collaborate closely with the Principal Architect to shape technical direction across the control plane, infrastructure automation, and cross-cutting infrastructure concerns




