The most trusted source of semiconductor analysis and market information
Senior Site Reliability Engineer
Location
Poland
Posted
20 days ago
Salary
zł18.8K - zł20K / year
Seniority
Senior
Job Description
Senior Site Reliability Engineer
TechInsights
• Own SLOs, SLIs, and error budgets for all production services; drive error budget discipline across engineering • Design reliability patterns for AI agent pipelines: LLM observability, tool-use tracking, failure detection, and graceful degradation • Architect for blast radius containment — agent failures must have bounded customer impact through isolation, circuit breaking, and rapid recovery • Mature our Canada Central/West active-active architecture toward 24-hour RTO with full regional failover • Lead incident response and post-incident reviews that produce durable fixes; maintain DR procedures through regular testing • Serve as the primary reliability liaison to Software and AI Engineering, translating requirements into actionable standards • Partner with AI Engineering on compute provisioning, model serving, inference latency, and workload isolation • Own CI/CD pipeline strategy (Bitbucket Pipelines, GitHub Actions) — set standards, optimize deployment frequency, and ensure teams can ship confidently • Drive IDP adoption and enable teams on SRE practices: on-call readiness, SLO definition, runbook development, and self-service tooling • Represent reliability in architectural discussions; surface risk before it's committed to design • Operate Datadog as the single pane of glass for service health, infrastructure, and agentic pipeline telemetry • Extend observability to AI workloads: LLM latency, token consumption, agent completion rates, and pipeline throughput • Build golden path templates in Backstage and/or Atlassian Compass so teams ship reliably without routine SRE involvement • Own infrastructure as code via Terraform and GitOps; enforce IaC policy in partnership with Trust Assurance • Own FinOps visibility into AWS cost segments; model cloud cost impact as AI/ML workloads scale • Formally mentor junior and intermediate SRE engineers, with accountability for their technical growth and career progression • Build AI-assisted automation to progressively reduce toil and scale the team's operational capacity
Job Requirements
- Bachelor's degree in Computer Science, Engineering, or equivalent combination of education and experience
- 6–8 years of progressive experience in site reliability engineering, platform engineering, or DevOps, with demonstrated technical leadership at the senior individual contributor level
- Deep expertise in AWS (EKS, Lambda, CloudWatch, AWS Config) and multi-region architecture patterns
- Proficiency with Terraform and GitOps; experience with policy-as-code (Sentinel, OPA/Rego, or equivalent)
- Hands-on Datadog experience at operational depth: dashboards, SLO tracking, alerting, log management, distributed tracing
- Strong containerization expertise: Docker, Kubernetes (EKS preferred)
- Proficiency in Python and/or Bash; experience building operational tooling; solid understanding of Java and Spring Boot microservice architecture sufficient to make reliability and deployment decisions for EKS-hosted services
- Deep expertise in CI/CD pipeline design and optimization using Bitbucket Pipelines and GitHub Actions
- Familiarity with IDP tooling (Backstage, Atlassian Compass, or equivalent) strongly preferred
- Experience with AI/ML workload infrastructure, LLM API integration, or agentic system operations considered a strong asset
Benefits
- Company-sponsored training and development opportunities
- Comprehensive benefits package (health, wellness, life insurance, fitness, English classes)
- Flexible vacation policy
- Community involvement opportunities through charitable alliances
- Wellness resources and support
- Inclusive environment that prioritizes diversity, equity, and accessibility
- High-growth company driven by high performance
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Lead Mission Engineer (DevOps)
STRSTR makes the world a safer place by developing technology and applying it to solve emerging national security challenges.
About the Team: Information processing and sense-making systems are the lifeblood of national security efforts, facilitating an understanding of the global situation, strategic planning, and tactical execution. The reliability, accessibility, and sophistication of these systems can determine the outcomes of conflicts before they begin. STR’s Analytics and Command & Control (AC2) Division focuses on developing actionable, advanced technology solutions to provide asymmetric advantages within the information domain. The Mission Applications (MA) Group within the AC2 Division specializes in ensuring the technology that STR creates produces outsized mission impact. Staff within the MA group have a deep understanding of missions and technology needs for national security with skills targeting transition of technology to operational use, including product management, UX Product implementation, software integration & mission engineering, platform engineering, DevOps, and program management. MA staff combine with other science and engineering staff from the various STR research and development groups to form dedicated product teams focused on accelerating the operational transition of cutting-edge software technologies. The Role: As a Lead Mission Engineer (DevOps) in the Mission Applications Group, you will develop and deploy software essential for a specific, real-world objective or "mission" in partnership with our customers. You have: 1) expertise in developing and deploying software solutions in complex and high-security environments, 2) deep domain knowledge in the complex missions executed across the Department of Defense, and 3) expertise in modern frameworks, building full stack applications, and integrating APIs for deployment in high-security environments. You will work as a leader of software teams to design, develop, and maintain infrastructure and tooling to deploy software in operational use. You have experience managing relationships with customer environments while simultaneously leading engineering teams to ensure the delivered product meets the highest standards of quality, reliability, and performance. What you will do: - Supporting cloud (e.g., AWS) toolsets in unclassified environments - Configuring and maintaining multiple CI/CD environments at different classification levels, focusing on developer productivity, application performance, security monitoring, and alerting - Deployment and Field Testing: Traveling to integration sites to deploy, test, and triage software in secure operational environments - Running and managing incident response and post-outage reviews to improve deployment, monitoring, and alerting processes - Coaching team members on DevOps best practices while continuously helping to improve processes. Who You Are: - Active Top Secret Security Clearance with SCI eligibility, for which U.S citizenship is needed by the U.S government - BS in Computer Science or related technical field with at least 7 years of work experience. Equivalent experience will be considered - Experience deploying software for DoD or IC missions - A passion for crafting user-centric interfaces and a keen eye for design details - Demonstrated strong knowledge of modern technologies such as React, TypeScript, and Python - Knowledge of performance-focused tools and techniques for rendering large datasets and building responsive, smooth interfaces - A demonstrated ability to adopt new languages, libraries, and technologies - Organized, detail-oriented, and with an ability to work both independently and collaboratively - Demonstrated experience as an effective communicator to both technical and non-technical audiences Even Better: - Active TS/SCI security clearance - Advanced Degree in Computer Science, Information Technology, or related technical field - Prior US Military Service with experience in joint planning, large force battle monitoring, and fluidity with large force debrief best practices - Located in San Diego, CA, Denver, CO, Arlington, VA, Boston, MA, or willing to relocate Pay Information Full-Time Salary Range: $175,000 - $240,000 The salary range listed is based on external market data. Offers are based on factors, such as but not limited to, the candidate’s experience, education, training, key skills/critical skills, security clearances, and prevailing market and business conditions. STR is a growing technology company with locations near Boston, MA, Arlington, VA, near Dayton, OH, Melbourne, FL, and Carlsbad, CA. We specialize in advanced research and development for defense, intelligence, and national security in: cyber; next generation sensors, radar, sonar, communications, and electronic warfare; and artificial intelligence algorithms and analytics to make sense of the complexity that is exploding around us. STR is committed to creating a collaborative learning environment that supports deep technical understanding and recognizes the contributions and achievements of all team members. Our work is challenging, and we go home at night knowing that we pushed the envelope of technology and made the world safer. STR is not just any company. Our people, culture, and attitude along with their unique set of skills, experiences, and perspectives put us on a trajectory to change the world. We can't do it alone, though - we need fellow trailblazers. If you are one, join our team and help to keep our society safe! Visit us at www.str.us for more info. STR is an equal opportunity employer. We are fully dedicated to hiring the most qualified candidate regardless of race, color, religion, sex (including gender identity, sexual orientation and pregnancy), marital status, national origin, age, veteran status, disability, genetic information or any other characteristic protected by federal, state or local laws. If you need a reasonable accommodation for any portion of the employment process, email us at appassist@str.us and provide your contact info. Pursuant to applicable federal law and regulations, positions at STR require employees to obtain national security clearances and satisfy the requirements for compliance with export control and other applicable laws.
Lead Mission Engineer (DevOps)
STRSTR makes the world a safer place by developing technology and applying it to solve emerging national security challenges.
About the Team: Information processing and sense-making systems are the lifeblood of national security efforts, facilitating an understanding of the global situation, strategic planning, and tactical execution. The reliability, accessibility, and sophistication of these systems can determine the outcomes of conflicts before they begin. STR’s Analytics and Command & Control (AC2) Division focuses on developing actionable, advanced technology solutions to provide asymmetric advantages within the information domain. The Mission Applications (MA) Group within the AC2 Division specializes in ensuring the technology that STR creates produces outsized mission impact. Staff within the MA group have a deep understanding of missions and technology needs for national security with skills targeting transition of technology to operational use, including product management, UX Product implementation, software integration & mission engineering, platform engineering, DevOps, and program management. MA staff combine with other science and engineering staff from the various STR research and development groups to form dedicated product teams focused on accelerating the operational transition of cutting-edge software technologies. The Role: As a Lead Mission Engineer (DevOps) in the Mission Applications Group, you will develop and deploy software essential for a specific, real-world objective or "mission" in partnership with our customers. You have: 1) expertise in developing and deploying software solutions in complex and high-security environments, 2) deep domain knowledge in the complex missions executed across the Department of Defense, and 3) expertise in modern frameworks, building full stack applications, and integrating APIs for deployment in high-security environments. You will work as a leader of software teams to design, develop, and maintain infrastructure and tooling to deploy software in operational use. You have experience managing relationships with customer environments while simultaneously leading engineering teams to ensure the delivered product meets the highest standards of quality, reliability, and performance. What you will do: - Supporting cloud (e.g., AWS) toolsets in unclassified environments - Configuring and maintaining multiple CI/CD environments at different classification levels, focusing on developer productivity, application performance, security monitoring, and alerting - Deployment and Field Testing: Traveling to integration sites to deploy, test, and triage software in secure operational environments - Running and managing incident response and post-outage reviews to improve deployment, monitoring, and alerting processes - Coaching team members on DevOps best practices while continuously helping to improve processes. Who You Are: - Active Top Secret Security Clearance with SCI eligibility, for which U.S citizenship is needed by the U.S government - BS in Computer Science or related technical field with at least 7 years of work experience. Equivalent experience will be considered - Experience deploying software for DoD or IC missions - A passion for crafting user-centric interfaces and a keen eye for design details - Demonstrated strong knowledge of modern technologies such as React, TypeScript, and Python - Knowledge of performance-focused tools and techniques for rendering large datasets and building responsive, smooth interfaces - A demonstrated ability to adopt new languages, libraries, and technologies - Organized, detail-oriented, and with an ability to work both independently and collaboratively - Demonstrated experience as an effective communicator to both technical and non-technical audiences Even Better: - Active TS/SCI security clearance - Advanced Degree in Computer Science, Information Technology, or related technical field - Prior US Military Service with experience in joint planning, large force battle monitoring, and fluidity with large force debrief best practices - Located in San Diego, CA, Denver, CO, Arlington, VA, Boston, MA, or willing to relocate Pay Information Full-Time Salary Range: $175,000 - $240,000 The salary range listed is based on external market data. Offers are based on factors, such as but not limited to, the candidate’s experience, education, training, key skills/critical skills, security clearances, and prevailing market and business conditions. STR is a growing technology company with locations near Boston, MA, Arlington, VA, near Dayton, OH, Melbourne, FL, and Carlsbad, CA. We specialize in advanced research and development for defense, intelligence, and national security in: cyber; next generation sensors, radar, sonar, communications, and electronic warfare; and artificial intelligence algorithms and analytics to make sense of the complexity that is exploding around us. STR is committed to creating a collaborative learning environment that supports deep technical understanding and recognizes the contributions and achievements of all team members. Our work is challenging, and we go home at night knowing that we pushed the envelope of technology and made the world safer. STR is not just any company. Our people, culture, and attitude along with their unique set of skills, experiences, and perspectives put us on a trajectory to change the world. We can't do it alone, though - we need fellow trailblazers. If you are one, join our team and help to keep our society safe! Visit us at www.str.us for more info. STR is an equal opportunity employer. We are fully dedicated to hiring the most qualified candidate regardless of race, color, religion, sex (including gender identity, sexual orientation and pregnancy), marital status, national origin, age, veteran status, disability, genetic information or any other characteristic protected by federal, state or local laws. If you need a reasonable accommodation for any portion of the employment process, email us at appassist@str.us and provide your contact info. Pursuant to applicable federal law and regulations, positions at STR require employees to obtain national security clearances and satisfy the requirements for compliance with export control and other applicable laws.
Lead Mission Engineer (DevOps)
STRSTR makes the world a safer place by developing technology and applying it to solve emerging national security challenges.
About the Team: Information processing and sense-making systems are the lifeblood of national security efforts, facilitating an understanding of the global situation, strategic planning, and tactical execution. The reliability, accessibility, and sophistication of these systems can determine the outcomes of conflicts before they begin. STR’s Analytics and Command & Control (AC2) Division focuses on developing actionable, advanced technology solutions to provide asymmetric advantages within the information domain. The Mission Applications (MA) Group within the AC2 Division specializes in ensuring the technology that STR creates produces outsized mission impact. Staff within the MA group have a deep understanding of missions and technology needs for national security with skills targeting transition of technology to operational use, including product management, UX Product implementation, software integration & mission engineering, platform engineering, DevOps, and program management. MA staff combine with other science and engineering staff from the various STR research and development groups to form dedicated product teams focused on accelerating the operational transition of cutting-edge software technologies. The Role: As a Lead Mission Engineer (DevOps) in the Mission Applications Group, you will develop and deploy software essential for a specific, real-world objective or "mission" in partnership with our customers. You have: 1) expertise in developing and deploying software solutions in complex and high-security environments, 2) deep domain knowledge in the complex missions executed across the Department of Defense, and 3) expertise in modern frameworks, building full stack applications, and integrating APIs for deployment in high-security environments. You will work as a leader of software teams to design, develop, and maintain infrastructure and tooling to deploy software in operational use. You have experience managing relationships with customer environments while simultaneously leading engineering teams to ensure the delivered product meets the highest standards of quality, reliability, and performance. What you will do: - Supporting cloud (e.g., AWS) toolsets in unclassified environments - Configuring and maintaining multiple CI/CD environments at different classification levels, focusing on developer productivity, application performance, security monitoring, and alerting - Deployment and Field Testing: Traveling to integration sites to deploy, test, and triage software in secure operational environments - Running and managing incident response and post-outage reviews to improve deployment, monitoring, and alerting processes - Coaching team members on DevOps best practices while continuously helping to improve processes. Who You Are: - Active Top Secret Security Clearance with SCI eligibility, for which U.S citizenship is needed by the U.S government - BS in Computer Science or related technical field with at least 7 years of work experience. Equivalent experience will be considered - Experience deploying software for DoD or IC missions - A passion for crafting user-centric interfaces and a keen eye for design details - Demonstrated strong knowledge of modern technologies such as React, TypeScript, and Python - Knowledge of performance-focused tools and techniques for rendering large datasets and building responsive, smooth interfaces - A demonstrated ability to adopt new languages, libraries, and technologies - Organized, detail-oriented, and with an ability to work both independently and collaboratively - Demonstrated experience as an effective communicator to both technical and non-technical audiences Even Better: - Active TS/SCI security clearance - Advanced Degree in Computer Science, Information Technology, or related technical field - Prior US Military Service with experience in joint planning, large force battle monitoring, and fluidity with large force debrief best practices - Located in San Diego, CA, Denver, CO, Arlington, VA, Boston, MA, or willing to relocate Pay Information Full-Time Salary Range: $175,000 - $240,000 The salary range listed is based on external market data. Offers are based on factors, such as but not limited to, the candidate’s experience, education, training, key skills/critical skills, security clearances, and prevailing market and business conditions. STR is a growing technology company with locations near Boston, MA, Arlington, VA, near Dayton, OH, Melbourne, FL, and Carlsbad, CA. We specialize in advanced research and development for defense, intelligence, and national security in: cyber; next generation sensors, radar, sonar, communications, and electronic warfare; and artificial intelligence algorithms and analytics to make sense of the complexity that is exploding around us. STR is committed to creating a collaborative learning environment that supports deep technical understanding and recognizes the contributions and achievements of all team members. Our work is challenging, and we go home at night knowing that we pushed the envelope of technology and made the world safer. STR is not just any company. Our people, culture, and attitude along with their unique set of skills, experiences, and perspectives put us on a trajectory to change the world. We can't do it alone, though - we need fellow trailblazers. If you are one, join our team and help to keep our society safe! Visit us at www.str.us for more info. STR is an equal opportunity employer. We are fully dedicated to hiring the most qualified candidate regardless of race, color, religion, sex (including gender identity, sexual orientation and pregnancy), marital status, national origin, age, veteran status, disability, genetic information or any other characteristic protected by federal, state or local laws. If you need a reasonable accommodation for any portion of the employment process, email us at appassist@str.us and provide your contact info. Pursuant to applicable federal law and regulations, positions at STR require employees to obtain national security clearances and satisfy the requirements for compliance with export control and other applicable laws.
• Manage and optimize production infrastructure on AWS, ensuring scalability and reliability. • Deploy and orchestrate containerized applications using Kubernetes. • Implement and maintain infrastructure as code (IaC) using Terraform. • Set up and manage CI/CD pipelines using tools like Jenkins or Github Actions to streamline deployment processes. • Troubleshoot and resolve infrastructure issues to ensure high availability and performance. • Collaborate with cross-functional teams to define technical requirements and deliver solutions.

