Job Closed
This listing is no longer active.
Where software concepts come alive™
SRE / DevOps / Infrastructure Engineer
Location
Slovakia
Posted
67 days ago
Salary
0
Seniority
Senior
Job Description
SRE / DevOps / Infrastructure Engineer
Intetics
• Build, operate, and improve the infrastructure powering Parasail’s distributed inference platform • Own reliability, scalability, and operational excellence across AWS-based control planes and our multi-provider GPU fleet • Design and maintain the networking layer connecting control planes, Kubernetes clusters, and geographically distributed GPU hosts • Operate and improve Kubernetes-based inference orchestration, primarily on EKS • Manage deployments and infrastructure changes using Helm, FluxCD, and Terraform • Improve observability across the platform using metrics, logs, traces, dashboards, and alerting built on Prometheus, Grafana, Loki, Jaeger, and OpenTelemetry • Tune alerts, improve runbooks, and strengthen operational readiness as the system scales • Respond to production issues, perform root cause analysis, and implement durable fixes • Work closely with engineers across time zones using clear asynchronous communication and handoff practices, especially through Slack • Help expand Europe-based infrastructure coverage to support sustainable operations outside US business hours
Job Requirements
- 5+ years of experience in SRE, DevOps, platform engineering, or infrastructure engineering
- Strong production experience with networking and Kubernetes
- Experience operating AWS infrastructure in production, especially EKS
- Strong hands-on experience managing Linux hosts, clusters, and distributed systems in environments that are not fully abstracted by a major cloud provider
- Experience with Prometheus, Grafana, Loki, Jaeger, and OpenTelemetry
- Experience with deployment and GitOps workflows using tools such as Helm and FluxCD
- Experience with infrastructure as code, ideally Terraform
- Familiarity with alert tuning, runbook development, and practical incident management in production systems
- Strong operational judgment: able to troubleshoot independently, respond calmly to incidents, and improve systems without constant direction
- Comfortable working in a fast-moving startup where infrastructure, product, and customer demands are changing quickly
- Clear communicator who can work effectively in an async environment and handle shift handoffs cleanly
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
DevOps Specialist
GFT TechnologiesAs a pioneer for digital transformation GFT develops sustainable solutions across new technologies.
• Keep and evolve Kubernetes clusters (with a focus on AKS), including provisioning, upgrades, optimization, configuration and troubleshooting. • Implement and optimize CI/CD pipelines using Azure DevOps, including builds, automated tests, code quality, security and deployment strategies. • Design and maintain infrastructure as code using Crossplane and/or Terraform. • Create and maintain Helm charts and YAML manifests. • Implement and operate observability using the LGTM stack, defining metrics, logs, dashboards and alerts for applications and infrastructure. • Implement and evolve DevSecOps practices (vulnerability scanning in containers/images, SAST/DAST, security policies in pipelines and clusters). • Support daily environment management: incident analysis, performance troubleshooting, cost optimization in Azure and developer support. • Work closely with development teams, continuously improving software lifecycle processes (with a focus on DevEx).
Job DetailsJob Location: Canadian Headquarters - St Johns, NL A1C 6C9Position Type: Full TimeSalary Range: $95,000.00 - $125,000.00 Salary/yearTravel Percentage: NoneJob Category: Engineering Senior DevOps Engineer What We Seek We are looking for a Senior DevOps Engineer to help ensure that the systems our students and teachers rely on every day are reliable, secure, scalable, observable, and high performing. This is a hands-on engineering role for someone who enjoys improving systems, reducing toil through automation, enabling developers, and strengthening operational excellence. You will work across CI/CD, infrastructure as code, observability, incident response, and cloud reliability, while helping modernize legacy practices and improve the developer experience. You should be a strong communicator who collaborates well in a distributed environment and is comfortable partnering across engineering, QA, support, and product teams. We value people who are practical, curious, accountable, and motivated by continuous improvement. Our environment includes AWS, Jenkins, CloudFormation, ECS/Fargate, GitHub, Jira, Splunk, New Relic, Cortex.io, Slack, Snowflake, Databricks, and other modern engineering tools and platforms. Location is flexible! This role can work from our St. John's, NL office on the water, or remote anywhere within Canada. Candidates must already be residing in Canada. No visa sponsorship is available. What Your Day Will Look Like DevOps, CI/CD and Developer Enablement Develop and maintain Jenkins shared libraries and Jenkins pipelines using Groovy Improve build, test, and deployment workflows to make software delivery more reliable and efficient Partner with development and QA teams to support internal development and test environments Help teams adopt better engineering practices around release quality, automation, and deployment confidence Carnegie Learning is evolving toward a platform engineering model to improve developer experience and operational excellence, leveraging Cortex.io to centralize service cataloging, ownership, standards, and engineering insights. Infrastructure and Platform Engineering Build and manage AWS infrastructure using Infrastructure as Code, primarily CloudFormation Design, deploy, and improve secure, scalable cloud environment Troubleshoot infrastructure and platform issues independently and drive long-term fixes Help modernize legacy tooling and operational practices Production Reliability and Operations Design and implement monitoring, alerting, trend analysis, and self-healing capabilities Support SLIs and SLOs and help teams use reliability metrics to improve service health Monitor and respond to alerts and production issues across applications and infrastructure Participate in incident response and post-incident reviews, identifying both technical and process improvements Assist support and engineering teams with log analysis, troubleshooting, and root cause investigation Collaboration and Ownership Work effectively in a remote-first environment using tools like Slack, Jira, and shared documentation Keep tasks, documentation, and operational runbooks current Communicate clearly during both planned technical sessions and real-time incident situations Contribute to a strong culture of teamwork, accountability, and customer focus On-call on a monthly rotating basis (not heavy after hours but some is needed on occasion) What Should Be In Your Bookbag 5+ years of experience building and operating production-grade cloud solutions, preferably in AWS Cloud certification beyond Practitioner level, such as SysOps, DevOps, Solutions Architect, or Security Strong hands-on experience with Jenkins, including Jenkins DSL, plugin ecosystem, CI/CD pipelines, and Git-based workflows Strong scripting and automation skills, including Groovy and at least one additional language such as Python, Go, Java, or Bash Experience with web applications and modern frameworks/languages such as JavaScript, TypeScript, Angular, Node.js, Django, or Laravel Strong troubleshooting skills across the SDLC, including failed builds, pipeline issues, and infrastructure bottlenecks Experience designing and implementing secure AWS infrastructure using Infrastructure as Code, preferably CloudFormation Hands-on experience with Docker, containers, and container orchestration, especially ECS Fargate Experience with high availability, load balancing, and content delivery platforms and practices Strong cloud security and networking experience, including least-privilege access models, IAM policies, and secure infrastructure design Experience with observability, logging, and performance monitoring tools for troubleshooting and capacity planning, preferably Splunk and New Relic Experience with production change management, including rollback planning and documentation Strong communication, presentation, and customer service skills, with the ability to work independently and solve complex technical problems Experience leveraging AI-powered tools or platforms to improve operational efficiency, troubleshooting, automation, developer experience, or service reliability Bonus points for experience with: Database knowledge (writing queries, troubleshooting, performance, and monitoring) DevSecOps, including integrating code analysis and vulnerability scanning tools into the CI/CD pipeline; additionally, familiarity with cybersecurity and regulatory frameworks (e.g., NIST, SOC 2, ISO, and COBIT) What Gives Us Purpose Others teach what; we teach why. For over 25 years, Carnegie Learning has developed solutions that help students achieve Aha moments and set them up for long-term learning success. We’re driven to find new, better ways to support teachers and reach more students, so we put our leading cognitive science research and real-world insights into solutions for how students think, learn, and do their best. This is a key part of our commitment to helping teachers spark student curiosity. As a Carnegie Learning team member, you’ll work alongside a team of passionate individuals dedicated to making a real difference in the lives of students and educators. What We Provide Holistic Wellbeing Cost-shared health and dental benefits plan Competitive Retirement Savings Matching Program to plan for your future Fostering Joy Flexible work arrangements with our Work From Anywhere Policy Your Time, Your Way - paid time off that you can use as you see fit to recharge and nurture your personal life Empowering Parenthood Top-Up Parental Leave Reduced working hours on full pay for new parents Free access to CL products for employees and their children A Place for Connection Quarterly Wellness Incentives Monthly employee activities + recognition program Employee Allyship Groups (EAGs) What We Believe We celebrate the unique attributes, characteristics, and perspectives that define each person's individuality. This fusion of perspectives enriches our collective knowledge, fosters innovation, and empowers us all. Together, we can collectively and more effectively address issues that face our business and industry. If our commitment to building an inclusive workplace resonates with you, we invite you to join our mission and welcome you to apply with us. Carnegie Learning is an Equal Opportunity Employer, including veterans and individuals with disabilities. Qualifications
Job DetailsJob Location: Canadian Headquarters - St Johns, NL A1C 6C9Position Type: Full TimeSalary Range: $95,000.00 - $125,000.00 Salary/yearTravel Percentage: NoneJob Category: Engineering Senior DevOps Engineer What We Seek We are looking for a Senior DevOps Engineer to help ensure that the systems our students and teachers rely on every day are reliable, secure, scalable, observable, and high performing. This is a hands-on engineering role for someone who enjoys improving systems, reducing toil through automation, enabling developers, and strengthening operational excellence. You will work across CI/CD, infrastructure as code, observability, incident response, and cloud reliability, while helping modernize legacy practices and improve the developer experience. You should be a strong communicator who collaborates well in a distributed environment and is comfortable partnering across engineering, QA, support, and product teams. We value people who are practical, curious, accountable, and motivated by continuous improvement. Our environment includes AWS, Jenkins, CloudFormation, ECS/Fargate, GitHub, Jira, Splunk, New Relic, Cortex.io, Slack, Snowflake, Databricks, and other modern engineering tools and platforms. Location is flexible! This role can work from our St. John's, NL office on the water, or remote anywhere within Canada. Candidates must already be residing in Canada. No visa sponsorship is available. What Your Day Will Look Like DevOps, CI/CD and Developer Enablement Develop and maintain Jenkins shared libraries and Jenkins pipelines using Groovy Improve build, test, and deployment workflows to make software delivery more reliable and efficient Partner with development and QA teams to support internal development and test environments Help teams adopt better engineering practices around release quality, automation, and deployment confidence Carnegie Learning is evolving toward a platform engineering model to improve developer experience and operational excellence, leveraging Cortex.io to centralize service cataloging, ownership, standards, and engineering insights. Infrastructure and Platform Engineering Build and manage AWS infrastructure using Infrastructure as Code, primarily CloudFormation Design, deploy, and improve secure, scalable cloud environment Troubleshoot infrastructure and platform issues independently and drive long-term fixes Help modernize legacy tooling and operational practices Production Reliability and Operations Design and implement monitoring, alerting, trend analysis, and self-healing capabilities Support SLIs and SLOs and help teams use reliability metrics to improve service health Monitor and respond to alerts and production issues across applications and infrastructure Participate in incident response and post-incident reviews, identifying both technical and process improvements Assist support and engineering teams with log analysis, troubleshooting, and root cause investigation Collaboration and Ownership Work effectively in a remote-first environment using tools like Slack, Jira, and shared documentation Keep tasks, documentation, and operational runbooks current Communicate clearly during both planned technical sessions and real-time incident situations Contribute to a strong culture of teamwork, accountability, and customer focus On-call on a monthly rotating basis (not heavy after hours but some is needed on occasion) What Should Be In Your Bookbag 5+ years of experience building and operating production-grade cloud solutions, preferably in AWS Cloud certification beyond Practitioner level, such as SysOps, DevOps, Solutions Architect, or Security Strong hands-on experience with Jenkins, including Jenkins DSL, plugin ecosystem, CI/CD pipelines, and Git-based workflows Strong scripting and automation skills, including Groovy and at least one additional language such as Python, Go, Java, or Bash Experience with web applications and modern frameworks/languages such as JavaScript, TypeScript, Angular, Node.js, Django, or Laravel Strong troubleshooting skills across the SDLC, including failed builds, pipeline issues, and infrastructure bottlenecks Experience designing and implementing secure AWS infrastructure using Infrastructure as Code, preferably CloudFormation Hands-on experience with Docker, containers, and container orchestration, especially ECS Fargate Experience with high availability, load balancing, and content delivery platforms and practices Strong cloud security and networking experience, including least-privilege access models, IAM policies, and secure infrastructure design Experience with observability, logging, and performance monitoring tools for troubleshooting and capacity planning, preferably Splunk and New Relic Experience with production change management, including rollback planning and documentation Strong communication, presentation, and customer service skills, with the ability to work independently and solve complex technical problems Experience leveraging AI-powered tools or platforms to improve operational efficiency, troubleshooting, automation, developer experience, or service reliability Bonus points for experience with: Database knowledge (writing queries, troubleshooting, performance, and monitoring) DevSecOps, including integrating code analysis and vulnerability scanning tools into the CI/CD pipeline; additionally, familiarity with cybersecurity and regulatory frameworks (e.g., NIST, SOC 2, ISO, and COBIT) What Gives Us Purpose Others teach what; we teach why. For over 25 years, Carnegie Learning has developed solutions that help students achieve Aha moments and set them up for long-term learning success. We’re driven to find new, better ways to support teachers and reach more students, so we put our leading cognitive science research and real-world insights into solutions for how students think, learn, and do their best. This is a key part of our commitment to helping teachers spark student curiosity. As a Carnegie Learning team member, you’ll work alongside a team of passionate individuals dedicated to making a real difference in the lives of students and educators. What We Provide Holistic Wellbeing Cost-shared health and dental benefits plan Competitive Retirement Savings Matching Program to plan for your future Fostering Joy Flexible work arrangements with our Work From Anywhere Policy Your Time, Your Way - paid time off that you can use as you see fit to recharge and nurture your personal life Empowering Parenthood Top-Up Parental Leave Reduced working hours on full pay for new parents Free access to CL products for employees and their children A Place for Connection Quarterly Wellness Incentives Monthly employee activities + recognition program Employee Allyship Groups (EAGs) What We Believe We celebrate the unique attributes, characteristics, and perspectives that define each person's individuality. This fusion of perspectives enriches our collective knowledge, fosters innovation, and empowers us all. Together, we can collectively and more effectively address issues that face our business and industry. If our commitment to building an inclusive workplace resonates with you, we invite you to join our mission and welcome you to apply with us. Carnegie Learning is an Equal Opportunity Employer, including veterans and individuals with disabilities. Qualifications
Release Engineer II
DICK'S Sporting GoodsHeadquartered in Coraopolis, Pennsylvania, DICK’S Sporting Goods offers sports fans and enthusiasts a “big store” selection of name-brand sports equipment, products, shoes, a
• Oversee planning, scheduling, and deployment of software releases across the enterprise • Coordinate software releases across multiple applications within various portfolios • Ensure releases are executed smoothly while balancing competing priorities • Create, maintain, and publish a centralized release calendar • Identify, manage, and mitigate risks to release scope, quality, or schedules • Collaborate closely with development, quality assurance, and product management teams • Measure and monitor progress to ensure timely software delivery




