Tecsys Inc. logo
Tecsys Inc.

Equipping supply chain greatness.

Infrastructure Reliability Engineer

DevOps EngineerDevOps EngineerFull TimeRemoteSeniorTeam 501-1,000Since 1983H1B No SponsorCompany SiteLinkedIn

Location

Canada

Posted

69 days ago

Salary

0

Seniority

Senior

Bachelor DegreeFrenchTerraform

Job Description

Infrastructure Reliability Engineer

Tecsys Inc.

• Collaborate with other engineering teams to support services before they go live through activities such as systems design consultation, platform and software framework development, capacity planning, and launch reviews. • Continuously innovate by identifying weaknesses, proposing creative solutions, and leading initiatives that simplify, scale, and harden the platform. • Maintain services once they are live by measuring and monitoring availability, latency, and overall system health. • Ensure **optimized observability**: improve and expand monitoring and alerting using Datadog; define SLOs/SLIs and build actionable dashboards that drive reliability outcomes. • Develop and promote automation: enhance internal tooling, IaC frameworks, and pipelines (Terraform, GitLab CI/CD) to reduce manual interventions and enable self-healing systems. • Scale systems sustainably through automation and by driving changes that improve reliability and velocity. • Practice sustainable incident management and blameless post-incident analysis. Lead post-incident reviews (RCA) and identify long-term fixes that improve stability, reliability, and developer experience. • Implement monitoring, logging, alerting, and SLA reporting. • Create and maintain technical documentation. • Implement, maintain, and evolve SRE best practices. • Act as **incident commander** during incidents: coordinate cross-team response, manage communications, and ensure rapid service restoration.

Job Requirements

  • On-call rotation for incident escalation
  • Occasional travel (quarterly on-site visits, conferences - less than 10%)

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Particle41 logo

DevOps Engineer, Azure

Particle41

We provide world-class teams for App Development, DevOps & Data Science.

DevOps Engineer69 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

• Work closely with software developers, system administrators, and other stakeholders to understand the requirements and objectives of projects. • Collaborate on the design, implementation, and maintenance of continuous integration and delivery pipelines. • Create and maintain comprehensive documentation for systems, processes, and configurations. • Design, implement, and manage automation processes for software build, deployment, and configuration. • Evaluate, select, and implement tools and technologies to enhance the efficiency of the development and deployment processes. • Manage and maintain Azure cloud infrastructure to ensure scalability, reliability, and security. • Implement infrastructure as code (IaC) using tools such as Terraform, ARM Templates, and others. • Establish and maintain CI/CD pipelines to automate the software delivery process, including build, test, and deployment phases. • Develop and implement monitoring solutions using Azure Monitor, Application Insights, and Log Analytics to ensure the health and performance of systems and applications. • Proactively identify and address issues related to system performance, reliability, and scalability. • Implement and maintain security best practices in infrastructure and application deployment including Azure AD, Key Vault, and Network Security Groups. • Ensure compliance with regulatory requirements and company security policies. • Provide support for development and operations teams, addressing issues related to build failures, deployment problems, and system outages. • Participate in on-call rotation to respond to and resolve critical incidents.

Mexico
Particle41 logo

DevOps Engineer, GCP

Particle41

We provide world-class teams for App Development, DevOps & Data Science.

DevOps Engineer69 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

• Work closely with software developers, system administrators, and other stakeholders to understand the requirements and objectives of projects. • Collaborate on the design, implementation, and maintenance of continuous integration and delivery pipelines. • Create and maintain comprehensive documentation for systems, processes, and configurations. • Design, implement, and manage automation processes for software build, deployment, and configuration. • Evaluate, select, and implement tools and technologies to enhance the efficiency of the development and deployment processes. • Manage and maintain GCP infrastructure to ensure scalability, reliability, and security. • Implement infrastructure as code (IaC) using tools such as Terraform, Deployment Manager, and others. • Establish and maintain CI/CD pipelines to automate the software delivery process, including build, test, and deployment phases. • Develop and implement monitoring solutions using Cloud Monitoring and Cloud Logging to ensure the health and performance of systems and applications. • Proactively identify and address issues related to system performance, reliability, and scalability. • Implement and maintain security best practices in infrastructure and application deployment including IAM, Security Command Center, and VPC Service Controls. • Ensure compliance with regulatory requirements and company security policies. • Provide support for development and operations teams, addressing issues related to build failures, deployment problems, and system outages. • Participate in on-call rotation to respond to and resolve critical incidents.

Mexico
Particle41 logo

DevOps Engineer, AWS

Particle41

We provide world-class teams for App Development, DevOps & Data Science.

DevOps Engineer69 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

• Work closely with software developers, system administrators, and other stakeholders to understand the requirements and objectives of projects. • Collaborate on the design, implementation, and maintenance of continuous integration and delivery pipelines. • Create and maintain comprehensive documentation for systems, processes, and configurations. • Design, implement, and manage automation processes for software build, deployment, and configuration. • Evaluate, select, and implement tools and technologies to enhance the efficiency of the development and deployment processes. • Manage and maintain AWS cloud infrastructure to ensure scalability, reliability, and security. • Implement infrastructure as code (IaC) using tools such as Terraform, CloudFormation, and others. • Establish and maintain CI/CD pipelines to automate the software delivery process, including build, test, and deployment phases. • Develop and implement monitoring solutions using CloudWatch and other AWS monitoring tools to ensure the health and performance of systems and applications. • Proactively identify and address issues related to system performance, reliability, and scalability. • Implement and maintain security best practices in infrastructure and application deployment including IAM, security groups, and VPC configurations. • Ensure compliance with regulatory requirements and company security policies. • Provide support for development and operations teams, addressing issues related to build failures, deployment problems, and system outages. • Participate in on-call rotation to respond to and resolve critical incidents.

Argentina
Particle41 logo

DevOps Engineer, GCP

Particle41

We provide world-class teams for App Development, DevOps & Data Science.

DevOps Engineer69 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

• Work closely with software developers, system administrators, and other stakeholders to understand the requirements and objectives of projects. • Collaborate on the design, implementation, and maintenance of continuous integration and delivery pipelines. • Create and maintain comprehensive documentation for systems, processes, and configurations. • Design, implement, and manage automation processes for software build, deployment, and configuration. • Evaluate, select, and implement tools and technologies to enhance the efficiency of the development and deployment processes. • Manage and maintain GCP infrastructure to ensure scalability, reliability, and security. • Implement infrastructure as code (IaC) using tools such as Terraform, Deployment Manager, and others. • Establish and maintain CI/CD pipelines to automate the software delivery process, including build, test, and deployment phases. • Develop and implement monitoring solutions using Cloud Monitoring and Cloud Logging to ensure the health and performance of systems and applications. • Proactively identify and address issues related to system performance, reliability, and scalability. • Implement and maintain security best practices in infrastructure and application deployment including IAM, Security Command Center, and VPC Service Controls. • Ensure compliance with regulatory requirements and company security policies. • Provide support for development and operations teams, addressing issues related to build failures, deployment problems, and system outages. • Participate in on-call rotation to respond to and resolve critical incidents.

Argentina