Job Closed
This listing is no longer active.
Deliver Data-Driven Results with our suite of solutions for distributors, suppliers, and operators.
Site Reliability Engineer – Team Lead
Location
Idaho
Posted
105 days ago
Salary
$120K - $150K / year
Seniority
Senior
Job Description
Site Reliability Engineer – Team Lead
Meal Ticket
• Lead a team of DevOps engineers and applicative developers, providing technical guidance and mentorship • Oversee the team’s tasks including planning, prioritization, and execution • Design, develop, and maintain infrastructure as code using Terraform • Oversee database administration tasks, ensuring performance, reliability, and availability • Implement best practices for security, compliance, and operational excellence • Identifying and troubleshooting issues within the Marketman infrastructure, while suggesting and implementing improvements • Collaborating effectively with cross-functional teams (team leads, tech leads, developers, product owners, etc.) to ensure high-quality software releases • Improve CI/CD pipelines (github actions, azure devops), automation, and monitoring to enhance operational efficiency • Work with different technologies and tools, including: Azure Cloud, K8S, Git actions, MSSQL, MySQL, Mongo, .net supporting infrastructure, DataDog, and other internal tools • Thinking outside the box, continuously pushing for improvements and optimizations.
Job Requirements
- 6+ years of proven experience architecting, building and maintaining scalable and highly available production systems in the cloud
- Experience with containerizing services and managing containerized deployments
- Expertise in Terraform and Infrastructure as Code (IaC) best practices
- Hands-on experience with Azure Cloud, including networking, security, and cost management
- Solid understanding of CI/CD pipelines, automation, and deployment strategies
- Experience leading teams, managing projects, and driving technical initiatives
- Strong knowledge of database administration, including performance tuning and backup strategies
- Experience with FinOps practices to optimize cloud spend and efficiency
- Ability to collaborate with cross-functional teams and communicate complex technical concepts effectively
- A problem-solving mindset, ownership mentality, and a passion for innovation
- Solid understanding of web development fundamentals
- Good communication skills
- Self-motivated, detail-oriented, and organized
- A team player with strong social and interpersonal skills
- Excellent verbal and written communication skills in English.
Benefits
- Continuous learning and personal development
- Collaborative and inclusive work environment
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Principal Site Reliability Engineer
JabilFounded in 1966, Jabil is a company in the electronic manufacturing sector. From its headquarters in St. Petersburg, Florida, the company employs a team of over 200,000 people in 2
• Lead the design, implementation, and management of test infrastructure automation using Infrastructure as Code (IaC) tools such as Ansible. • Planning test engineering Infrastructure and Network strategies for the Intelligent Infrastructure - GCTD organization. • Ensuring global accurate and efficient governance policy deployment and adherence to test networks, security, and infrastructure areas. • Debug complex problems related to test network and business-critical applications. • Drive post-incident analysis and root cause investigation implementing corrective action to prevent future occurrences.
• Automate the deployment of environments using IAC tooling such as Terraform. • Implement and maintain CI/CD pipelines to deploy services to. • Partner with Network, Security, Development, and QA teams to determine how best to migrate business systems. • Work with development to design and implement improved deployment, provisioning, and integration pipelines; ensure environments are in good working order and identify areas and plans for improvement. • Develop monitoring for business systems to track performance, resource utilization, and consumption using tools such as CloudWatch, CloudTrail or Elasticsearch. • Build reusable AMI’s for scripted deployments of infrastructure in AWS across multiple regions. • Work with existing DevOps to develop needed log retention for AWS instance business applications for retrieval per audit policy and PCI compliance. • Maintain and administer servers, storage, virtualization platforms, and endpoint management tools. • Deploy and manage cloud infrastructure on platforms like AWS, Azure, or Google Cloud. • Monitor and optimize cloud resources for performance, availability, and cost-efficiency. • Build and manage containerized applications using Docker. • Deploy and maintain Kubernetes clusters or other orchestration platforms. • Implement and maintain security best practices across infrastructure and pipelines. • Ensure compliance with internal policies and external regulations. • Set up and manage monitoring, logging, and alerting systems (e.g., Prometheus, Grafana, ELK, Datadog). • Respond to incidents, perform root cause analysis, and implement preventive measures. • Work closely with development, QA, and IT teams to support application deployment and infrastructure needs. • Provide technical guidance and support for DevOps tools and practices.
Principal Engineer – Platform Engineering, Site Reliability
FICO - Fair Isaac CorporationFICO, also known as Fair Isaac Corporation, is one of the world’s leading credit history and financial analysis organizations. It was founded in 1956 on the i
• Design, deploy, and manage scalable cloud solutions on AWS public cloud platform via Infrastructure as Code. • Manage infrastructure as code (IaC) leveraging Terraform, CloudFormation and GoLang. • Design and implement Kubernetes-based platform solutions with focus on scalability, reliability, and security. • Support and maintain large Kubernetes clusters in production environments. • Implement security best practices and ensure compliance with industry standards and regulations. • Work closely with development, operations, and security teams to integrate infrastructure as code practices. • Develop automation to build and deploy Docker Containers through CI/CD pipelines for engineering teams deploy and test services. • Write policy & standard validation tests and integrations with Security Scanning software to ensure compliance. • Implement and support Observability solutions to ensure platform performance, reliability, and scalability. • Create Dashboards and integrate into Backstage IDP for visibility into system health. • Provide guidance and mentorship to team members on best practices in GitOps, CI/CD, and infrastructure management. • Work closely with development, operations, and security teams to integrate infrastructure as code practices across the organization.
DevOps Engineer
MI-C3 International LtdWe deliver trusted software solutions to make organizations effective.
• ensuring the reliability, performance, and continuous improvement of our mission-critical platforms. • work closely with engineers, product teams, and leadership to design, deploy, and optimise the infrastructure that drives our products. • play a key part in shaping our operational landscape • championing automation • ensuring seamless delivery across the business.



