Job Closed
This listing is no longer active.
Senior DevOps Engineer
Location
Minnesota
Posted
63 days ago
Salary
$140K - $160K / year
Seniority
Senior
Job Description
Senior DevOps Engineer
In Tandem
• Design, build, and maintain scalable, reliable infrastructure to support In Tandem’s technical platform • Partner with Engineering teams as an internal consultant on CI/CD pipelines while developing and maintaining best practices for managing maintainable pipelines. • Architect and operate containerized environments using primarily AWS ECS and RDS codified in Terraform • Manage and evolve cloud infrastructure across AWS services • Troubleshoot and resolve complex system issues related to networking, security, performance, and reliability • Partner closely with application engineers to enable safe, efficient, and repeatable software delivery • Lead infrastructure initiatives that improve platform resilience • Influence DevOps best practices, mentor teammates, and drive continuous improvement across the engineering organization
Job Requirements
- 5+ years of experience in DevOps, infrastructure, or site reliability engineering
- Experience working in AWS and cloud environments
- Strong experience with containerization and orchestration (Docker, Docker Swarm, Kubernetes, ECS, or similar)
- Experience with Terraform or similar infrastructure-as-code platform to both build out new and codify existing infrastructure
- Experience building and maintaining CI/CD pipelines (Bitbucket Pipelines, Bamboo, Jenkins, GitHub Actions or similar tools)
- Strong troubleshooting skills across networking, security, performance, backups, patching, and system reliability
- Experience partnering closely with application engineers in a product-driven organization
- An inclination to make small improvements where possible and large improvements where necessary
- Fluency with AI (MCP, Code Generation, Automated PR Reviewing, etc.)
- What would be great to have:**
- Experience with centralized logging and monitoring tools (Splunk, New Relic, Cloudwatch, or similar)
- Exposure to databases, data modeling, data architecture, and an understanding of application performance considerations
- Network design experience
- Experience with security compliance control systems (ex: Vanta)
- Exposure to serverless technologies (ex: AWS Lambda)
- Skills in cost optimization for AWS infrastructure
- Experience modernizing or migrating legacy infrastructure
Benefits
- Medical: In Tandem pays 100% of the premium for employees AND 99% for all additional family members
- 401k: Up to a 4% match with immediate vesting
- Paid leave for all new parents
- Learning & Development stipend for employees
- Paid Time Off: 11 Holidays + Winter Break (3 Days) + Volunteer Time Off (1 Day) + Floating Holiday (1 Day)
- Personal Time Off:
- 15 days for 0-1 years of employment
- 20 days 1-3 years of employment
- Supportive and flexible working environment – work from anywhere!
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Senior DevOps Engineer
Stalwart ThemiusEnabling the scalability of every business with the right talent, process, and technology supply.
• Create scripts and leverage software tools to explore new features and provide new customer capabilities • Assist in troubleshooting and resolving issues that affect release scope, schedule, and quality • Create documentation around best practices and common issues for system configuration and operation • Consult with the Infrastructure team to uphold access control, data integrity and file system security for the computer/data center environment • After-hours and on-call work is required as part of the release cycle • Implementing and managing application development and deployment pipelines • Implements quality control and review systems throughout the deployment processes • Collaborate with technology leadership to improve the software engineering processes and practices associated with continuously building, deploying, and updating software and environments • Oversight and maturation of DevOps processes • Provide guidance to help project teams manage and deploy releases in a fast-paced Agile environment • Contribute to defining the strategic direction for DevOps and Release processes to focus on automating delivery model • Creates and monitors DevOps and Release Management metrics
Site Reliability Engineer, Linux
JWay Group, Inc.100+ projects, 10M+ talent reach across multiple industries and professions, 12k+ prequalified talent database
• Maintain and support existing IT infrastructure and automation tools. • Lead and implement projects for continued infrastructure improvement, infrastructure automation, and deployments of new systems. • Combines understanding of automation needs and IT aspects to provide stable, efficient, and self-service R&D Stacks.
Lead Site Reliability Engineer
Coupa SoftwareSpend is the fuel to help your company deliver performance, profitability, and purpose!
• Build, deploy, and troubleshoot microservices in Kubernetes and Amazon EKS, ensuring scalability and reliability. • Design secure, highly available web applications with a focus on capacity planning and performance optimization. • Deploy and manage the lifecycle of LLMs and embedding models, defining KPIs to measure and improve AI application performance. • Evaluate and integrate emerging technologies such as RAG systems, MCP servers, AI Agents, and agentic workflows into our platform. • Manage AWS core and GenAI services (S3, IAM, EKS, Bedrock, etc.) using infrastructure-as-code tools like Terraform and Chef, while maintaining observability through tools like New Relic or PagerDuty. • Collaborate across product, platform, and engineering teams on architecture design, security patching, incident response, and release management to ensure the reliability of our ML and GenAI infrastructure
DevOps Engineer – L2/L3
N3XT SPORTSN3XT Sports is a sports consulting firm that specializes in digital transformation, innovation and investment strategy.
• Own and maintain corporate IT infrastructure using Terraform, ensuring configurations are versioned, auditable, and secure. • Design, build, and deploy automations using serverless automations in the cloud to streamline operational workflows and reduce manual effort. • Own alerting and notification pipelines using platforms such as incident.io and other incident management tools, ensuring anomalies and critical events surface to the appropriate responders. • Participate in and improve incident response workflows, including maintaining and iterating on runbooks, conducting post-incident reviews, and driving down mean time to resolution. • Package, deploy, and maintain internal tooling using Docker to support IT operations and automation efforts. • Develop targeted scripts and lightweight applications in Bash, Python, and JavaScript/TypeScript to solve operational problems and integrate corporate systems. • Collaborate cross-functionally with Security, Platform/SRE Engineering, and business stakeholders to align IT initiatives with organizational needs. • Maintain and troubleshoot network infrastructure fundamentals, including DNS, VPN, and firewall configurations.




