Job Closed
This listing is no longer active.
DevOps/SRE Engineer
Location
United States
Posted
105 days ago
Seniority
Lead
Job Description
DevOps/SRE Engineer
NextGen IT Services
- Optimize release deployments and maintain secure cloud infrastructure
- Handle day-to-day operations and problem-solving
- Ingest new solutions and products from the Build/Automation organization
- Use monitoring and logging tools to diagnose and resolve issues
- Conduct post-mortem analysis and identify potential issues for improvement
- Set up, monitor, and maintain cloud-based DevOps SaaS products and solutions
- Maintain security and data privacy and ensure compliance
- Work with architects on deployment architecture, security, and CI/CD implementations
- Set up and maintain Kubernetes clusters in cloud environments
- Analyze and solve operational issues, and respond to incidents
- Conduct root cause analysis and implement continuous improvements
- Evaluate new technology options and vendor products
Job Requirements
- BS/BA degree in Computer Science, Management Information Systems, or related IT discipline preferred
- ALLOWABLE SUBSTITUTION: An additional four (4) years of experience can be substituted for a BS or BA degree
- 8+ years of experience
- Expertise with source code management such as SVN, GitHub or GitLab
- Experience with binary resource management tools such as JFrog Artifactory or Harbor
- Strong background in Linux/Unix administration
- Expertise with building, implementing, and/or supporting monitoring tools
- Experience deploying high-volume applications in Google Cloud Platform, AWS, or Azure using automation
- Expert understanding of web services, networking, virtualization, and internet protocols
- Ability to multitask and handle various projects, deadlines and changing priorities
- Excellent communication and prioritization skills
- Expertise with security fundamentals as they pertain to multitenant SaaS application systems
- Strong interpersonal, presentation, and customer service skills
Benefits
- Remote work options
- Teamwork and group-based troubleshooting
- Support and mentorship environment to learn and grow
More DevOps Engineer Jobs
SRE – Platform Engineer
DroneUp
DroneUp is a leader in drone flight services that transforms organizations using drone technology and delivery solutions. The company develops SaaS platforms that have mobile app t
- Serve as broad domain architect for the internal developer platform and all cloud engineering
- Drive architecture for tooling or in-house software
- Mentor other platform engineers to drive strong engineering practices
- Enable platform engineering technical capabilities for our internal client teams in software engineering
- Peer with the senior architects and engineers in software engineering
- Focus architecture and engineering on the GCP environment
- Architect and oversee GKE cluster operations and workload management
- Provide feedback to others and participate in peer reviews and pair programming
- Drive the broad adoption of Test Driven Development by designing, developing, and debugging unit and integration tests for new and existing infrastructure and code
- Stay curious about existing implementations and new technologies, and share findings with the team
- Practice continuous improvement across all job areas, both personally and professionally
- Communicate clearly with platform engineering teams and other stakeholders while providing technical direction
- Stay current with platform changes and third-party libraries
- Proactively investigate better alternatives to current solutions
- Understand OpenTelemetry and true observability, and how observability differs from monitoring and logging
- Grow the engineering culture towards a high-performing team
- Practice self-service, least privilege, and security by default in all solutions
- Define and maintain Service Level Objectives (SLOs), Service Level Indicators (SLIs), and error budgets
- Lead incident response, including on-call rotations, root cause analysis, and post-mortem reviews
- Implement and optimize monitoring, alerting, and observability systems for system reliability
- Collaborate on capacity planning and performance optimization to ensure high availability
- Other duties as assigned

- Design, implement, and maintain CI/CD pipelines to support automated build, test, and deployment workflows
- Partner with engineering teams to streamline release processes and improve deployment reliability
- Implement and manage monitoring, logging, and alerting solutions to ensure system health and performance
- Define and maintain cost monitoring and alerting strategies to optimize cloud spend and prevent unexpected usage
- Automate infrastructure provisioning and configuration using Infrastructure as Code (IaC)
- Troubleshoot production issues and lead root cause analysis efforts
- Establish DevOps best practices around reliability, security, and operational excellence
- Continuously evaluate tools and processes to improve scalability, availability, and efficiency
- Mentor junior engineers and contribute to a strong DevOps culture

- Lead and manage SRE operations supporting 24/7/365 availability
- Own uptime, SLA compliance, SLIs, SLOs, error budgets, MTTR, and incident trends
- Oversee incident management, on-call rotations, and post-incident reviews
- Lead FinOps practices across hybrid environments
- Drive right-sizing, optimization, and elimination of infrastructure waste
- Establish cost visibility, allocation, and reporting
- Define and maintain observability standards across hybrid environments, such as AWS, Azure, and vSphere
- Utilize platforms such as Coralogix, OpenTelemetry, and FireHydrant
- Champion GitOps practices and pull request governance
- Lead Terraform-based infrastructure automation initiatives
- Partner across Product, Engineering, Infrastructure, Finance, and Support teams
- Lead, mentor, and develop a high-performing SRE team
Senior DevOps Engineer
spiderSilk
spiderSilk delivers tip-of-the-spear threat detection technology for the public and private sectors, globally.
- Design, build, and maintain robust CI/CD pipelines and cloud infrastructure to accelerate software delivery.
- Monitor system performance, troubleshoot issues, and proactively respond to incidents to minimize downtime.
- Collaborate closely with software engineers to enable rapid, secure, and reliable releases.
- Automate deployment, testing, and scaling processes to enhance operational efficiency.
- Implement Infrastructure as Code (IaC) and cloud security best practices to ensure compliance and reduce risk.
- Optimize system reliability and performance through proactive capacity planning and tuning.
- Champion DevOps best practices, including observability, disaster recovery, and cost optimization.
- Stay ahead of emerging technologies and evaluate new tools to improve our tech stack.




