None specified.
DevOps Engineer II
Location
United States
Posted
5 days ago
Salary
$123.6K - $135K / year
Seniority
Mid Level
No structured requirement data.
Job Description
DevOps Engineer II
Golden 1 Talent Acquisition Team
Role Description The DevOps Engineer 2 is responsible for leading the automation processes for deploying Infrastructure as Code in both Microsoft Azure and On-Premises environments. The engineer will deploy product updates, identify production issues, and implement integrations that meet our customers’ needs. The ideal candidate will have a solid background in DevOps and Site Reliability Engineering, with significant experience in Terraform, Python, and PowerShell. The engineer will lead the infrastructure-as-code process, manage Linux/Kubernetes cluster environments, and support development teams on API integration strategies. The engineer will design, implement, and optimize CI/CD pipelines for faster and more reliable software releases. Additionally, the engineer will monitor systems, create alerts, and ensure application uptime and performance. Responsibilities also include provisioning and setting up metrics, creating alerts and managing alert suppression, and proposing automation solutions to reduce workload. This role is responsible for implementing and operating cloud platform services and standards defined by Cloud Engineering, with a focus on reliability, security, and scalability. Qualifications - Bachelor of science degree (or equivalent) in computer science, engineering, or relevant field. - Over 4 years as a DevOps Engineer in medium to large-scale environments. - Proficient in Windows Server, Linux, and hybrid cloud deployments using Microsoft Azure and VMWare. - Skilled in Git/GitHub workflows, Terraform, Python, PowerShell, and container orchestration (Tanzu, Docker, Kubernetes, OpenShift). - Experienced with CI/CD tools (Jenkins, GitLab CI, Azure DevOps) and observability platforms (Datadog, Prometheus, Grafana, ThousandEyes). - Knowledgeable in log management (ELK Stack) and database technologies (PostgreSQL, MySQL, NoSQL). - Strong background in automating infrastructure provisioning and application deployment using Terraform, Ansible, and Kubernetes. - Proficient in creating and maintaining monitoring dashboards, SLIs, SLOs, and error budgets to ensure application uptime and performance. - Experienced in ensuring infrastructure security, driving automation initiatives, and collaborating across teams to improve reliability and scalability. - Experienced in building observability pipelines and performing advanced queries in log management tools like Splunk for troubleshooting. - Experience implementing and operating Azure-based shared services defined by platform or cloud engineering teams. Requirements - Independently lead infrastructure-as-code development using Terraform and scripting languages such as Python and PowerShell to support scalable and reliable deployments. - Manage Linux/Kubernetes cluster environments. - Deploy solutions in accordance with Change Management Processes. - Support development teams on API integration strategy and standards development. - Ensure systems are secure against cybersecurity threats. - Identify technical problems and develop software updates and fixes. - Strong Splunk skills for administration, query optimization, alerting, and dashboard development. - Build tools to reduce errors and improve customer experience. - Propose ideas and solutions within the Infrastructure Department to reduce workload through automation. - Design, implement, and optimize CI/CD pipelines for faster and more reliable software releases. - Independently conduct root cause analysis and implement corrective actions. - Design and write tests to investigate infrastructure failure and scaling. - Create and maintain response playbooks across incident management and monitoring tools. - Develop automation to ensure repeatability, eliminate toil, and reduce time to action and repair services. - Analyze key operational metrics to identify opportunities to improve availability. - Implement effective monitoring, alerting, and reduction of alert fatigue. - Manage container orchestration environments and optimize deployment workflows to enhance scalability, reliability, and operational efficiency. - Design, build, and manage containerized environments using Docker. - Create and maintain SLIs, SLOs, and error budgets. - Design and optimize monitoring dashboards and alerting systems to proactively detect and address application performance and uptime issues. - Implement code branching strategies using GitHub functions. - Advanced Terraform syntax and GitLab CI/CD configuration, pipelines, jobs. - Provisioning and setting up metrics in Prometheus, Thanos, and Grafana, creating and managing alerts. - Implement cloud engineering standards, reusable modules, and platform patterns in Microsoft Azure. - Operate shared cloud platform services according to Cloud Engineering defined architectures. - Ensure infrastructure changes comply with reliability, security, and cost controls established by Cloud Engineering. - Maintain operational documentation and runbooks for cloud platform services. Benefits - Competitive salary range of $123,600.00 - $135,000.00 Annually. - Flexible working conditions including remote work options. - Comprehensive health benefits. - Opportunities for professional development and certifications.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Role Description Do you want to love what you do at work? Do you want to make a difference, an impact, transform people's lives? Do you want to work with a team that believes in disrupting the normal, boring, and average? If yes, then this is the job you're looking for. UXBERT Labs is one of the leading digital and user experience design agencies in the GCC, working with top regional and international brands such as STC, Amazon, Gucci, and more. As part of the Supertech Group, we are continuously expanding our innovation footprint. - CI/CD & Automation: Design, implement, and manage robust CI/CD pipelines for Node.js and React applications using GitHub Actions and Bitbucket Pipelines across Dev, Staging, and Production. - Infrastructure as Code (IaC): Maintain and scale cloud infrastructure using Terraform (or CloudFormation). - Containerization & Orchestration: Containerize applications using Docker and orchestrate using Kubernetes (optional) / Excellent experience working on Kubernetes clusters (GKE/AKS). - Monitoring & Observability: Monitor system health, security, and performance. Solid experience with New Relic, Datadog, and APM, setting up APM cadence, APM alerts, and rules and policies for support and disaster management. - Cloud & Networking: Excellent experience with cloud-based VPC and networking within load balancing and orchestration. - Future-Proofing: Ability to learn fast, understand the world of LLM, complex CI/CD setups in the future, such as AI machine learning and AI coding. - Development & Infrastructure: Solid at quickly setting up servers in Node.js, whitelisting IPs and providing Docker and secure environments for developers, production and testers. - Coding Background: 7+ years of DevOps experience across cloud platforms. An engineering background in PHP or a strong programming background would be beneficial; solid JavaScript or Python is a must. - Cloud & Caching Expertise: Proficiency with GCP and AWS (must have experience in both). Must be solid in CDNs, CloudFlare, configuration of complex caching in Redis and more. GCP certified would be very, very advantageous as Saudi prefer to work with GCP. - Database & Scaling: Understanding of database setup, high scaling, high traffic sites in CloudFlare and without - complex setups of database optimisation, DFD, migration. - Modern Workflows: Experience with Vercel for front-end deployment workflows. - Environment Familiarity: Familiarity with Node.js environments, Linux, and monitoring tools. - AI Integration: All engineers must be using AI vibe coding as part of their day to day work (Claude, Grok, ChatGPT). - Fully remote role (must align with KSA working hours). - Opportunity to work on high-impact Saudi government and enterprise projects. - Collaborative culture with strong design and engineering teams. Qualifications - 7+ years of DevOps experience across cloud platforms. - Strong programming background in JavaScript or Python. - GCP and AWS proficiency. Requirements - Experience with CI/CD pipelines, Terraform, Docker, and Kubernetes. - Solid experience with monitoring tools like New Relic and Datadog. - Understanding of database setup and optimization. Benefits - Competitive salary. - Professional development budget for certifications and training.
Role Description The company is systematically building out its security and compliance function. We have already launched the SOC 2 and ISO 27001 processes on Drata, with the goal of completing them by the end of Q2. In the mid-term roadmap, we also plan to cover GDPR, HIPAA, and HITRUST. We are looking for our first dedicated DevSecOps Engineer who will take ownership of this area. Above all, we are seeking a strong, hands-on engineer, someone who can not only describe security and compliance processes but also independently implement them across infrastructure, CI/CD, Kubernetes, cloud environments, and production services. This role is not about “paper compliance”. However, working with policies, procedures, and evidence will also be an important part of the responsibility. We need someone who can connect compliance requirements to real technical controls and ensure they are properly implemented, validated, documented, and audit-ready. Qualifications - 5+ years of hands-on experience in security / DevSecOps for production infrastructure. - Direct experience with SOC 2 implementation: controls, evidence collection, audit preparation, and communication with auditors. - Ability to write security policies and procedures yourself — and implement them in a way that actually works in day-to-day operations. - Strong hands-on experience with Docker, Kubernetes, and cloud environments — GCP and/or AWS. - Strong understanding of IAM/SSO: centralized access management, provisioning/deprovisioning, and periodic access reviews. - Experience building onboarding and offboarding processes from a security and compliance perspective. - Ability to automate routine work using Python and/or Bash. - Ownership mindset: you take responsibility for a task, drive it to completion, and think one step ahead. - Friendly, non-toxic, and pleasant to work with. - Strong communication with developers: you can clearly and constructively explain your position, defend it when needed, and find common ground. - Willingness and ability to mentor, teach, and share knowledge with others. - Analytical mindset: you dig down to the root cause instead of just treating symptoms. - Proactivity: you would rather prevent an outage than heroically fight it later. - Strong attention to detail and reliability. Requirements - Experience with GDPR, HIPAA, and HITRUST — these are the next steps on our roadmap. - Experience in regulated industries such as banking, fintech, or healthcare, including customer/vendor security audits. - Experience with both on-prem and SaaS environments. - Kubernetes security tooling: Falco, OPA/Gatekeeper, Pod Security Standards, Trivy. - Experience using AI agents to automate routine tasks. - Terraform/Ansible and GitOps experience. - Experience with bug bounty or responsible disclosure programs. Responsibilities - Own Drata, controls, evidence collection, and communication with auditors. Support SOC 2 and ISO 27001, with GDPR, HIPAA, and HITRUST planned next. - Develop and maintain security policies and procedures, including Vulnerability Management, Access Control, Incident Response, Data Protection, and others. - Build onboarding, offboarding, and access review as a real process, automating it through SSO, centralized IAM, and automated provisioning/deprovisioning. - Drive SDLC security: Dependabot, CodeQL/SAST, SCA, dependency update policies, secrets management, and related controls. - Own vulnerability management: scanning, CVE triage, patching, annual penetration testing, vendor selection, coordination, and follow-up on findings. - Participate in response to critical vulnerabilities and security incidents. - Improve security observability: audit logging, change tracking, and reporting across all production platforms. - Spend around 60% of your time on the general infrastructure track: Kubernetes, deployments, monitoring, automation, and on-call. Benefits - The team has built award-winning AI products for tech corporations — devices, voice assistants, products that are actually in the world. - Cutting-edge tech stack: Speech Technologies, NLP, Generative AI (LLMs, diffusion models), voice-first agentic architecture with privacy-first and on-premises deployment. - High engineering bar and real ownership — the team cares about what actually works in production. - Fast career progression — a senior-heavy team and a high volume of real problems means you grow faster than you would anywhere else. - Startup pace with enterprise stability — real clients, real revenue, no bureaucracy. - Fully remote across Europe. - 21 vacation days + public holidays + 5 sick days. - Private English lessons via Preply.
• Own and continuously optimize our CI/CD pipelines and delivery workflows, ensuring fast feedback loops and secure deployments. • Build and evolve the local developer experience, making it seamless for engineers to spin up, test, and debug services locally across a variety of languages and frameworks. • Own and maintain part of our cloud infrastructure and container orchestration platforms using Terraform and Kubernetes. This will also require participating in identifying and solving infrastructure-related production issues and performance troubleshooting, upgrading our platforms for long-term resilience. • Be a technical referent and drive engineering standards forward by acting as a trusted partner for product engineering squads on delivery and infrastructure best practices, sharing expertise and providing guidance. • Encourage a culture of technical curiosity by frequently evaluating, benchmarking, and prototyping emerging technologies to bring the best tool sets to the team and promoting a culture of continuous learning across the organization.
Intermediate Cloud Engineer
Apex SystemsApex Systems, an IT staffing and workforce solutions firm, provides recruiting and staffing services to large and small companies alike. Founded in 1995 by thre
Support the buildout of scalable AWS cloud infrastructure, assist with networking setup and IAM configurations, and document technical work while collaborating with architects and security teams to ensure compliance and security standards.


