DevOps Engineer

Location

United States

Posted

39 days ago

Salary

0

Seniority

Mid Level

No structured requirement data.

Job Description

DevOps Engineer

Archesys Inc

Location: This is a remote position. Residency Requirement: Candidates MUST have lived in the U.S. for at least 3 of the past 5 years and be authorized to work in the U.S. (Citizen, Permanent Resident, or EAD). Sponsorship: This position does not offer any type of sponsorship. Candidates must already be authorized to work in the U.S. (e.g., Citizen, Permanent Resident, or EAD). Clearance: Public Trust Clearance (or ability to obtain) Archesys is a technology firm specializing in innovative cloud solutions and services for clients across various industries. We pride ourselves on cutting-edge technologies, exceptional customer service, and a collaborative work environment. This is a fully remote, full-time position. We’re looking for a mid to senior-level DevOps Engineer to strengthen our end-to-end delivery and operational capabilities across CI/CD automation, build and release engineering, container security, AWS cloud enablement, and reliability tooling. This role is also highly hands-on and innovation-driven: you’ll regularly prototype new DevOps and cloud automation approaches, validate them quickly, and turn successful prototypes into reusable standards (shared pipeline modules, secure base artifacts, and repeatable infrastructure/patterns). You’ll integrate quality and security controls using tools like JFrog Artifactory/Xray, SonarQube, GitHub, Grafana, and optionally Datadog/New Relic, while building Python-based automation that reduces toil and improves consistency across environments—including AWS services such as RDS, Glue, and QuickSight. Key Responsibilities: DevOps Integration & CI/CD Pipelines: - Design, implement, and maintain robust CI/CD pipelines using Jenkins, including Shared libraries/reusable modules to standardize delivery patterns. - Build automated workflows for build, test, security scanning, artifact publishing, and deployment, including environment promotion strategies. - Integrate CI/CD with GitHub (branching strategy, PR checks, release tagging) and enforce best practices through templates and guardrails. - Implement and maintain semantic versioning practices for pipelines, artifacts, and container images to ensure traceability and prevent overwrite. Containerization & Secure Gold Image Patterns: - Build, maintain, and continuously improve Docker base “gold images” (minimal, hardened, patched) used across engineering teams. - Apply container best practices: image hygiene, least privilege, dependency pinning, SBOM awareness, and secure defaults. - Establish an image lifecycle model (build → scan → sign/promote → deprecate) and automate patching and rebuild workflows. - Artifact Management & Supply Chain Security (JFrog) - Manage artifact workflows using JFrog Artifactory (repository structure, promotion, retention, access control). - Partner with teams to remediate findings efficiently and improve overall supply chain posture over time. - Build and maintain automation using Docker Templates for deploying a new version of gold images for existing images. Cloud Infrastructure Development & Management: - Build automation and operational patterns across AWS services, including RDS, Glue, and QuickSight. - Develop and maintain Python tooling to automate cloud operations, validations, deployments, and reporting workflows. - Improve platform reliability and repeatability through automated provisioning patterns and standardized configurations. - Build and deploy AWS resources using Terraform, and deployment using CI/CD pipeline . DevOps Tooling, Quality Gates, and Observability: - Integrate and operationalize SonarQube quality gates (coverage thresholds, code smells, vulnerability checks) within CI/CD. - Support and extend observability integrations with Grafana (dashboards/alerts, operational visibility), and optionally Datadog/New Relic. - Collaborate with engineering and SRE stakeholders to reduce toil, improve incident readiness, and standardize runbooks. Cloud Infrastructure Development & Management: - Build automation and operational patterns across AWS services, including RDS, Glue, and QuickSight. - Develop and maintain Python tooling to automate cloud operations, validations, deployments, and reporting workflows. - Improve platform reliability and repeatability through automated provisioning patterns and standardized configurations. - Build and deploy AWS resources using Terraform, and deployment using CI/CD pipeline . DevOps Tooling, Quality Gates, and Observability: - Integrate and operationalize SonarQube quality gates (coverage thresholds, code smells, vulnerability checks) within CI/CD. - Support and extend observability integrations with Grafana (dashboards/alerts, operational visibility), and optionally Datadog/New Relic. - Collaborate with engineering and SRE stakeholders to reduce toil, improve incident readiness, and standardize runbooks. Security & Compliance: - Implement and manage security controls, ensuring system integrity and compliance with industry standards and organizational policies. - Work closely with the security team to monitor for vulnerabilities and implement necessary patches and updates. - Support compliance initiatives by adhering to established IT policies and guidelines. Collaboration & Documentation: - Collaborate with cross-functional teams to develop cohesive and innovative solutions. - Create and maintain comprehensive documentation outlining system architecture, procedures, and best practices. - Document procedures, configurations, and specifications, ensuring that knowledge is effectively shared across the team. - Contribute to the maintenance of a knowledge base to facilitate continuous learning and team development. - Collaborate closely with cross-functional teams to understand their needs and develop solutions accordingly. - Participate in knowledge-sharing sessions to enhance team skills and expertise. - Assist in the creation and maintenance of technical documentation, including system designs, configurations, and procedures. Professional Development: - Continuously learn and keep up to date with emerging technologies and best practices in the cloud and DevOps space. - Take part in internal and external training to develop the necessary skills and knowledge for career growth. Qualifications: Educational Background & Experience: - Bachelor’s degree in computer science, Software Engineering, or related field, or equivalent practical experience. - 4–8+ years’ experience in DevOps / Platform Engineering / CI/CD / Cloud Engineering (mid to senior level). - Strong hands-on expertise in Jenkins, including Jenkins Shared Libraries and reusable pipeline modules. - Strong knowledge of Docker and container fundamentals (container OS concepts, layering, caching, base image strategies). - Hands-on experience with JFrog Artifactory and JFrog Xray (policies, scanning, governance, promotion workflows). - Infrastructure as Code experience (Terraform / CloudFormation / Ansible). - Strong understanding of semantic versioning and release management discipline. - Solid experience with AWS and familiarity with RDS, Glue, and QuickSight. - Proficiency in Python for automation, tooling, and integrations. - Experience with DevOps tools such as GitHub, SonarQube, and Grafana. - Strong communication skills; comfortable collaborating in a fully remote environment. Nice to Have: - AWS Certifications (e.g., Solutions Architect, DevOps Engineer). - Experience with other observability tools (e.g., Datadog, New Relic, OpenTelemetry). - Contributions to open-source projects related to Grafana or observability. - Kubernetes/ECS/EKS experience (helpful but not required). - Experience in compliance-driven environments (e.g., secure baselines, audit evidence, change control). Personal Attributes: - Excellent problem-solving abilities and analytical thinking. - Strong communication skills with the ability to articulate complex technical concepts clearly. - A collaborative mindset, able to work effectively in team environments and contribute to knowledge sharing and best practices. Additional Requirements: - A willingness to continually learn and adapt to the evolving technology landscape. - Ability to work in a fast-paced environment and manage multiple projects simultaneously. All work must be conducted within the U.S., excluding U.S. territories. Some federal contracts require U.S. citizenship to be eligible for employment. You must be legally authorized to work in the U.S. now and in the future without sponsorship. As the US Government is our clientele, you may be required to obtain a public trust or security clearance. Some of our available roles are on federal contracts that require a degree or additional years of experience as a substitute. What We Offer: - Competitive salary and benefits package, including health, dental, and vision insurance, retirement plan, and generous paid time off. - Opportunity to work with a talented team of professionals on exciting and innovative projects. - Flexible work arrangements, including remote work options. - Continuous learning and development opportunities, including access to training resources and professional development programs. - A collaborative, inclusive work environment that values diversity and encourages growth. Join us at Archesys and be part of a team dedicated to delivering cutting-edge solutions for clients in the public sector. Your expertise and passion for technology will help us continue to innovate and grow. We look forward to welcoming you to our team and supporting your success as an DevOps Engineer. Archesys participate in E-Verify. Upon hire, we will provide the federal government with your Form I-9 information to confirm that you are authorized to work in the U.S. Archesys is an equal opportunity employer committed to creating a diverse and inclusive workplace. We welcome applications from all qualified candidates, regardless of race, color, religion, sex.

Related Categories

Related Job Pages

More DevOps Engineer Jobs

BJAK logo

DevOps Engineer

BJAK

Bjak is a technology company focused on making financial services easy, fun and more rewarding for everyone

DevOps Engineer39 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

About BJAK BJAK is building the next-generation insurance and financial services platform - designed to be intuitive, intelligent and personalised. Presently we are the largest insurance platform in Southeast Asia, and expanding globally with a strong focus on technology and product superiority. We are looking for talented, ownership-minded individuals. In return, expect growth, trust, autonomy, rewards, and impact. About the Role As a DevOps Engineer, you will be responsible for building and operating the infrastructure that enables BJAK’s engineering teams to ship reliably, securely, and at scale. You will work closely with developers to design deployment workflows, maintain production systems, and continuously improve reliability, performance, and security across our cloud environments. This is a hands-on role for someone who enjoys owning systems end-to-end, solving real production issues, and building platforms that support fast-moving product teams. What you'll be doing: - Design, build, and maintain CI/CD pipelines to support automated build, test, and deployment workflows - Manage, operate, and scale Kubernetes clusters (EKS, GKE, AKS, or self-managed) - Provision and maintain cloud infrastructure using Infrastructure as Code (Terraform, Pulumi, CloudFormation) - Improve system reliability, performance, availability, and scalability - Implement and maintain monitoring, logging, and alerting systems (e.g. Prometheus, Grafana, ELK, Datadog) - Support zero-downtime deployments, rollbacks, and release strategies - Troubleshoot production issues including latency, errors (e.g. 502s), pod crashes, OOMs, and networking problems - Implement and enforce security best practices, including secrets management, IAM, and network policies - Collaborate closely with software engineers on deployment strategies, system design, and operational readiness What you'll need: - Bachelor’s degree in Computer Science or equivalent practical experience - Strong experience with Linux environments and shell scripting - Hands-on experience with Docker and Kubernetes - Experience working with at least one major cloud platform (AWS, GCP, or Azure) - Solid understanding of CI/CD systems (GitHub Actions, GitLab CI, Jenkins, etc.) - Experience implementing Infrastructure as Code - Good understanding of networking fundamentals (DNS, TCP/IP, load balancing) - Comfortable debugging and resolving production issues - Strong understanding of performance, scalability, and reliability principles - Experience with backend or web application environments - Proficiency in at least one scripting language such as Bash or Python - Ability to work independently, take ownership, and drive initiatives end-to-end Nice to have: - Experience with Groovy - Familiarity with frontend frameworks such as React or AngularJS - Familiarity with JavaScript / TypeScript - Experience working with Node.js - Familiarity with MongoDB Why join BJAK: - Work on production systems that operate at real scale across multiple markets - High ownership role with meaningful impact on reliability, security, and developer velocity - Collaborate closely with experienced engineers in a fast-moving environment - Flat structure where execution and results matter more than titles - Opportunity to build, improve, and own critical infrastructure end-to-end - Competitive compensation and long-term growth opportunities

Indonesia
Tech9 logo

Senior DevOps Engineer

Tech9

World-class software solutions built by embedded experts, delivered seamlessly with clarity, trust, and consistency.

DevOps Engineer39 days ago
Full TimeRemoteTeam 51-200Since 2015H1B No Sponsor

About the role We are seeking a Senior DevOps Engineer to own and evolve the infrastructure, deployment pipelines, and operational reliability of a high-traffic SaaS platform running on AWS. You will manage a Kubernetes-based microservices architecture serving multiple web applications, APIs, and asynchronous workloads across production, staging, and development environments. Technology Environment - Cloud: AWS (EKS, Aurora PostgreSQL, S3, ALB, WAF, IAM, VPC)| - Orchestration | Kubernetes on EKS - IaC: Terraform (modular, multi-environment) - Containers: Docker (20+ custom images) - CI/CD: GitHub Actions - Monitoring: Prometheus, Grafana, Alertmanager - Logging: Elasticsearch, Logstash, Kibana, Filebeat - Data Stores: MongoDB (replica set), Aurora PostgreSQL, Redis, RabbitMQ - Application: PHP / Symfony, Node.js, React Required Technical Skills - Cloud Infrastructure - - Deep experience with AWS, including compute, networking, storage, IAM, and managed database services. You should be comfortable designing and maintaining VPCs, load balancers, and multi-environment cloud architectures. - Container Orchestration - - Strong hands-on experience with Kubernetes in production: deploying, scaling, and troubleshooting workloads including Deployments, StatefulSets, DaemonSets, CronJobs, Ingress, RBAC, and storage. Experience with autoscaling strategies (HPA, event-driven, or scheduled) is expected. - Infrastructure as Code - - Proficiency with Terraform or similar IaC tools for provisioning and managing cloud infrastructure. Experience with modular design, remote state management, and multi-environment workflows. - Containerization - - Solid experience with Docker, including writing and optimizing Dockerfiles, managing image registries, and supporting containerized local development environments. - CI/CD Pipelines - - Experience building and maintaining automated build, test, and deployment pipelines using GitHub Actions or similar platforms. Understanding of container image promotion workflows and environment-based deployment strategies. - Monitoring & Observability - - Experience with metrics-based monitoring and alerting using tools such as Prometheus, Grafana, and Alertmanager or equivalent. Comfortable building dashboards, writing alert rules, and integrating with notification channels. - Logging & Log Aggregation - - Familiarity with centralized logging pipelines using tools such as the ELK stack (Elasticsearch, Logstash, Kibana) or similar. Understanding of log collection, transformation, indexing, and retention strategies. - Database & Data Store Operations - - Working knowledge of operating and supporting MongoDB, PostgreSQL, Redis, and RabbitMQ or similar technologies in production. This includes backup/restore procedures, replica management, and basic performance tuning. - Networking & Security - - Understanding of cloud networking fundamentals (VPCs, subnets, security groups, DNS), TLS/SSL certificate management, WAF configuration, and securing internal tools. Familiarity with authentication proxies and access control patterns. - Scripting & Automation - - Proficiency in Bash and/or Python for operational automation. Ability to read and understand application configurations (PHP, Node.js) enough to support deployment and troubleshooting workflows. Preferred Technical Skills - Experience with event-driven autoscaling tools such as KEDA - Familiarity with GitOps workflows and an interest in evolving deployment practices - Exposure to workflow automation platforms (n8n or similar) - Experience with cost optimization strategies (spot instances, right-sizing, lifecycle policies) - Familiarity with error tracking platforms such as Sentry - Experience upgrading or migrating major infrastructure tooling versions in production Interview Process - 30 Minute Screening call with a member of our recruiting team - 1 hour Technical Interview with one of our lead DevOps Engineers - 1 hour Technical and culture fit Inter view with the Hiring Manager - 30min to 1 hour Meeting with the client. (Optional) Why Join Tech9 At Tech9, we believe great talent deserves great opportunities — no matter where you are in Latin America. We build real teams, not just rosters, and we’re committed to making Tech9 a place where you can do your best work, grow your career, and thrive on your own terms. - Work with World-Class U.S. Clients - - Gain hands-on experience with top-tier American companies and high-impact projects that challenge you and elevate your career. - Competitive, USD-Based Compensation - - Your skills have global value. We pay competitively in U.S. dollars, giving you financial stability and recognition for what you bring to the table. - Grow Your Skills on Us - - All Tech9ers get access to Udemy’s full course library. Whether you want to deepen your expertise or explore something new, we invest in your growth. - A Community, Not Just a Contract - - When you join Tech9, you become a Tech9er. That means being part of a collaborative, inclusive community of talented professionals across LatAm who support each other and grow together. - Modern Tech & Agile Teams - - Work with current technologies, frameworks, and methodologies alongside teams that move fast, communicate well, and value your input.

Mexico
Tech9 logo

Senior DevOps Engineer

Tech9

World-class software solutions built by embedded experts, delivered seamlessly with clarity, trust, and consistency.

DevOps Engineer39 days ago
Full TimeRemoteTeam 51-200Since 2015H1B No Sponsor

• Own and evolve the infrastructure, deployment pipelines, and operational reliability of a high-traffic SaaS platform • Manage a Kubernetes-based microservices architecture serving multiple web applications, APIs, and asynchronous workloads • Design and maintain VPCs, load balancers, and multi-environment cloud architectures • Deploy, scale, and troubleshoot workloads including Deployments, StatefulSets, DaemonSets • Write and optimize Dockerfiles and manage image registries • Build dashboards, write alert rules, and integrate with notification channels • Understand log collection, transformation, indexing, and retention strategies • Support MongoDB, PostgreSQL, Redis, and RabbitMQ in production • Secure internal tools and manage TLS/SSL certificates

Costa Rica
Full TimeRemoteTeam 10,001+Since 1986H1B No Sponsor

As a leading financial services and healthcare technology company based on revenue, SS&C is headquartered in Windsor, Connecticut, and has 27,000+ employees in 35 countries. Some 20,000 financial services and healthcare organizations, from the world's largest companies to small and mid-market firms, rely on SS&C for expertise, scale, and technology. Job Description Get To Know Us: SS&C is leading the way. We continue to look for today's and tomorrow’s brightest talent, those who embody a spirit to improve not only their lives, but those around them. From college students to seasoned and experienced professionals, we encourage you to apply. SS&C prides itself on hiring diverse, honest, dynamic individuals who value collaboration, accountability, and innovation, to name a few. SRE Manager Locations: UK remote and occasional travel to London The Opportunity We are seeking a highly motivated, hands-on Site Reliability Engineering Manager to lead and optimise our cloud infrastructure and operations. You will report to the Director of Site Reliability Engineering. This role involves managing a small team of cloud engineers and SREs, ensuring system reliability, performance, and security across our multi-cloud environments. This role is pivotal in bridging the gap between development and operations, ensuring our systems are robust, automated, and continuously improving. You will be supporting our Next Gen Cloud Native Intelligent Automation business, which enables us to deliver RPA and cloud RPA capabilities. What You Will Get To Do: Team Leadership & Development - Lead, mentor, and grow a high-performing team of cloud engineers and SREs. - Foster a culture of ownership, continuous learning, and operational excellence. - Drive career development and performance management for team members. Operations, Reliability & Performance - Oversee daily operations of cloud environments (AWS preferred). - Implement and maintain monitoring, alerting, and automation frameworks. - Ensure compliance with security, governance, and cost management policies. - Collaborate with development and architecture teams to optimise infrastructure. - Own the availability, latency, performance, and capacity of critical services. - Help define and monitor SLAs, SLOs, and SLIs to ensure service reliability. - Lead incident response and postmortem processes, driving root cause analysis and long-term fixes. Automation & Tooling - Champion automation to reduce toil and improve system reliability. - Oversee the development and maintenance of internal observability, tools and platforms. - Collaborate with engineering and DevOps teams to embed reliability into the software development lifecycle. Collaboration & Strategy - Partner with product, engineering, DevOps and Customer Support teams to align on priorities and roadmaps. - Contribute to the strategic direction of infrastructure and reliability initiatives. - Advocate for best practices in observability, CI/CD, and infrastructure as code. What You Will Bring: - Proven experience managing a team of SREs, DevOps or platform engineers, providing performance feedback and goals. You have mentored, hired, and built a high-performing team. - Experience in defining and managing SLIs (Service Level Indicators) and SLOs (Service Level Objectives) to ensure system uptime and performance meet business needs. You have implemented improvements to prevent recurring incidents and enhance monitoring/observability. - Hands-on experience as a DevOps engineer/cloud platform engineer on AWS and container orchestration with Kubernetes. - Managing on-call rotations and overseeing incident response and triage on problems. - Experience with FinOps or cloud cost management tools. - Strong understanding of ITIL processes, security compliance, and incident management. - Proficiency in monitoring, alerting, and incident management tools (Prometheus, Grafana, PagerDuty). - Solid understanding of networking, distributed systems, and performance tuning. Preferred Qualifications - Experience in a high-scale, high-availability SaaS environment. - Familiarity with security and compliance in cloud-native environments. Why You Will Love It Here! - Your Future: Professional Development Reimbursement, including access to SS&C University - Work/Life Balance: Competitive holiday scheme - Your Wellbeing: Competitive benefits designed to support the well-being of our staff - Diversity & Inclusion: Committed to Welcoming, Celebrating and Thriving on Diversity - Training: Hands-On, Team-Customised throughout your career We encourage applications from people of all backgrounds to enable us to bring diverse perspectives to our thinking and conversation. It's important to us that we strive to have a workforce that is diverse in the widest sense. Thank you for your interest in SS&C! If applicable, to further explore this opportunity, please apply directly with us through our Careers page on our corporate website @ www.ssctech.com/careers. Unless explicitly requested or approached by SS&C Technologies, Inc. or any of its affiliated companies, the company will not accept unsolicited resumes from headhunters, recruitment agencies, or fee-based recruitment services. Applications will be accepted on an ongoing basis until the position is filled. SS&C Technologies is an Equal Employment Opportunity employer and does not discriminate against any applicant for employment or employee on the basis of race, color, religious creed, gender, age, marital status, sexual orientation, national origin, disability, veteran status or any other classification protected by applicable discrimination laws.

United Kingdom