Job Closed

This listing is no longer active.

Divio logo
Divio

Operate, orchestrate, protect, mirror, backup, and migrate your web apps on Divio's external development platform.

Senior DevOps Engineer / Site Reliability Engineer

DevOps EngineerDevOps EngineerFull TimeRemoteSeniorTeam 11-50Since 2001H1B No SponsorCompany SiteLinkedIn

Location

Switzerland

Posted

96 days ago

Salary

0

Seniority

Senior

Job Description

Senior DevOps Engineer / Site Reliability Engineer

Divio

• Keeping the lights on: patching, tuning, incident response, and keeping the stack healthy • Pushing long-term improvements: migrations, internal tooling, security hardening, monitoring revamps • Shaping the infrastructure: evolving our setup to stay modern, secure, and developer-friendly • helping: our internal support crew for technical questions or the external dev team with their day-to-day

Job Requirements

  • Must have:**
  • Excellent command of written and spoken English
  • Solid expertise in **Docker**, **AWS**, and cloud-native infrastructure
  • Programming experience, ideally in **Python** and **TypeScript** (but Go, Java, etc. also welcome)
  • Experience with configuration management and Infrastructure as Code, ideally **Ansible** and **AWS CDK**
  • Strong foundational knowledge (Linux, networking, the TCP/IP stack, load balancing, etc.)
  • A reliable and proactive mindset, you take responsibility and can work independently
  • Comfort with support tasks and professionalism when communicating with clients
  • Nice to have:**
  • Hands-on Linux system administration and tuning experience
  • Familiarity with Django or other Python web frameworks
  • Experience using AWS CDK, specifically with **TypeScript**
  • Operational knowledge of services like PostgreSQL, Redis, RabbitMQ, Elasticsearch
  • Any experience with the tools and technologies mentioned in our stack

Related Categories

Related Job Pages

More DevOps Engineer Jobs

Innovaccer logo

3954- Site Reliability Engineer II

Innovaccer

Two years in a row: Innovaccer Awarded Best in KLAS Data & Analytics Platforms Category.

DevOps Engineer96 days ago
OtherRemoteTeam 1,001-5,000Since 2014H1B Sponsor

About the Role We at Innovaccer are looking for a Site Reliability Engineer-II to build secured modern healthcare cloud infrastructure and a massive data stack and aim to write everything as code A Day in the Life - Take ownership of SRE pillars: Deployment, Reliability, Scalability, Service Availability (SLA/SLO/SLI), Performance, and Cost. - Lead production rollouts of new releases and emergency patches using CI/CD pipelines while continuously improving deployment processes. - Establish robust production promotion and change management processes with quality gates across Dev/QA teams. - Roll out a complete observability stack across systems to proactively detect and resolve outages or degradations. - Analyze production system metrics, optimize system utilization, and drive cost efficiency. - Manage autoscaling of the platform during peak usage scenarios. - Perform triage and RCA by leveraging observability toolchains across the platform architecture. - Reduce escalations to higher-level teams through proactive reliability improvements. - Participate in the 24x7 OnCall Production Support team. - Lead monthly operational reviews with executives covering KPIs such as uptime, RCA, CAP (Corrective Action Plan), PAP (Preventive Action Plan), and security/audit reports. - Operate and manage production and staging cloud platforms, ensuring uptime and SLA adherence. - Collaborate with Dev, QA, DevOps, and Customer Success teams to drive RCA and product improvements. - Implement security guidelines (e.g., DDoS protection, vulnerability management, patch management, security agents). - Manage least-privilege RBAC for production services and toolchains. - Build and execute Disaster Recovery plans and actively participate in Incident Response. - Work with a cool head under pressure and avoid shortcuts during production issues. - Collaborate effectively across teams with excellent verbal and written communication skills. - Build strong relationships and drive results without direct reporting lines. - Take ownership, be highly organized, self-motivated, and accountable for high-quality delivery.  What You Need - 4–7 years in production engineering, site reliability, or related roles. - Solid hands-on experience with at least one cloud provider (AWS, Azure, GCP) with automation focus (certifications preferred). - Strong expertise in Kubernetes and Linux. - Proficiency in scripting/programming (Python required). - Observability is very critical for the scale of our systems and ability to find insights/behavior, detect problem/failures. Looking for leads to drive this charter spanning across logs, metrics, mesh, tracing etc.  - Knowledge of CI/CD pipelines and toolchains (Jenkins, ArgoCD, GitOps). - Familiarity with persistence stores (Postgres, MongoDB), data warehousing (Snowflake, Databricks), and messaging (Kafka). - Exposure to monitoring/observability tools such as ElasticSearch, Prometheus, Jaeger, NewRelic, etc. - Proven experience in production reliability, scalability, and performance systems. - Experience in 24x7 production environments with process focus. - Familiarity with ticketing and incident management systems. - Security-first mindset with knowledge of vulnerability management and compliance. - Advantageous: hands-on experience with Kafka, Postgres, and Snowflake. - Excellent judgment, analytical thinking, and problem-solving skills. - Ability to quickly identify and drive optimal solutions within constraints. - Lead least privilege based RBAC for various production services and tool chains. - Able to perform with cool head under pressure situations without taking any shortcuts. - Collaboration with solid verbal and oral communication skills are very critical to this role. Strong cross-functional collaboration skills, relationship building skills, and ability to achieve results without direct reporting relationships - Ability to quickly identify and drive to the optimal solution when presented with a series of constraints. - Excellent judgment, analytical thinking, and problem-solving skills. - Self-motivated individual that possesses excellent time management and organizational skills. - Strong sense of personal responsibility and accountability for delivering high quality work.  We offer competitive benefits to set you up for success in and outside of work. Here’s What We Offer - Generous Paid Time Off: Recharge and relax with 22 days of fixed time off per year, in addition to company holidays—because we believe work-life balance fuels performance. - Best-in-Class Parental Leave: Spend quality time with your growing family. We offer one of the industry’s most generous parental leave policies to support you during life’s most important moments. - Recognition & Rewards: We celebrate wins—big and small. Get rewarded with monetary incentives and company-wide recognition for your impact and dedication. Your hard work won’t go unnoticed. - Comprehensive Insurance Coverage: Stay covered with medical, dental, and vision insurance, plus 100% company-paid short- and long-term disability and basic life insurance. Optional perks include discounted legal aid and pet insurance. Innovaccer Inc. is an equal opportunity employer. We celebrate diversity and are committed to fostering an inclusive workplace where all employees feel valued and empowered regardless of any characteristic protected by federal, state or local law including, without limitation, race, color, religion, gender, gender identity or expression, sexual orientation, national origin, genetics, medical condition, disability, age, marital status, or veteran status. Innovaccer Inc. participates in the E-Verify program to confirm employment eligibility of all newly hired employees based out of the U.S. and employed by Innovaccer Inc. For any additional information, please visit the below websites: E-Verify Right to Work (English) Right to Work (Spanish) Disclaimer:  Innovaccer does not charge fees or require payment from individuals or agencies for securing employment with us. We do not guarantee job spots or engage in any financial transactions related to employment. If you encounter any posts or requests asking for payment or personal information, we strongly advise you to report them immediately to our HR department at px@innovaccer.com. Additionally, please exercise caution and verify the authenticity of any requests before disclosing personal and confidential information, including bank account details.

United States
Job Closed
Toast logo

Forward Deployment Engineer

Toast

We empower the restaurant community to delight guests, do what they love, and thrive.

DevOps Engineer96 days ago
OtherRemoteTeam 1,001-5,000Since 2013H1B Sponsor

• Hands-on Execution: Design and Ship production grade agentic solutions tailored specifically for high-velocity GTM motions • Rapid Prototyping: Transition from ambiguous business problems to functional AI Proof-of-Concepts (PoCs) within 1–2 week sprint cycles • Process Discovery: Embed directly with Sales, Service, and Marketing leaders to deconstruct high-friction manual workflows ripe for AI-driven transformation • Strategic Integration: Architect the flow between LLMs and the GTM stack (Salesforce, Clay, Snowflake) with a "Golden Record" mindset to ensure data integrity at scale • ROI Modeling: Translate technical metrics into business outcome driven metrics • Architectural Resilience: Act as the domain expert for complex synchronization challenges, ensuring GTM systems are built for concurrency, scale, and reliability

United States
$200K - $320K / year
Job Closed
Typeform logo

DevSecOps Engineer

Typeform

Typeform is a Barcelona, Spain-based online SaaS (Software-as-a-Service) organization that creates software for building online forms. Typeform allows companies to create beautiful

DevOps Engineer96 days ago

• Embed security into the software development lifecycle, enabling teams to ship features safely at high velocity. • Partner with engineering and AI teams to assess and mitigate security risks for new AI features, infrastructure, and pipelines. • Develop and maintain secure CI/CD pipelines, tooling, and automation to support rapid deployment. • Conduct threat modeling, vulnerability assessments, and code reviews for new services and AI workloads. • Advise on secure architecture patterns for infrastructure, agent systems, and model-serving environments. • Implement monitoring, alerting, and incident response practices for critical security events. • Define best practices, standards, and policies for secure feature delivery across engineering teams. • Act as the internal security advocate, balancing risk management with product velocity.

United Kingdom
GovCIO logo

Senior Developer/DevSecOps Engineer

GovCIO

GovCIO is a service-disabled-veteran-owned small business (SDVOSB) that offers technology services to improve business performance for government organizations.

DevOps Engineer96 days ago

Overview GovCIO is currently hiring for a Senior Developer/DevSecOps Engineerto support our client’s contract needs. This position is located in the Washington, DC and will be a remote position with intermittent visits to customer location.  Responsibilities JBOSS - Install JBoss EAP on supported platforms (Linux, RHEL, Windows). - Configure in standalone or domain mode, depending on architecture needs. - Apply Red Hat-supported RPMs or ZIP installations and ensure compliance with licensing. - Deploy and manage Java EE applications (WAR/EAR) via: - Management CLI - Admin Console - Automation scripts (Ansible, shell) - Enable rolling deployments, hot deployment - Set up HTTPS/SSL with trusted certificates and secure keystores. - Enforce RBAC (Role-Based Access Control) using the management realm. - Configure security domains, JAAS, and Elytron security (modern Red Hat EAP security subsystem). - Manage key EAP subsystems: - Datasources (JDBC) - JMS (ActiveMQ Artemis) - Web (undertow) - EJB, JPA, JAX-RS, JTA, JNDI - Modify configurations via: - Management CLI - xml or domain.xml - JBoss Management API - Monitor JVM and application performance with tools like: - JConsole - JMC (Java Mission Control) - JBoss CLI - Tune JVM options, garbage collection, connection pools, and thread pools. - Analyze logs (server.log, boot.log) and configure log rotation and log levels. - Apply Red Hat-provided patches and updates using RHSM or offline methods. - Maintain backup procedures for: - Configuration files - Deployed apps - Domain/host controllers (in domain mode) - Prepare and test disaster recovery procedures and environment restoration. - Integrate JBoss EAP with: - Red Hat AMQ - Connect to external systems like databases, message brokers, or logging systems (ELK stack). - Maintain up-to-date documentation on: - Configuration changes - System architecture - Patching history - Implement audit logging and track changes for compliance. - Work with DevSecOps teams to ensure EAP adheres to security best practices. - Troubleshoot: - Deployment failures - Classloading conflicts - Transaction rollbacks - Application or subsystem crashes - Interface with Red Hat Support via the Customer Portal and create support cases when needed. - Automate tasks using: - Ansible (especially Red Hat Certified Collections) - JBoss CLI scripting - Shell/Python scripts - Integrate EAP deployments with CI/CD pipelines (Jenkins, GitLab, Tekton). - Support EAP clustering, session replication, and high availability. - Manage load balancing with Apache HTTPD, mod_cluster, or HAProxy. - Manage SSL certificates and domain configurations, ensure SSL certificates are renewed on a timely manner - Stay up-to-date with JBOSS releases and new features. - Execute, test and document upgrade procedures in lower and production environments    Artifactory - Deploy and configure Artifactory instances, ensuring they meet organizational requirements for scalability and high availability. - Tune Artifactory settings, implement caching strategies, and optimize storage solutions to enhance performance and scalability. - Utilize tools like Prometheus, Grafana, and JFrog Mission Control to monitor system health, set up alerts, and ensure continuous operation. - Define and manage user roles and permissions to control access to repositories and artifacts, ensuring security and compliance. - Integrate Artifactory with LDAP, SSO, or other authentication systems to streamline user management. - Integrate JFrog Xray with Artifactory to scan artifacts for security vulnerabilities and license compliance. - Implement fine-grained access control using users, groups, permissions, and permission targets. - Ensure that backups are encrypted and access-controlled to prevent unauthorized access to sensitive data. - Pipeline Integration: Integrate Artifactory with CI/CD tools like Jenkins, GitLab CI, and others to automate artifact storage and retrieval. - Implement processes to promote artifacts through different stages of the development lifecycle, such as development, staging, and production. - Develop scripts to automate routine tasks, such as repository cleanup and artifact promotion. - Set up and manage local, remote, virtual, and federated repositories to organize and control access to artifacts. - Regularly clean up repositories by removing obsolete artifacts and optimizing storage usage. - Configure repository replication and federated repositories to ensure consistent access to artifacts across geographically distributed teams. - Monitor the health and performance of Artifactory instances using integrated monitoring tools. - Generate reports on repository usage, artifact storage, and user activity to inform decision-making. - Set up proactive alerting mechanisms to detect and resolve issues promptly. Apply security patches and updates in a timely manner.   DevSecOps Engineering - Embed security checks into CI/CD pipelines (e.g., GitHub Actions, Jenkins, GitLab CI). - Automate code scanning, dependency scanning, and container image scanning. - Integrate tools like: - SAST (Static Application Security Testing) — e.g., SonarQube, Fortify - DAST (Dynamic Application Security Testing) — e.g., OWASP ZAP, Burp Suite - SCA (Software Composition Analysis) — e.g., Snyk, WhiteSource, Black Duck - Promote secure coding practices via developer training and secure coding guidelines. - Define and enforce security policies for app configuration, secrets, encryption, etc. - Use Infrastructure as Code (IaC) tools like Terraform or Ansible securely. - Scan IaC templates for misconfigurations (e.g., with Checkov, tfsec, Terrascan). - Secure cloud resources (AWS, Azure, GCP) using Cloud Security Posture Management (CSPM) tools. - Set up IAM policies, network segmentation, and encryption at rest/in transit. - Participate in threat modeling sessions with development teams. - Identify potential attack vectors in the architecture (e.g., privilege escalation, insecure APIs). - Prioritize and remediate identified risks based on severity and impact. - Monitor and manage vulnerabilities in: - Code - Containers - Dependencies - Infrastructure - Integrate tools like Trivy, Clair, Aqua, or Anchore into pipelines. - Track vulnerability metrics, triage findings, and enforce SLAs for remediation. - Harden container images using minimal base images and security scanning. - Enforce policies using tools like OPA/Gatekeeper, Kyverno, or PodSecurity Standards. - Configure Kubernetes RBAC, network policies, and secrets management. - Implement runtime protections with tools like Falco, Sysdig, or Kube-bench. - Develop custom scripts/tools for security automation (Python, Bash, Go). - Automate certificate management, secrets rotation, and access provisioning. - Maintain DevSecOps toolchains across dev, test, and prod environments. - Collaborate with development, QA, operations, and security teams. - Align with compliance standards (e.g., SOC 2, ISO 27001, PCI-DSS, HIPAA). - Define security policies, guardrails, and governance workflows. - Integrate security monitoring into observability platforms (e.g., ELK, Grafana, Splunk). - Enable SIEM and SOAR integrations for real-time threat detection and alerting. - Support incident response and forensics when security events occur. Qualifications - Bachelor’s degree with 12 years (or commensurate experience) Master’s degree and 7 years of experience. Required Skills and Experience - Experience with JBOSS, Java EE applications, Red Hat - In-depth knowledge of Artifactory - Proven experience with DevSecOps Engineering Clearance Required: Must be able to obtain and maintain AOUSC Public Trust   Preferred Skills and Experience - Masters degree Company Overview GovCIO is a team of transformers--people who are passionate about transforming government IT. Every day, we make a positive impact by delivering innovative IT services and solutions that improve how government agencies operate and serve our citizens. But we can't do it alone. We need great people to help us do great things - for our customers, our culture, and our ability to attract other great people. We are changing the face of government IT and building a workforce that fuels this mission. Are you ready to be a transformer?   What You Can Expect   Interview & Hiring Process If you are selected to move forward through the process, here’s what you can expect: - During the Interview Process - Virtual video interview conducted via video with the hiring manager and/or team - Camera must be on - A valid photo ID must be presented during each interview - During the Hiring Process - Enhanced Biometrics ID verification screening - Background check, to include: - Criminal history (past 7 years) - Verification of your highest level of education - Verification of your employment history (past 7 years), based on information provided in your application Employee Perks At GovCIO, we consistently hear that meaningful work and a collaborative team environment are two of the top reasons our employees enjoy working here. In addition, our employees have access to a range of perks and benefits to support their personal and professional well-being, beyond the standard company offered health benefits, including: - Employee Assistance Program (EAP) - Corporate Discounts - Learning & Development platform, to include certification preparation content - Training, Education and Certification Assistance* - Referral Bonus Program - Internal Mobility Program - Pet Insurance - Flexible Work Environment *Available to full-time employees   Our employees’ unique talents and contributions are the driving force behind our success in supporting our customers, which ultimately fuels the success of our company. Join us and be a part of a culture that invests in its people and prioritizes continuous enhancement of the employee experience.   We are an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, gender, gender identity or expression, sexual orientation, national origin, disability, or status as a protected veteran. EOE, including disability/vets.   Posted Pay Range The posted pay range, if referenced, reflects the range expected for this position at the commencement of employment, however, base pay offered may vary depending on multiple individualized factors, including market location, job-related knowledge, skills, education, experience, and internal equity. The total compensation package for this position may also include other compensation elements, to be discussed during the hiring process. If hired, employee will be in an “at-will position” and the GovCIO reserves the right to modify base salary (as well as any other discretionary payment or compensation program) at any time, including for reasons related to individual performance, GovCIO or individual department/team performance, and market factors. Posted Salary Range USD $145,000.00 - USD $155,000.00 /Yr.

United States
Job Closed