Quality People. Quality Software.
Senior DevOps Engineer
Location
Romania
Posted
83 days ago
Salary
0
Seniority
Senior
Job Description
Senior DevOps Engineer
Lateral Group
• Take ownership of cloud infrastructure across Azure and AWS - with responsibility for uptime, performance, security, and scalability. • Build Infrastructure as Code (IaC) using tools like Terraform, Pulumi, CloudFormation, ARM/Bicep, or similar. • Design and maintain CI/CD pipelines enabling safe, fast, and repeatable deployments. • Implement SRE best practices: monitoring, alerting, observability, incident response, root-cause analysis, and ongoing reliability improvements. • Automate operational tasks to reduce manual work, improve reliability, and increase engineering velocity. • Perform environment hardening, secret management, identity and access management, network configuration, and secure cloud patterns. • Proactively identify risks, gaps, and bottlenecks and fix them before they become incidents. • Collaborate with engineering teams to architect cloud solutions that balance reliability, speed, and cost efficiency. • Participate in on-call rotations (sensible, fair, and well-scoped) as needed for reliability-sensitive projects. • Communicate clearly with internal stakeholders and occasionally with clients, explaining technical decisions, tradeoffs, and mitigation plans.
Job Requirements
- Proven experience in DevOps or SRE roles supporting modern cloud-native systems at scale.
- Strong cloud expertise - especially with AWS and Azure; cloud certifications strongly preferred (AWS Solutions Architect, AWS DevOps Engineer, Azure DevOps Engineer Expert, Azure Solutions Architect, etc.).
- Strong Infrastructure as Code experience (Terraform ideal; Pulumi/CloudFormation/ARM also welcome).
- Expertise in building and owning CI/CD pipelines (GitHub Actions, GitLab CI, Azure DevOps, Jenkins, etc.).
- Experience implementing and operating monitoring, logging, metrics, alerting, and full observability stacks (Prometheus, Grafana, CloudWatch, Azure Monitor, ELK/EFK, Datadog, etc.).
- Strong understanding of security, networking, identity, container orchestration, and cloud architecture.
- A reliability-first mindset: eliminate toil, automate everything possible, design for failure, measure everything.
- Excellent communication skills: you can explain risks, incidents, tradeoffs, and architecture clearly to both technical and non-technical people.
- High integrity and sound judgment: you will be trusted deeply; reliability and discretion matter more than any tool or technology.
Benefits
- Real Impact: You’ll work on meaningful products that make a measurable difference - from healthcare and commerce to sustainability and next-gen tech.
- Remote-First, Office Friendly: Work from wherever you’re most productive - whether that’s your home, a co-working space, or one of our offices.
- An Outstanding Team: Talented, kind, and hard-working people who care deeply about their craft - and about each other. No egos. No politics. Just professionals doing their best work.
- Growth: You’ll be supported in growing your craft, exploring new paths, and stepping into greater responsibility - at your own pace.
- A Culture of Excellence: We care deeply about doing the right thing - for our clients, our team, and ourselves. No burnout. No crunch. Just high-quality work, delivered sustainably.
- Variety & Stability: We’re profitable, independent, and over a decade strong. Yet every project brings a fresh challenge. You’ll never be bored here.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Field Deployment Engineer
AIMExplain AI, And Its Commercial, Social And Political Impact. For Brand collaborations, write to info@aimmediahouse.com
• Lead physical installation of AIM hardware on heavy equipment (compute, sensors, wiring, power). • Execute mechanical and electrical integration following standardized install procedures. • Perform initial power-on, safety validation, and hardware acceptance checks. • Validate E-Stops, lockout/tagout, and safety-critical wiring. • Identify install-related failure modes (vibration, dust, heat, strain relief). • Propose improvements to mounts, harnessing, enclosures, and service access. • Create and maintain install guides, wiring diagrams, and checklists. • Define install acceptance criteria and handoff standards. • Train and enable dealers to perform installs safely and independently. • Act as the bridge between customers and AIM engineering - clear, honest, technically grounded communication. • Manage expectations around downtime, features, maintenance windows, and machine readiness. • Build trust through competence, transparency, and ownership.
• Build and operate the engineering automation platform that enables repeatable, secure, and scalable deployment of the GPU cloud platform across environments. • Design and maintain the tooling, pipelines, and infrastructure-as-code foundations used to deploy vendor components, platform services, and supporting environments consistently and at scale. • Ensure that deployments are reproducible, secure, and maintainable. • Design, build, and maintain CI/CD pipelines for platform components and supporting services. • Manage infrastructure automation using Terraform and related IaC tooling. • Implement GitOps-style deployment workflows for Kubernetes-based platform services. • Standardize and automate environment provisioning across development, staging, and production. • Maintain developer environments and improve engineering productivity through better tooling and automation. • Automate deployment of vendor and platform components such as Kubernetes, databases, storage, observability, identity, and messaging systems. • Manage secrets and configuration workflows using centralized secret management practices.
• Work with AWS and GCP infrastructure (IaC, networking, compute, IAM, secrets), secure access and authorization mechanisms (DB auth, IAM flows, audit). • Communicate with various stakeholders to understand different infrastructure-related challenges, provide solution plans and ensure success of the initiatives you own. • Collaborate closely with developers to understand their workflows, pain points, and constraints; ensure platform decisions improve developer experience.
• LCI/CD Evolution: Automate and optimize CI/CD pipelines so changes flow from commit to production with speed and predictable quality. • Infrastructure & Orchestration: Strengthen container orchestration to support smooth scaling during load spikes and variable traffic patterns. • Performance & Observability: Improve monitoring, dashboards, and alerting to surface issues early and reduce disruptions for end users. • Security & Data Integrity: Enhance infrastructure-level security and data protection across environments. • Internal Tooling & Developer Velocity: Improve internal tools so engineers can debug faster, experiment safely, and maintain development momentum.




