Job Closed
This listing is no longer active.
Connecting top IT and Executive talents with great companies in EMEA/LATAM through tailored recruitment solutions.
Senior DevOps Engineer, AWS
Location
Bulgaria
Posted
129 days ago
Salary
0
Seniority
Senior
Job Description
Senior DevOps Engineer, AWS
RecruityTalent
• Lead operations for multi-tenant SaaS workloads on AWS, ensuring scalability, high availability, and cost efficiency • Design, implement, and maintain reliable infrastructure for production, data, and AI/ML workloads • Own incident response, postmortems, and operational runbooks to improve system reliability and reduce MTTR • Manage and enhance CI/CD pipelines supporting both application and ML deployment workflows • Build and maintain infrastructure automation using Infrastructure as Code (AWS CDK or Terraform) • Enable self-service capabilities for engineering and data science teams • Monitor and optimize cloud usage across compute, GPU, and storage resources, implementing cost controls and forecasting • Support and automate ML pipelines, including training, testing, and deployment using AWS SageMaker, Kubeflow, or MLflow • Manage GPU and compute clusters (EKS, ECS, EC2) for model training and inference workloads • Develop and maintain monitoring, alerting, observability, and security best practices • Collaborate closely with Engineering, Data, AI/ML, and PlatformOps teams to ensure smooth cross-team delivery
Job Requirements
- 7+ years of experience in DevOps/ CloudOps/ SRE
- Solid hands-on experience with AWS (Fargate, EKS, EC2, S3, RDS, Lambda, IAM, CloudWatch, CloudTrail), Kubernetes and containerized workloads
- Proficiency with CI/CD tools, Infrastructure as Code (IaC), infrastructure automation, and scripting (Python, Bash, or similar)
- Proven experience with AI/ML platforms (AWS SageMaker, Kubeflow, MLflow, or equivalent), and cost‑efficient GPU/compute optimization
- Working knowledge of MongoDB operations, monitoring, and performance tuning
- Solid understanding of FinOps principles, cloud cost monitoring, and right-sizing strategies
- Experience with production monitoring & incident management (Splunk, Grafana, OpenTelemetry)
- Exposure to multi-tenant SaaS architectures and security or compliance frameworks is a plus
- Strong collaboration, mentoring, and communication skills, with the ability to thrive in a fast-paced, evolving environment
- Excellent spoken and written English language skills.
Benefits
- Health insurance
- Paid time off
- Professional development opportunities
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Spalding, a Saalex Company is seeking a DevOps Engineer in Patuxent River, MD. Spalding, a Saalex Company is a professional services company delivering cutting-edge solutions to the Department of Defense since 2001. Our expert-level solutions include software development, information technology, program management, financial management and business intelligence services. Spalding, a Saalex Company offers competitive compensation, career development, flexible work schedules and excellent benefits. Position Type: Full-Time Salary: $115K-$135K (depending on experience) Work Location: This is a remote position. **On-Site Requirements: On-boarding will require 1-2 visits to Patuxent River, MD for candidates that are local to the area. Candidates out of state will be onboarded virtually. Training will be virtual and telework maximized/permitted to the greatest extent possible, however for local candidates, training/tasking may require on-site work a few hours per week. Future on-site/telework requirements/schedules may change as additional client direction is received. Essential Functions: - Develops DevOps functionality for CI/CD pipeline solutions. - Improves and maintains GitLab pipeline configurations. - Collaborates and assists software engineers with the design, configuration, implementation, and maintenance of CI/CD pipelines. - Assist with GitLab upgrades as received from the vendor (i.e. bi-weekly, monthly, etc.; requires evening support) - Onboards new applications/customers to the CI/CD environment. - Provides recommendations for technology advancement to streamline CI/CD tools and processes. - Provides technical assistance and troubleshooting to applications and systems deployed within a DevOps CI/CD pipeline. - Identifies, troubleshoots, and resolves pipeline issues. - Other duties as assigned or required.
SRE – Clickhouse Team
PostHogProduct analytics, session replay, feature flags, A/B testing, data warehouse, CDP, surveys. PostHog does that.
• Manage large fleets of EC2-based VMs, disks, and networking for data-intensive workloads • Improving operational tooling around deploys, schema changes, backups, restores, and incident response • Working closely with ClickHouse engineers to turn database-level needs into infra-level solutions • Reducing operational load by identifying repeat pain points and eliminating them through code and self-healing automation • Participating in on-call and incident response, with a strong focus on making incidents rarer over time • You’ll have room to design and automate, not just respond to alerts.
• Seeking a Lead AI DevOps Engineer to oversee design and delivery of advanced AI/ML/GenAI solutions. • Combines cloud engineering and automation with hands-on leadership in deploying and integrating LLM/SLM models into enterprise applications. • Leading architecture and deployment of AI/ML/GenAI solutions (LLM/SLM at scale). • Driving automation of infrastructure, model lifecycle and inference pipelines. • Overseeing CI/CD processes for AI/ML/GenAI workloads. • Designing secure, scalable cloud infrastructures (Azure-focused). • Acting as technical advisor for stakeholders and client-facing solution design. • Mentoring engineers, promoting best practices, and fostering innovation in GenAI adoption. • Coordinating cross-functional teams to align AI engineering with business outcomes. • Ensuring cost optimization, monitoring and compliance across environments.
Senior iOS DevOps Engineer
Advanced Solutions International, Inc.We help people achieve great things though innovative solutions.
• Owning the end-to-end production stability and operational health of the iOS application • Serving as the primary engineer accountable for iOS production deployments and release management • Reviewing, approving, and enforcing code quality standards for all contributions from the offshore development team • Ensuring offshore-delivered features meet performance, reliability, security, and maintainability expectations • Designing and maintaining robust CI/CD pipelines for iOS builds, testing, and App Store deployments • Automating build, signing, versioning, and release workflows to reduce manual risk • Implementing and enforcing best practices for branching strategy, pull request hygiene, and release governance • Monitoring production crashes, performance metrics, and user-impacting issues using tools such as Crashlytics, Sentry, or equivalent • Leading incident response efforts for iOS-related production outages or regressions • Establishing proactive alerting and monitoring for mobile production health • Partnering with product and engineering leadership to ensure predictable, high-quality delivery cycles • Driving improvements in test coverage, regression prevention, and deployment confidence • Maintaining secure management of certificates, provisioning profiles, and sensitive mobile deployment credentials • Coordinating closely with backend, QA, and platform teams to ensure seamless end-to-end releases • Documenting operational processes and ensure repeatable, auditable release practices • Mentoring offshore and internal engineers on iOS DevOps discipline, production ownership, and engineering excellence




