Job Closed

This listing is no longer active.

SRE – OPENSTACK / PRIVATE CLOUD OPERATIONS

Cloud EngineerCloud EngineerContractRemoteMid LevelTeam 51-200

Location

Mexico

Posted

28 days ago

Salary

0

Seniority

Mid Level

Job Description

SRE – OPENSTACK / PRIVATE CLOUD OPERATIONS

DYNE Group

Role Description Looking for an SRE with strong experience in OpenStack and private cloud environments. Role focuses on production support, troubleshooting, and platform reliability. Requires hands-on expertise in Linux, networking, and storage. Involves close collaboration with engineering teams and customer interaction. Key Responsibilities - Troubleshoot complex issues in OpenStack and Linux environments - Manage and support OpenStack services including Nova, Neutron, Cinder, and Keystone - Perform root cause analysis (RCA) and drive long-term fixes - Participate in incident management and on-call rotations - Monitor system performance, availability, and reliability - Collaborate with engineering teams on fixes and improvements - Communicate effectively with customers via calls and written channels - Perform system optimization and performance tuning Qualifications - Linux, Networking & Storage Fundamentals - Strong understanding of Linux internals and system performance - Experience with kernel tuning and troubleshooting - Hands-on experience with filesystems and disk management - Knowledge of partitions and system-level troubleshooting - Experience with LVM and SCSI multipath - Basic understanding of Ceph - Ability to troubleshoot IO and performance issues - Knowledge of DHCP, DNS, VLANs, and network bonding - Understanding of basic routing concepts Requirements - Hands-on experience with OpenStack services such as Nova, Neutron, Cinder, and Keystone - Experience managing production environments - Strong troubleshooting and debugging skills - Ability to handle customer-facing technical issues - Experience performing root cause analysis Good to Have Skills - Basic understanding of Kubernetes concepts - Experience with monitoring tools like Prometheus and Grafana - Knowledge of metrics, logging, and alerting systems - Basic scripting skills in Python or Go - Exposure to automation and observability practices Soft Skills - Strong problem-solving and analytical thinking - Ability to work in high-pressure production environments - Clear and effective communication skills - Proactive mindset toward issue prevention - Comfortable working in remote, distributed teams

Related Categories

Related Job Pages

More Cloud Engineer Jobs

Senior Software Engineer, Cloud Infrastructure

Zoo

Infrastructure for Hardware Design.

Cloud Engineer28 days ago
Full TimeRemoteTeam 11-50H1B No Sponsor

• Design, implement, and maintain core systems and services that support our platform’s reliability, performance, and scalability. • Build automation and tooling that reduce operational overhead, improve developer workflows, and enforce system-level consistency. • Develop integrations between our systems and external platforms (e.g., billing, CRM, authentication, analytics). • Improve observability, monitoring, and alerting across services to ensure strong operational visibility. • Architect and maintain infrastructure components (e.g., distributed services, data pipelines, deployment automation). • Contribute to security-focused systems work, such as permissions controls, access flows, and auditability. • Troubleshoot issues across services and layers, backend, infrastructure, and occasionally frontend, taking full ownership from diagnosis to resolution. • Jump into frontend code as needed to close the loop on system-level changes or fix issues that block broader system reliability.

California
$145K - $195K / year
Sev1Tech LLC logo

Cloud Engineer

Sev1Tech LLC

Better Solutions. Faster.

Cloud Engineer28 days ago
Full TimeRemoteTeam 501-1,000Since 2010H1B No Sponsor

• Serve as the principal IT engineer to provide engineering expertise and guidance for the design, development, implementation, and sustainment of IT solutions that enhance overall system performance and availability. • Act as primary liaison between NAVSUP OIS customer and third-party cloud broker (NAVAIR) in operational maintenance of the NAVSUP OIS Cloud architecture and ecosystem, to include access, cost estimation, service requests, and new resource requests • Assists in development of agile methodology (SAFe, Scrum, Scrum of Scrums, Kanban, DevSecOps). This includes backlog structure and management, governance/change control, and other key processes that lead the execution at the portfolio level. • Conduct engineering system development tasks including system requirements analysis, system architecture and design, preliminary design, critical design, system integration, formal verification, and formal validation. • Perform Requirements Analysis Engineering and other system engineering practices. • Produce technical documentation in support of engineering solutions that include, but are not limited to, architectural drawings, detailed design specifications, and thorough implementation instructions • Assist with operational, technical tool standardization, maintenance of CI/CD pipelines, and system and services views for architectures using applicable Navy set security standards. • Maintain standards compliance • Guide and assist DevOps/DevSecOps teams with pipeline best practices and tool selection • Develops, updates, and maintains program of record documentation. • Manages and monitors all installed systems and infrastructure • Assist with the installation, configuration, testing, and maintenance of operating systems, application software and system management tools • Pro-actively ensures the highest levels of systems and infrastructure availability • Monitor and test application performance for potential bottlenecks, identify possible solutions, and work with developers to implement those fixes • Maintain security, backup, and redundancy strategies • Guide and assist team in writing and maintaining custom scripts to increase system efficiency and lower the human intervention time on any tasks

Pennsylvania
Job Closed
Mindex logo

Cloud Engineer

Mindex

We don’t just get the job done: we constantly think about how to get it done better.

Cloud Engineer28 days ago
Full TimeRemoteTeam 201-500Since 1998H1B Sponsor

Role Description We are looking for a motivated Cloud Operations Engineer to support our SaaS platform's infrastructure. Working as part of the CloudOps team, you will collaborate with Software Engineering to implement, automate, and maintain the AWS environments that power our software. You’ll leverage your Windows Systems Administration background to help transition legacy components into modern, automated cloud workflows while ensuring our production environment remains stable, secure, and performant. - Collaborative Implementation: Work alongside Software Engineering to deploy and configure the AWS resources required for new product features, ensuring they align with established platform standards. - Infrastructure Automation: Develop and update Infrastructure-as-Code (IaC) templates (Terraform or CloudFormation) to manage our SaaS environments, moving away from manual configurations to repeatable automation. - Environment Support: Manage and troubleshoot the health of our Windows based cloud instances, ensuring high availability for our customer base. - Deployment Support: Assist in managing CI/CD pipelines to facilitate smooth code releases. Help troubleshoot deployment failures to ensure minimal impact on the development lifecycle. - Proactive Monitoring: Configure and maintain monitoring, logging, and alerting systems (such as CloudWatch) to provide visibility into platform performance and identify potential issues before they impact users. - Security & Compliance Tasks: Implement security best practices as directed, including managing IAM roles, VPC configurations, and security group updates to maintain a hardened SaaS posture. - Resource Optimization: Monitor resource utilization and assist in executing cost-optimization tasks, such as decommissioning unused resources or adjusting instance types for better efficiency. - Incident Support: Participate in the incident management process by investigating technical alerts, performing initial root-cause analysis, and coordinating with senior engineers and developers for resolution. - Continuous Learning: Stay current on AWS services and DevOps tools, contributing to the team’s documentation and helping to refine internal operational processes. - Availability: Participate in a team on-call rotation and perform scheduled after-hours technical work to support platform stability and high-priority maintenance. Qualifications - AS or BS degree in Computer Science, Engineering, or related area, and/or an equivalent combination of education and experience. - 2+ years of hands-on experience with cloud infrastructure and services. Requirements - Strong understanding of cloud security principles and best practices. - Proficiency in Infrastructure-as-Code (IaC) tools such as Terraform or CloudFormation. - Hands-on experience supporting MSSQL environments, including a solid understanding of database fundamentals and maintenance. - Strong scripting skills in PowerShell and Python to automate operational tasks. - Excellent communication, interpersonal, and problem-solving skills. - Strong focus on exceptional customer service. - Aptitude and desire to work in a highly collaborative, self-organizing, team environment. - Highly organized & accurate. Benefits - Health insurance - Paid holidays - Paid time off - 401k retirement savings plan and company match with pre-tax and ROTH options - Dental insurance - Vision insurance - Employer paid disability insurance - Life insurance and AD&D insurance - Employee assistance program - Flexible spending accounts - Health savings account with employer contributions - Accident, critical illness, hospital indemnity, and legal assistance - Adoption assistance - Domestic partner coverage - Mindex Perks - Tickets to local sporting events - Teambuilding events - Holiday and celebration parties - Professional Development - Leadership training - License to Udemy online training courses - Growth opportunities

United States
$90K - $120K / year
2COMS Consulting Pvt. Ltd. logo

Project Manager – Colo to Public Cloud Migration

2COMS Consulting Pvt. Ltd.

Recruitment I General Staffing I IT Staffing I GigForce I Apprenticeship Implementation I Hire Train Deploy

Cloud Engineer28 days ago
Full TimeRemoteTeam 10,001+Since 1999H1B No Sponsor

• Own end-to-end delivery of assigned scope, including: A customer migration wave or A core enablement workstream critical to migration success • Build and manage detailed project plans, dependencies, milestones, and delivery timelines • Proactively manage risks, issues, assumptions, and decision tracking • Act as the primary PM for assigned customers • Coordinate with Customer Success, Professional Services, Managed Services, or Partners • Ensure deliverables meet readiness criteria for downstream migration waves • Adhere strictly to the program’s reporting and execution framework • Provide consistent, accurate status, dependency, and risk reporting • Escalate cross-workstream impacts that could affect customer migration timelines • Contribute to continuous improvement and readiness-to-execution transition efforts

India
Job Closed