Job Closed
This listing is no longer active.
We make sense of data to drive your business forward.
Lead Cloud Engineer
Location
United States
Posted
25 days ago
Salary
$155K / year
Seniority
Senior
Job Description
Lead Cloud Engineer
EXL
- Design, implement, and manage cloud infrastructure on AWS to support AI, data, and application workloads
- Develop and maintain Infrastructure as Code (IaC) using tools such as Terraform, CloudFormation, or similar
- Build and manage CI/CD pipelines to support automated build, test, and deployment of applications and data services
- Manage containerized environments using Docker and container orchestration platforms
- Implement monitoring, logging, and alerting frameworks to ensure high availability and reliability of systems
- Optimize infrastructure performance, scalability, and cost across cloud environments
- Implement and maintain security best practices, including IAM policies, network security, and secrets management
- Exposure to AI initiatives and experience supporting infrastructure needs for AI workflows
- Collaborate with data engineers to support deployment and scaling of AI and data platforms
- Support platform reliability through automation, system monitoring, and incident response processes
- Maintain documentation for infrastructure architecture, deployment processes, and operational procedures
Job Requirements
- 6–9 years of experience in DevOps, Cloud Engineering, or Infrastructure Engineering roles
- Strong hands-on experience with AWS cloud services
- Experience implementing Infrastructure as Code using Terraform, CloudFormation, or similar tools
- Experience with CI/CD tools and automation pipelines
- Familiarity with containerization technologies such as Docker and container orchestration platforms
- Experience implementing monitoring and logging solutions for production systems
- Understanding of cloud networking, security best practices, and identity management
- Experience supporting data platforms, distributed systems, or AI/ML workloads is a plus
- Strong troubleshooting, automation, and collaboration skills
Benefits
- For more information on benefits and what we offer please visit us at https://www.exlservice.com/us-careers-and-benefits
More Cloud Engineer Jobs
Cloud Support Engineer II/Azure VMware Solution Operations Support Specialist
ASM Research
It is the policy of ASM that an individual's race, color, religion, sex, disability, age, sexual orientation or national origin are not and will not be considered in any personnel or management decisions. We affirm our commitment to these fundamental policies. All recruiting, hiring, training, and promoting for all job classifications is done without regard to race, color, religion, sex, disability, or age. All decisions on employment are made to abide by the principle of equal employment.
Role Description
Seeking skilled professionals to join our Azure VMware Solution (AVS) operations support team. In this role, you will be responsible for providing comprehensive 24/7 technical support for Microsoft's cloud-based VMware infrastructure. The ideal candidate will excel in a fast-paced environment, delivering critical support services across time zones to ensure maximum platform availability and performance. This position requires strong diagnostic skills, effective communication with customers during critical incidents, and the ability to drive swift resolution of service-impacting events.
- Troubleshoot complex issues related to VMware vSphere infrastructure, NSX-T networking, vSAN storage, and ExpressRoute connectivity within the AVS environment.
- Participate in on-call rotations, collaborate with cross-functional engineering teams, and continuously improve operational efficiency through automation and process refinement.
- Respond to incident tickets in an operational environment to meet SLA objectives, typically responding to the more complex incidents.
- Troubleshoot system issues using diagnostic tools like netmom, windbg, and custom application tools.
- Review system logs to identify and mitigate system issues.
- Leverage knowledge base to help troubleshoot, identify, and resolve systems issues.
- Update knowledge base troubleshooting guides and lessons learned as required.
- Document incident fixes and make recommendations to the engineering team for system improvements for consideration in future releases.
- Document system issues resulting in system outages and coordinate change through the change management process.
- Support collaboration across operations, development teams, and external partners.
- Support "tiger team" calls to streamline knowledge sharing and timely resolution of system issues.
- Monitor solution performance according to client specifications and SLAs. Serve as an escalation point on more complex issues.
Qualifications
- BS in Computer Science or other technical discipline is preferred.
- 5+ years of experience diagnosing/debugging faults in complex online services.
Requirements
- Hold active DoD Secret security clearance and CJIS adjudication to maintain USME access.
- Ability to identify and script automatable problems, perform work with efficiency in mind.
- Experience with PowerShell, SQL, and Python scripting.
- Experience with VMware vSphere infrastructure management, including vCenter Server, ESXi host troubleshooting, and NSX-T networking components within Azure VMware Solution.
- Proficiency in diagnosing and resolving Azure VMware Solution connectivity issues, including ExpressRoute circuits, HCX migration tools, and vSAN storage performance optimization.
- Able to diagnose and mitigate faults.
- Able to identify and drive recovery levers with feature teams.
- Able to communicate effectively through written and oral English.
- Able to interact with external customers and partners on behalf of Microsoft.
- Ability to perform work under continuous deadline pressure.
- Ability to support various rotating shifts.
- Ability to execute work with precision in time-sensitive outage scenarios.
- Effectively communicate status changes to impact.
Benefits
- Compensation ranges for ASM Research positions vary depending on multiple factors; including but not limited to, location, skill set, level of education, certifications, client requirements, contract-specific affordability, government clearance and investigation level, and years of experience.
- The compensation displayed for this role is a general guideline based on these factors and is unique to each role.
- Monetary compensation is one component of ASM's overall compensation and benefits package for employees.
GeoPlatform Cloud Engineer
SAIC
SAIC is a premier Fortune 500® mission integrator focused on advancing the power of technology and innovation to serve and protect our world. Our robust portfolio of offerings across the defense, space, civilian and intelligence markets includes secure high-end solutions in mission IT, enterprise IT, engineering services and professional services. We integrate emerging technology, rapidly and securely, into mission critical operations that modernize and enable critical national imperatives. We are approximately 24,000 strong; driven by mission, united by purpose, and inspired by opportunities. SAIC is an Equal Opportunity Employer. Headquartered in Reston, Virginia, SAIC has annual revenues of approximately $7.5 billion. For more information, visit saic.com. For ongoing news, please visit our newsroom.
Role Description
We are seeking a talented and motivated GeoPlatform Cloud Engineer to join our team. The successful candidate will support the design, deployment, modernization, and maintenance of cloud-based solutions for the National Geospatial Platform (GeoPlatform) and related geospatial initiatives. The ideal candidate will bring expertise in cloud-native architectures, geospatial services, big data processing, and automation, while providing scalable, secure, and innovative solutions for geospatial data management, analysis, and visualization. As a critical member of the team, the GeoPlatform Cloud Engineer will focus on delivering robust cloud functions and workflows, ensuring the availability, reliability, and security of the platform while enabling cutting-edge geospatial capabilities.
- Design, build, and implement infrastructure and services within cloud platforms (e.g., AWS, Azure, Google Cloud Platform) to enhance GeoPlatform capabilities.
- Evaluate and integrate cloud-native geospatial tools and services to support mission requirements, including GIS systems, data pipelines, APIs, and big data processing.
- Develop and document scalable and resilient geospatial cloud solutions aligned with best practices for process automation, containerization, and resource optimization.
- Collaborate with architects, DevOps engineers, and developers to define system requirements and design technical solutions that align with both operational and customer needs.
- Develop Infrastructure as Code (IaC) templates (e.g., Terraform, CloudFormation, ARM templates) for efficient cloud resource automation.
- Implement CI/CD pipelines to streamline software development lifecycles and automate deployment and monitoring processes.
- Support geospatial data workflows including data ingestion, storage, processing, and visualization using platforms such as ArcGIS, GeoServer, OpenLayers, or PostgreSQL/PostGIS.
- Optimize performance and scalability for geospatial workloads, including vector tiles, raster imagery, and 3D datasets.
- Ensure adherence to cloud governance and security practices, supporting compliance frameworks (e.g., FedRAMP, NIST, FISMA).
- Monitor system performance, incident management, and disaster recovery to guarantee system uptime and security.
Cloud Service Agent, Full-time
Cloudiax
Global Business Cloud provider for SAP B1, Cloud Infrastructure, AI Server & more - made in Germany, available worldwide
- You are the primary point of contact for our partners and customers and ensure the stable operation of their systems.
- You handle incoming support requests by phone and in writing, documenting and processing them via our ticketing system.
- You are responsible for provisioning, administration and the ongoing operation of customer systems.
- You reliably work in a 3-shift schedule (early, late and night shifts).
- You are eager to learn, continuously develop your skills and contribute new ideas for process optimization.
- You organise yourself confidently in a remote environment and work efficiently and independently.
Senior Cloud Operations Engineer, Databases
Pega - Pegasystems, Inc.
Founded in 1983, Pegasystems is a Forbes 100 company and a global business process management (BPO) services organization also known as Pega. Headquartered in Cambridge, Massachuse
Job Category: Engineering & Cloud
Location: Poland - Remote
Meet Our Team:
As a member of Cloud Operations, you will be a key member responsible for the reliability and availability of Pegasystems cloud service offerings. We operate as a global follow-the-sun 24x7 team with locations in Bangalore, Krakow, Sydney, and the East Coast of the United States. We encourage a culture of diversity, openness, intellectual curiosity, and problem solving, and consistently strive to create an environment that provides the support and mentorship needed to learn and grow.
Picture Yourself at Pega:
You will have the opportunity to work on complex problems and apply your expertise and experience to improve the reliability of the Pega Cloud Platform. You will take personal ownership of the systems you manage and possess the tenacity to get to the root of a problem quickly, understand why it happened, and prevent it from recurring. By collaborating and communicating with customers and internal stakeholders, you will deliver best-in-class support. At Pega, in this position, the technologies you will lead with include Pega Cloud hosted on multi-cloud platforms (AWS, GCP), Microservices (Kubernetes, Docker), Databases (PostgreSQL, MongoDB, Cassandra), Migration Tooling (AWS DMS, GCP DMS), GenAI (Azure OpenAI, AWS Bedrock, GCP Gemini), Observability (Grafana), etc.
What You'll Do at Pega:
- Own database reliability, automation, and performance as we grow our platform.
- Handle alerts, incidents, service requests and changes within SLA.
- Own customer escalations.
- Troubleshoot and resolve customer environment issues, along with root cause analysis and blameless post-mortems.
- Use AWS Database Migration Service (AWS DMS) to migrate Pega On-Premise clients to Pega Cloud quickly, securely, and with minimal downtime and zero data loss.
- Influence product teams on defects, feature, and enhancement requests to help build scalable, reliable, observable, available and highly performant services.
- Create and maintain operational runbooks.
- Identify needs and build tools to automate repeated operational tasks and reduce toil.
- Manage multiple projects simultaneously and adapt to changing business goals.
- Participate in after-hours on-call rotation, including weekend shifts.
Who You Are:
- Proven professional and technical experience in an enterprise cloud environment supporting SaaS applications with a focus on operational delivery excellence and customer service.
- You are self-motivated, inquisitive, and creative, with a passion for continuous improvement and excellent people skills.
- Works well with cross-functional global and remote teams.
- Demonstrated ability to learn new technologies, techniques, and tools quickly to meet our business requirements.
- Comfortable working in a fast-paced, enterprise environment.
- Possess customer obsession and proven empathy towards customers.
What You've Accomplished:
You are skilled in Postgres / MongoDB, AWS Cloud, Linux, Middleware and DevOps Technologies, and have accomplished the below:
- 5+ years of operational or engineering experience building and maintaining mission-critical production database systems in a global enterprise.
- 5+ years of experience with enterprise-scale Linux administration.
- 2+ years of hands-on operational experience with Amazon Web Services (AWS). Exposure to additional public clouds such as GCP or Azure, a plus.
- Deep understanding of PostgreSQL server architecture, RDS database internals and their administrative tasks including installations, upgrades, monitoring, backup, and recovery.
- Experience with AWS Database Migration Service (AWS DMS) including analyzing source tables/data to create appropriate DMS tasks for efficient data loads.
- Strong understanding of SQL in the context of driving optimal application performance.
- Experience with both SQL and NoSQL databases such as PostgreSQL, MongoDB, DynamoDB, Cassandra, Elasticsearch, or equivalent.
- Experience configuring database high availability, connection pooling, and load balancing.
- Basic network troubleshooting skills including TCP/IP, DNS, VPN.
- Experience working on Java technologies and understanding Java/JEE fundamentals.
- Administration of web servers running Tomcat, Apache, IIS, Nginx.
- Experience in Python, Bash, or similar scripting languages to automate common tasks.
Education / Certifications:
- Bachelor’s degree in computer science/engineering or equivalent.
- DBA certification in Postgres or MongoDB or Oracle.
- AWS Certification, a plus.
- Pega experience, a plus.
Pega Offers You:
- Gartner Analyst acclaimed technology leadership across our categories of products.
- Continuous learning and development opportunities.
- An innovative, inclusive, agile, flexible, and fun work environment.
- Competitive global benefits program inclusive of pay + bonus incentive, employee equity in the company.
- The world’s most innovative organizations as reference-able clients.
- Analyst acclaimed technology leadership in a massive emerging market.
Additional Information
Base salary range for this role is 197,600 - 295,400 PLN annually. This role may also be eligible for annual bonus OR commission, as well as benefits and other incentives. The final compensation will be determined during the offer process based on the candidate's education, experience, skills, and qualifications, as well as market conditions, and may vary from the posted range. We will share information on benefits, bonus/commission, and other pay components for this role at the relevant recruitment stage.
Job ID: 23314


