Senior DevOps Engineer, Data Platforms
Location
Latin America
Posted
2 days ago
Salary
0
Seniority
Senior
Job Description
phData
- Own and drive end-to-end operational delivery for cloud-native data platform environments (AWS, Azure) across multiple managed services clients, including environment configuration, infrastructure automation, and platform reliability.
- Translate business and technical requirements into resilient, cost-effective platform solutions aligned with phData methodologies, architecture standards, and best practices.
- Build, deploy, and maintain infrastructure-as-code configurations and CI/CD pipelines that support repeatable, governed platform delivery.
- Monitor and support production data jobs and pipelines (ETL/ELT), ensuring timely resolution of failures and minimizing business impact as you develop depth in data platform patterns.
- Ensure engagements are delivered on time, within scope, and with measurable business value for clients.
- Collaborate with Solutions Architects, data engineering teams, analytics teams, and client stakeholders to deliver successful, well-integrated client engagements.
- Provide technical leadership during troubleshooting sessions, infrastructure reviews, and platform deployments, particularly across cloud services on AWS and Azure.
- Ensure high quality in deliverables through clear documentation, deployment guides, runbooks, and adherence to governance and change management processes.
- Partner with practice and account leaders to improve operational maturity, standardize delivery patterns, and enhance client satisfaction across a large user base.
- Contribute to internal initiatives such as building and enhancing IaC templates, automation scripts, CI/CD frameworks, and operational playbooks for Elastic Platform Operations.
- Mentor peers by sharing best practices in cloud engineering, leading knowledge-sharing sessions, and helping up-skill team members on new tools and technologies.
- Represent phData with professionalism in all interactions, communicating clearly with both technical and non-technical stakeholders.
- Act as a trusted advisor to senior client stakeholders on cloud platform reliability, infrastructure strategy, and performance optimization.
- Lead complex infrastructure and platform delivery efforts, coordinating across multiple teams and driving long-term improvements.
- Mentor and coach junior engineers, fostering a culture of learning, feedback, and continuous improvement.
- Help define and refine Elastic Operations standards, reusable IaC assets, and delivery frameworks for managed services.
Job Requirements
- 6+ years of hands-on experience in DevOps, cloud platform engineering, or infrastructure — ideally including client-facing or consulting delivery experience.
- Strong cloud platform experience across AWS and/or Azure, including core infrastructure services (e.g., S3, ADLS, IAM, networking, compute, secrets management).
- Proven hands-on experience with infrastructure-as-code tools (e.g., Terraform, CloudFormation) — including authoring, extending, and maintaining IaC in team environments.
- Solid experience designing and operating CI/CD pipelines and tooling (e.g., GitHub Actions, Bitbucket Pipelines, Azure DevOps) for platform and application deployments.
- Proficiency in Python for scripting, automation, and operational tooling in cloud-native environments.
- Working knowledge of SQL — ability to read, write, and debug queries to support platform troubleshooting and data flow validation.
- Working knowledge of Unix/Linux environments and common system administration and scripting concepts.
- Strong troubleshooting, performance tuning, and root-cause analysis skills across cloud infrastructure and platform services.
- Active daily use of AI coding tools (Cursor, GitHub Copilot, Claude, ChatGPT, or equivalent) with demonstrated judgment about when to trust, verify, and correct AI-generated output.
- Ability to describe specific engineering tasks completed with AI assistance.
- Experience delivering projects for external or internal clients in a professional services, consulting, or managed services environment.
- Ability to break down complex, ambiguous platform challenges into structured, actionable steps and drive them through to completion.
- Strong written and verbal communication skills in English, with the ability to clearly explain technical issues and solutions to non-technical stakeholders.
- Demonstrated ability to work effectively with distributed and cross-functional teams across engineering, sales, and support.
- Proven track record of taking full ownership, managing multiple concurrent priorities, and delivering high-quality work with minimal supervision.
- Openness to learning new technology stacks — including data platform technologies — and helping up-skill other team members.
- Bachelor's degree in Computer Science, Information Technology, Engineering, or a related field — or equivalent practical experience.
Benefits
- Remote-First Work Environment
- Casual, award-winning small-business work environment
- Collaborative culture that prizes autonomy, creativity, and transparency
- Competitive compensation, excellent benefits, and a generous PTO plan plus 10 holidays (and other cool perks)
- Accelerated learning and professional development through advanced training and certifications
More DevOps Engineer Jobs
Senior Network Deployment Engineer – EU Hours
Astreya
IT services that put people at the center of your business
- Design, plan, and coordinate the implementation of network technologies in support of business and growth requirements.
- Validate project requirements, define project scope, develop project schedules, and produce detailed network designs for assigned projects.
- Produce work breakdown structures (WBS) that demonstrate understanding of proposed changes and how they will be implemented with minimal service impact.
- Perform analysis and diagnosis of highly complex networking problems; build simulated networks in test labs to resolve significant issues and compatibility challenges.
- Plan and drive complex network upgrade and migration activity, including highly automated environments and quarterly maintenance events.
- Prepare and maintain up-to-date documentation detailing the configuration of deployed solutions; generate network configurations and run books.
- Provide mentorship and technical leadership to existing network team members and partner teams during outages and downtimes.
- Collaborate with vendors to manage circuit delivery, problem resolution, and network migrations.
Role Description
This role exists to ensure the Hyperstack platform, including Hyperstack GPU Cloud, AI Studio and the Investor Portal, is kept running, automated and observable as it scales. As the DevOps team acts as a bridge across every function in the business, we need a capable engineer who can own automation, incident response, observability and internal tooling without waiting to be directed. This is a role for someone who builds first and documents second: someone who finds a manual process and replaces it, who picks up a production incident and drives it to resolution, and who enjoys the visibility that comes from working across an entire business.
What You'll Be Doing
- Own core DevOps engineering tasks across the Hyperstack platform: automation, incident response, release pipeline support and internal tooling.
- Maintain and improve observability tooling (Prometheus, Grafana and the broader monitoring stack) to ensure platform health and early incident detection.
- Support Kubernetes operations across two contexts: as a managed product sold to customers, and as the underlying infrastructure powering NexGen Cloud’s own platform.
- Act as a first responder for platform incidents alongside the CX team, triaging issues, reviewing code, and confirming whether problems are bugs or expected behaviour.
- Build and improve internal tools consumed by other teams including Revenue Ops, Finance, Engineering and CX.
- Identify and eliminate manual workload through automation and self-service tooling as the business continues to scale.
- Collaborate across a globally distributed, remote-first team and communicate clearly with non-technical stakeholders.
Qualifications
- Hands-on Kubernetes experience in production, both managed/hosted K8s as a product and self-managed clusters.
- Active experience with Prometheus, Grafana and related observability tooling; able to maintain and improve monitoring of a live platform.
- Strong automation and scripting skills: able to build or improve tooling that reduces manual workload across multiple teams.
- Proven incident response experience in live environments; comfortable being a first responder alongside non-technical colleagues.
- Cross-functional mindset: comfortable building tools and processes that serve Engineering, CX, Revenue Ops and Finance without being siloed.
Nice to Have
- Experience in a SaaS, cloud infrastructure or GPU/AI compute environment.
- Familiarity with GitOps workflows and release pipeline tooling.
- Exposure to OpenStack-based infrastructure or GPU cloud environments.
- Experience working in a distributed, remote-first team across multiple time zones.
Benefits
- Competitive salary and annual discretionary bonus scheme.
- Employee wellbeing benefits.
- 25 days of holiday, plus public holidays.
- Fully remote working: no office requirement, no geographic constraint.
- Real ownership and autonomy, with the trust to take initiative and experiment.
- Broad scope: this role touches every team in the business, giving you exposure well beyond a typical DevOps position.
- Greenfield opportunities to improve tooling, automation and observability, not just maintenance.
- Clear career progression and growth opportunities in a fast-growing company.
- A collaborative, international culture built on trust, transparency and ownership.
- The chance to work on a cutting-edge GPU cloud platform used for real AI, ML and HPC workloads, where Kubernetes is central to how the product is built and sold.
Role Description
We are looking for an Engineer to join our DevOps team in Argentina. DevOps is responsible for the R&D platforms, which are based on containers and microservices. Your main focus will be improving the production infrastructure and building the next-level build process, automation methodologies, and tools while working closely with the R&D team.
- Take our CI/CD to the next level
- Develop and debug on top of the Kubernetes infrastructure
- Improve our cloud-native Terraform modules
- Research new technologies and provide overall technical direction for the team
- Plan and push to close the automation backlog
- Communicate with teams across the organization on testing strategy and design
- Propose improvements to the testing process to decrease defects found in production
- Work closely with R&D teams in an Agile environment
- Own SRE responsibilities for our AWS and GCP environments, including ensuring the availability, scalability, and reliability of production services
- Manage and evolve infrastructure as code using env0 for Terraform workflows
- Design and implement monitoring, metrics, and dashboards using Datadog to ensure observability and proactive incident management
Qualifications
- 5-7 years of experience as a DevOps Engineer
- Must have hands-on experience in Kubernetes (EKS) and Docker
- Experience in Python/Bash/Groovy
- Solid grasp of the software development life cycle, best practices, and methods across multiple teams: Agile development, DevOps, CI/CD, Kubernetes, team leadership, and mainly test automation
- Experience with at least one of Jenkins, GitLab CI, or GitHub Actions
- Experience with Jira, Confluence, Git, AWS, Docker, and relational and non-relational DBs
- Experience working with AWS and GCP cloud platforms in a production environment
- Hands-on experience with Terraform (preferably managed via env0 or similar tooling)
- Experience with Datadog for building monitoring solutions, metrics, and dashboards
- Excellent communication skills are required for documenting and connecting with other groups
- Ability to explore and initiate new ideas, think, and learn independently
Advantages
- Experience in REST API automation and testing methodologies
Our Values
- Care: We care about our customers and each other
- Do: We do what it takes to make a positive impact
- Try: We try our best and we don’t give up
- Shine: We shine and make it our mission to always stand out
Role Description
We are seeking an experienced Tactical Mission Network Site Lead to serve as the senior on-the-ground representative supporting U.S. Army Special Operations Command (USASOC) Tactical Mission Network (TMN) operations in Eastern Europe (EE). This individual will function as the primary point of contact between the contractor program team, the host nation environment, and USASOC supported units, providing both technical oversight and operational coordination to ensure TMN capabilities are effectively deployed, sustained, and integrated at the forward site.
The EE Site Lead must be equally comfortable managing complex technical environments and navigating the operational demands of a Special Operations forward presence. The ideal candidate is a seasoned professional with prior military or SOF-adjacent experience, a strong understanding of TMN or similar tactical network architectures, and the maturity and judgment to operate with significant autonomy in a sensitive, strategically consequential OCONUS environment. Familiarity with the European theatre, EUCOM/SOCEUR priorities, and the unique political-military dynamics of EE is a distinct advantage.
This position is Remote/Eastern Europe. This position requires an active DoD Top Secret clearance, which requires U.S. citizenship for work on DoD contracts.
Application Deadline: June 29, 2026
Qualifications
- Bachelor's degree in a relevant field (Information Technology, Systems Engineering, Business, or related discipline); equivalent military education and experience will be considered in lieu of degree.
- 7+ years of combined experience in technical operations, network systems, and/or military operational support roles.
- Prior U.S. military service, with demonstrated experience operating in or supporting OCONUS environments; SOF experience strongly preferred.
- Proven leadership experience managing teams in high-pressure, resource-constrained, or austere operational environments.
- Technical proficiency with communications systems, tactical networks, or C2 infrastructure relevant to Special Operations.
- Strong understanding of military operational planning processes, unit structures, and the demands of a forward-deployed customer.
- Excellent interpersonal and cross-cultural communication skills with the ability to operate effectively alongside partner nation personnel and diverse multinational stakeholders.
- Demonstrated ability to operate with significant autonomy, exercise sound judgment, and make time-sensitive decisions with limited oversight.
- Active TS required (TS/SCI preferred).
- Must meet all medical, physical, and administrative requirements for OCONUS deployment to the assigned location.
Requirements
- Serve as the senior contractor representative at the OCONUS site, providing day-to-day leadership, direction, and accountability for all contractor personnel assigned to the location.
- Act as the primary liaison between on-site contractor staff, USASOC supported units, and the CONUS-based program management team.
- Maintain situational awareness of all site activities and provide regular status updates, trip reports, and operational summaries to program leadership.
- Manage site schedules, task prioritization, and personnel assignments to ensure mission requirements are met with available resources.
- Enforce compliance with all applicable contractor policies, DoD regulations, Status of Forces Agreements (SOFA), and host nation requirements.
- Oversee the installation, configuration, operation, and maintenance of TMN systems and associated infrastructure at the forward site.
- Serve as the senior technical authority on-site for network architecture, systems integration, and TMN capability delivery.
- Troubleshoot and resolve complex technical issues across communications, networking, and data systems, escalating to CONUS engineering support as needed.
- Ensure TMN systems are maintained at operational readiness standards and that all planned and unplanned maintenance activities are documented and tracked.
- Coordinate with supported unit S6/J6 and communications staff to align TMN capabilities with unit requirements and operational schedules.
- Provide direct technical and analytical support to USASOC Special Operations units at the site, ensuring TMN tools and data products are accessible, functional, and operationally relevant.
- Support the planning and execution of exercises, operations, and training events that rely on TMN infrastructure and data capabilities.
- Anticipate and respond to the evolving operational needs of the supported unit, proactively identifying capability gaps and recommending solutions to the program team.
- Coordinate logistics, equipment staging, and site readiness activities in support of unit deployments, rotations, and mission cycles.
- Provide mentorship, performance guidance, and day-to-day leadership to all contractor personnel assigned under the site lead's authority.
- Monitor the health, welfare, safety, and morale of on-site contractor staff operating in a potentially austere or high-risk OCONUS environment.
- Coordinate with the program HR and security teams on personnel matters including travel, medical support, emergency procedures, and force protection protocols.
- Conduct or facilitate performance evaluations, identify training needs, and support professional development of site personnel.
- Serve as the on-site point of contact for physical security, information security (INFOSEC), and operational security (OPSEC) matters.
- Ensure all contractor personnel adhere to site-specific security protocols, classification handling procedures, and applicable USASOC and DoD security directives.
- Coordinate with the Facility Security Officer (FSO) and site security personnel to maintain compliance with TS/SCI access and handling requirements.
- Report security incidents, anomalies, or concerns through proper channels in a timely manner.
- Build and maintain strong working relationships with USASOC unit leadership, host nation counterparts, interagency partners, and other contractor entities operating at or near the site.
- Represent the contractor program professionally in all interactions with military customers, partner nation personnel, and government stakeholders.
- Prepare and deliver briefings, status reports, and after-action reviews (AARs) to both contractor leadership and military customers.
- Facilitate the resolution of cross-organizational issues, resource conflicts, or capability shortfalls through collaborative engagement with all relevant stakeholders.
- Oversee the accountability, maintenance, and serviceability of all government-furnished equipment (GFE) and contractor-owned assets at the site.
- Manage site-level supply chain and logistics requirements, coordinating with CONUS program support staff for procurement, shipping, and equipment replenishment.
- Track and report site expenditures, resource utilization, and equipment status in accordance with program reporting requirements.
- Coordinate with host nation and U.S. military logistics elements to ensure continuity of site operations.
Benefits
- Health insurance
- Paid leave
- Retirement
Company Description
At SMX®, we are a team of technical and domain experts dedicated to enabling your mission. From priority national security initiatives for the DoD to highly assured and compliant solutions for healthcare, we understand that digital transformation is key to your future success. We share your vision for the future and strive to accelerate your impact on the world. We bring both cutting edge technology and an expansive view of what’s possible to every engagement. Our delivery model and unique approaches harness our deep technical and domain knowledge, providing forward-looking insights and practical solutions to power secure mission acceleration.
SMX is an Equal Opportunity employer including disabilities and veterans.




