Job Closed
This listing is no longer active.
Senior Software Engineer – DGX Cloud Services and Software
Location
California + 3 moreAll locations: California | Massachusetts | Texas | Washington
Posted
21 days ago
Salary
$168K - $270.3K / year
Seniority
Senior
Job Description
Senior Software Engineer – DGX Cloud Services and Software
NVIDIA
• Work with NVIDIA internal customers • Design and build scalable software systems to manage NVIDIA’s cloud infrastructure. • Participate in responses to real-time operational events • Building network and systems automation software for managing a multi-tenant cloud infrastructure • Participate in open-source communities of software we leverage and build. • Present to internal stakeholders and NVIDIA leadership on roadmaps, vision, & demos
Job Requirements
- 8+ years of experience with designing and building distributed software systems.
- Track record of directly supporting systems with external customers, or demanding internal customers
- BS/MS degree in Computer science or related areas (or equivalent experience)
- Demonstrated ability to write code in a mainstream systems programming language such as C, C++, Golang, or Rust.
- Demonstrated ability to design and implement maintainable APIs for consumers.
- Practical experience with asynchronous programming, type safety, threading models, state machines and data structures.
- Background of data persistence (SQL or similar).
- Understanding of secure communication protocols (mutual-TLS, IPsec, or similar).
- Knowledge of SRE principles (observability, SLOs, logging, etc.)
Benefits
- competitive salaries
- generous benefits package
- equity
Related Guides
Related Categories
Related Job Pages
More Cloud Engineer Jobs
Cloud Native Java Developer
Railroad19Partnering With You For Your Custom Software and Cloud Platform Needs
• Understand our client's fast-moving business requirements. • Negotiate appropriate solutions with multiple stakeholders. • Write and maintain scalable enterprise-quality software. • Build web applications using Spring Boot. • Build Microservices that connect to Oracle and NoSQL databases. • Build software components that integrate with a workflow engine and/or ESB to execute asynchronous business processes. • Manage the complete software development life cycle. • Writing functional and unit tests to maintain code quality. • Work with Jenkins to perform continuous integration. • Collaborate with other teams to deliver a highly performant application that contains few or no defects. • Identify new opportunities, tools, and services to enhance the custom software platform. • Support and troubleshoot issues (process & system), identify root cause, and proactively recommend sustainable corrective actions.
• Lead and provide technical guidance to a global team of 6 cloud engineers, cultivating a culture of proactive continuous improvement leveraging automation and AI. • Drive Developer Experience by achieving feature parity with GCP; transition manual Azure processes to automated workflows leveraging project Chassis framework and AI. • Lead day-to-day operations for a global platform team, managing a 'follow-the-sun' support model across North America (NA) and Asia-Pacific (AP) regions. • Oversee incident response and escalation workflows, ensuring the team meets or exceeds MTT(x) goals. • Define and maintain the Azure/Dynamics platform strategy and roadmap within Jira; ensure strict adherence to Jira practices and PI planning. • Conduct financial forecasting and reviews for Azure and Dynamics; manage Microsoft licensing/pricing (PAYG/RI), cost calculators, and cost alerts. • Improve platform security by remediating NIST violations and ensuring standards are enforced by deploying custom Azure policies to block non-compliant infrastructure. • Upskill the team in Site Reliability Engineering (SRE) and DevOps principles; proactively encourage the integration of Copilot and AI/Agentic solutions. • Assist customers with onboarding and answering technical questions related to the Dynamics platform. • Define and document back up strategies for Azure hosted solutions.
Principal Cloud Support Engineer
Wasabi TechnologiesWasabi Technologies is a Massachusetts-based software organization specializing in affordable, high-performance, and secure cloud storage solutions, aiming to offer single-tier, hi
At Wasabi, we’re a proven collection of pioneers, visionaries and disruptive doers. We see things differently than our competitors, and we make our mark in the industry by challenging the norm and delivering the unexpected and improbable. We’re a fast-growing company taking the Cloud Storage industry by storm and recognized as one of the best places to work in Boston. Wasabi hot cloud storage is a new class and category of cloud storage, breaking all traditional barriers and boundaries of storage with a disruptive value proposition of being 1/5th the cost of AWS S3, faster than the competition, with no fees for egress or API requests. Cloud storage has never been so simple, so fast and so inexpensive. It’s all part of our vision to make cloud storage the next great global utility, just like electricity. Role Description: Principal Cloud Support Engineer Role Purpose: Wasabi’s Principal Cloud Support Engineer will serve as the highest-level technical resource within the Support team. This role extends beyond advanced troubleshooting into technical leadership, managing escalations, owning internal training programs, and serving as a trusted technical advisor for enterprise customers and partners. The Principal Cloud Support Engineer will leverage deep expertise in cloud storage, networking, and distributed systems to drive operational excellence, improve support readiness, and strengthen customer relationships. Your success will be measurable and highly visible across customer satisfaction, team capability, and service quality. On a typical day, Wasabi principal cloud support engineers may lead escalations, design and deliver technical training, advise enterprise customers on best practices, investigate complex protocol-level issues, collaborate closely with Engineering, or drive initiatives that enhance the scalability of Wasabi’s Technical Support operations. Wasabi is based in Boston MA, but we are open to remote candidates based elsewhere in the United States. Travel is not regularly required. *Principals only. No recruiters. Responsibilities: - Act as the highest escalation point for the Technical/Cloud Support team for complex customer issues. - Lead root cause investigations and collaborate with Engineering to resolve protocol, performance, or interoperability challenges. - Own and deliver the technical training program for Support, including onboarding, advanced troubleshooting techniques, and documentation. - Provide technical advisory and white-glove support to Wasabi’s strategic enterprise customers and partners. - Oversee escalation workflows and ensure consistent handling of high-severity customer incidents. - Engage with Wasabi Sales, Product, and Development engineers worldwide to improve Wasabi’s services and customer experience. - Mentor and coach Technical/Cloud Support Engineers and Senior Technical/Cloud Support Engineers to elevate team capabilities. - Drive improvements in supportability, diagnostic tooling, automation opportunities, and internal processes. - Create and maintain advanced troubleshooting guides, KB articles, and internal knowledge documentation. Requirements: - Must have expert-level hands-on experience with AWS S3 Cloud Storage or compatible object storage. - Must be AWS Solutions Architect Associate Certified; Professional certification preferred. - Must have Bachelor of Science degree in Network/System/Computer Science, Master's degree preferred - 10+ years of relevant work experience. - 10+ years of experience in technical support, network operations, or cloud infrastructure roles. - Extensive experience supporting mission-critical systems that operate 24x7x365. - Deep understanding of backup software, applications using APIs & SDKs, and object storage integrations. - Strong knowledge of Linux and networking protocols such as HTTP, TCP/IP, DNS, TLS. - Demonstrated experience leading escalations, mentoring others, and interfacing with enterprise-level customers. Base Salary – $101,440 – $152,160 The base salary range reflects the full range for this position at Wasabi Technologies. At Wasabi, we believe in paying fairly and competitively, and individual compensation is determined based on factors such as job-related experience, skills, location, and internal equity. Base pay is just one part of our total rewards approach. Depending on role eligibility, Wasabi team members may also receive additional compensation and benefits, including bonus or variable compensation, equity, and a comprehensive benefits package designed to support both professional growth and personal well-being. Wasabi Technologies is an Equal Opportunity Employer. We prohibit discrimination and harassment of any kind based on race, color, sex, religion, sexual orientation, national origin, disability, genetic information, pregnancy, or any other protected characteristic as outlined by federal, state, or local laws.
GCP Cloud Engineer - Management and Operations
ResultantResultant is a business consulting and services company that specializes in data and technology consulting. As an employer, the company is known for its growing and collaborative w
Role Description Resultant is building a large health information exchange (HIE) platform for a state government agency that integrates across multiple agencies, external partners, and identity providers. We are seeking an experienced Google Cloud Platform Engineer to stabilize, support and operate the GCP infrastructure in the long term as the initial buildout comes to close. The position requires an experienced, hands-on GCP cloud engineer capable of operating with a high degree of autonomy in a risk-averse, regulated environment where platform reliability, security, and data confidentiality are critical. You will work with Google Cloud Platform infrastructure, Terraform-based deployments, and Azure DevOps CI/CD pipelines, helping deliver a scalable, compliant, and operationally stable platform aligned with State standards and healthcare data requirements. This is a remote role; however, preference will be given to candidates based in the Indianapolis, IN, Dallas, TX, or Chicago, IL areas. Key Responsibilities - Platform Stabilization & Operations - Environment Manager: responsible and accountable for overall health, availability, performance, security, cost and day-to-day operations of the GCP platform and toolset. - Lead efforts to stabilize and optimize the HIE platform, addressing production issues, technical debt, and deployment inconsistencies. - Lead or participate in incident response, monitoring, operational support, governance, root cause analysis, security and vulnerability remediations, and resolution of platform issues. - Maintain technical documentation, runbooks, deployment standards, security artifacts. - Engage with partner teams in a collaborative delivery model. - CI/CD & DevOps (Azure DevOps) - Partner with architecture, security, and data teams to align platform architecture with project deliverables and contractual, regulatory, and operational expectations. - Build and maintain Azure DevOps pipelines for infrastructure and application deployment. - Support migration and alignment of existing GitHub-based pipelines into Azure DevOps. - Implement and maintain repeatable deployment frameworks for containerized platform and application services. - Support application teams with environment configuration and release processes. - Improve deployment reliability and rollback strategies for production releases. - Cloud Infrastructure Engineering (GCP) - Design, implement, maintain, operate GCP infrastructure across DEV, QA, STAGE, PROD etc. - Manage Infrastructure-as-Code (IaC) using Terraform, aligned with IT-provided baseline services (VPCs, DNS, service accounts), controls and standards. - Implement and maintain secure, scalable cloud patterns including networking, IAM, and service integrations. - Deploy and manage containerized workloads (e.g., Kubernetes/GKE or equivalent). - Healthcare & Data Integration Context - Support integrations across healthcare, public health, and social services systems. - Work with data ingestion pipelines, secure file transfer (e.g., SFTP/MOVEit), APIs, and new partner onboarding. - Align platform with client’s interoperability patterns and data governance expectations. - Identity, Access, & Security - Implement and manage federated identity patterns (BYOC) across multiple Identity Providers (e.g., State systems, State SSO, external IdPs). - Support role-based access controls and auditability aligned with data access policies. - Collaborate with security stakeholders to support ATO readiness and compliance audits. - Establish, improve and operate industry-standard tools, metrics procedures and governance around security and vulnerability management. - Collaboration, Leadership, & Engagement - This role operates within a multi-stakeholder environment and requires direct partnering with State agencies, External healthcare and social services partners, Internal engineering, data, security and audit. - The position requires a collaborative, delivery-focused mindset, with the ability to navigate ambiguity and provide structured, actionable solutions. Qualifications - Bachelor’s Degree or equivalent experience in Computer Science, Information Technology, Engineering, or a related field. - 5+ years of extensive, hands-on experience in cloud engineering or DevOps roles primarily on GCP in large mission-critical production environments. - Deep expertise in managing Terraform-based infrastructure deployments. - Strong experience with Azure DevOps (Repos, Pipelines, CI/CD). - Experience with containerized platforms (Kubernetes, Docker) including container observability. - Proficiency in scripting (PowerShell, Bash, Python). - Experience with Airflow/Cloud Composer especially for orchestrating data pipelines. - Experience with Git-based workflows and release management. - Strong troubleshooting skills in complex, integrated environments. - Google Associate Cloud Engineer certification or higher. - Willing to travel as needed for business needs (approximately 10%). - Must be legally authorized to work in the United States for any employer without sponsorship. Domain Experience Required - Experience with Healthcare IT, HIE, or SHIE platforms with healthcare and Health Information Exchange (HIE) or Social Health Information Exchange (SHIE) experience. - Familiarity with HIPAA, PII, and data governance requirements. - Experience integrating with external partners, state agencies, or regulated systems. Preferred/Desirable Qualifications - Experience with state government or public sector environments. - Experience in operations of critical SLA-bound environments with ongoing project releases. - Familiarity with Identity-Aware Proxy (IAP), federated identity, or OAuth/OIDC patterns. - Experience with data platforms (BigQuery, data lakes, data mesh architectures). - Exposure to Security Command Center, monitoring, and compliance tooling. - Experience in Azure cloud environment is highly desirable, including operating and managing large production business applications, analytic solutions, data lakes, lakehouses and mesh, big-data platforms and Ai and ML workloads. Azure Databricks experience is highly valued. - Desirable certifications: Google Professional Cloud Architect, Google Professional Cloud DevOps Engineer, Google Professional Security Operations engineer, Other Terraform, Kubernetes, or equivalent including Certified Kubernetes Administrator, Equivalent Azure certifications. Additional Information At Resultant, we are driven by purpose—partnering with clients to take on their toughest challenges and creating outcomes that make a real difference. Success here means lasting impact, not just delivered projects. We work as collaborative experts, leaning into complexity with confidence and humility, asking smart questions, sharing ideas freely, and combining our diverse expertise to turn challenges into clear, transformative solutions. Resultant offers a flexible, high-trust environment, paired with a shared commitment to accountability: - Take ownership of your work from start to finish and deliver on your commitments to clients and coworkers. - Communicate proactively, especially when priorities shift. - Focus on outcomes, follow-through, and measurable impact. - Show up where it matters, in person or virtually, when it strengthens relationships or results. - Do the small things brilliantly: respond timely, stay organized, and follow through to build trust. You may thrive here if you bring curiosity and a consulting mindset to every challenge, take ownership of your growth, and give and receive feedback with intention. If you're energized by bold ideas, continuous learning, and investing in the people around you, this is your place. We embrace AI across everything we do, and that includes how you prepare for this interview. Feel free to use AI tools to research the role, practice your responses, and put your best foot forward. What matters to us is getting to know the real you — how you think, how you communicate, and what you genuinely bring to the table.


