Job Closed
This listing is no longer active.
Senior Infrastructure & DevOps Engineer
Location
United States
Posted
111 days ago
Salary
0
No structured requirement data.
Job Description
Senior Infrastructure & DevOps Engineer
Physics Inverted Materials, Inc.
This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description To build and maintain the automated production line for PHIN's Physical Superintelligence. You will own the plumbing that allows our simulation engine to seamlessly scale, ensuring that our team can deploy updates multiple times a day and ingest massive amounts of simulation data without friction. - Greenfield Observability: Architect and implement a comprehensive logging, monitoring, and alerting stack across our platform from the ground up. - Compute Architecture, Scaling & FinOps: Provision, manage, and optimize highly concurrent scaling clusters. Act as a cloud-agnostic thinker to direct future architecture and implement rigorous FinOps practices to minimize the cost of running thousands of simultaneous jobs. - Infrastructure as Code (IaC): Own, maintain, and expand our Terraform footprint. - Continuous Deployment (CD): Design and maintain high-velocity CI/CD pipelines supporting multiple deployments per day. Ensure "code to production" is a seamless, automated journey. - Backend Robustness: Manage the API layer that sits between the infrastructure and the application layer. Read and refactor services to optimize data movement, squash bottlenecks, and maintain security. - Data Pipeline Architecture: Build the underlying pipelines to move, store, and process the massive datasets generated by atomic-scale simulations. - Platform DevEx & MLOps: Build self-serve tooling and event-driven pipelines that empower the entire organization. Create seamless abstractions so our developers can focus on what they do best. - DevOps & Intelligence Automation: Ruthlessly automate manual toil. Use and build AI-driven tools to manage logs, infrastructure provisioning, and business workflows. - Standard Enterprise Security: Implement and maintain security best practices (SOC2/ISO focus) required for enterprise-grade contracts. Qualifications - 5–8 years as a high-output Individual Contributor in Infrastructure or Backend roles. - Comfortable touching any part of the system—from networking and security to API design and data engineering. - Familiarity with Python and TypeScript/Node.js. - Deep experience with major cloud providers. - Familiarity with high-performance computing (HPC) schedulers like Slurm is a major plus. - Not married to one framework; you choose the best tool for the job (K8s, Serverless, HPC Schedulers, etc.). - Expert user of intelligence tools (Claude, Cursor, Codex, Copilot, Agents, etc.) to 10x your own productivity and automate business tasks. - Previous experience working closely with machine learning teams, supporting ML workflows, or building MLOps pipelines is highly desirable.
Job Requirements
- 5–8 years as a high-output Individual Contributor in Infrastructure or Backend roles.
- Comfortable touching any part of the system—from networking and security to API design and data engineering.
- Familiarity with Python and TypeScript/Node.js.
- Deep experience with major cloud providers.
- Familiarity with high-performance computing (HPC) schedulers like Slurm is a major plus.
- Not married to one framework; you choose the best tool for the job (K8s, Serverless, HPC Schedulers, etc.).
- Expert user of intelligence tools (Claude, Cursor, Codex, Copilot, Agents, etc.) to 10x your own productivity and automate business tasks.
- Previous experience working closely with machine learning teams, supporting ML workflows, or building MLOps pipelines is highly desirable.
Related Guides
Related Categories
Related Job Pages
More DevOps Engineer Jobs
Staff Engineer – DevOps
Weekday (YC W21)We are a Y-Combinator-backed startup building your AI-powered Recruiter Agent
• Architect and evolve our DevOps ecosystem, champion cloud cost governance, and implement best-in-class container orchestration practices. • Work cross-functionally with engineering, security, and finance teams to ensure operational excellence while proactively managing infrastructure spend. • Lead end-to-end DevOps strategy, including CI/CD pipelines, automation, infrastructure-as-code, and release engineering. • Design scalable, resilient cloud-native architectures aligned with business growth. • Establish DevOps best practices, reliability standards, and operational governance. • Architect and manage large-scale Kubernetes environments for production workloads. • Optimize workloads across clusters for performance, reliability, and cost efficiency. • Build and maintain containerized applications using Docker and Kubernetes, ensuring portability and scalability. • Drive multi-cluster, multi-region deployments where necessary. • Own infrastructure cost visibility and optimization initiatives. • Implement cloud cost-saving strategies including rightsizing, reserved capacity planning, auto-scaling optimization, and workload scheduling. • Create dashboards and reporting mechanisms to track infrastructure ROI and spend trends. • Continuously identify inefficiencies and implement measurable cost-reduction initiatives without compromising performance. • Design and implement comprehensive monitoring systems using Grafana and related observability tools. • Build real-time dashboards for system health, performance metrics, and cost insights. • Establish alerting frameworks to minimize downtime and improve incident response. • Drive improvements in system reliability through data-driven monitoring and post-incident analysis. • Automate provisioning, deployments, scaling, and recovery processes. • Improve system resilience, availability, and disaster recovery strategies.
Principal DevOps Engineer
LegalMatchAttorneys: Get the Legal Clients You Need. Call 866.953.4259 to View Cases.
• Designing and maintaining scalable, secure, and cost-efficient cloud infrastructure. • Automating infrastructure provisioning, deployment pipelines, and monitoring systems to improve reliability and efficiency. • Leading DevOps strategy by collaborating with teams to identify and address challenges in workflows and processes. • Driving system security by implementing best practices and ensuring compliance with industry standards. • Mentoring and guiding DevOps engineers to foster a culture of problem-solving and continuous improvement. • Acting as the technical lead for resolving complex issues and ensuring high system availability. • Optimizing cloud performance and costs while mitigating risks to enhance operational efficiency. • Staying current with emerging DevOps technologies and incorporating innovative solutions into the infrastructure. • Building fault-tolerant systems and tools that empower developers to deploy securely and efficiently. • Supporting 24/7 operations by participating in on-call rotations and ensuring quick issue resolution.
DevOps Engineer, Data & Analytics
Mondelēz InternationalWe’re a house of incredible brands providing people with the right snack, for the right moment, made the right way.
• You plan, develop and execute capital projects by supporting technical developments, feasibility perspectives of engineering-related activities in supply chain and capital expense project execution to support growth, world-class manufacturing and productivity with the highest levels of quality, safety and environmental requirements • You will be accountable for the quality and results of the capital projects per Mondelēz standards, and our business and innovation processes in project management. • You will work with key stakeholders to define and deliver the capital and technical agendas during the development phases of capital investment projects • You will develop capital budgets according to the contract and forecast cash flow, ensure that engineering developments and standards are implemented, and support the development and implementation of state-of-the art processes and equipment strategies to optimize resources, harmonize assets and rollout best practices. • You will own, lead, and drive systems engineering, software delivery automation and CI/CD practices across all digital platforms. • You will manage tool and vendor selection across development automation activities to ensure that tooling is fit for purpose for an enterprise-grade software deployment automation & DevOps platform.
Domain & Release Manager
Koniag Government Services, LLCKoniag Government Services (KGS) is an Alaska Native Owned corporation supporting the values and traditions of our native communities through an agile employee and corporate culture that delivers Enterprise Solutions, Professional Services and Operational Management to Federal Government Agencies.
Koniag IT Systems, LLC, a Koniag Government Services company, is seeking a Domain & Release Manager to support KITS and our government customer. The position is remote. We offer competitive compensation and an extraordinary benefits package including health, dental and vision insurance, 401K with company matching, flexible spending accounts, paid holidays, three weeks paid time off, and more. The Domain & Release Manager and Systems Engineer I is responsible for providing technical support and recommendations on annual, quarterly, and Ad Hoc Production Releases and release operational support strategies. The Domain & Release Manager will support Engineering staff in activities to author and maintain standard operating processes to maintain complex health care information systems essential to the functioning of IHS. The Domain & Release Manager will support Engineering staff to diagnose and resolve technical issues quickly, efficiently, and ensures that systems operate at optimal performance levels before and after scheduled Releases. General requirements: - Review and coordination of deployment and engineering artifacts. Provide recommendations. - Review configuration and technical management processes for consistency with recognized industry practice and IHS requirements. Specific requirements: - Architecture control - Employ industry best practices and guidelines to manage and produce architectural artifacts. Leverage existing IHS architecture artifacts where possible. - Help maintain and extend the IHS Health IT Modernization Enterprise Architecture. - Interfaces - Review custom EHR interfaces proposed by IHS vendors. Assure industry standards-based interfaces are leveraged. - Review and evaluate EHR vendor libraries. Assure interfaces use common standard ontologies, data formats, and data models so that data is interoperable across IHS programs and with other external systems. - Review interfaces requested by functional and business SMEs/councils to assure they adhere to the IHS program governance process. Support the Interface Working group and their analysis of newly proposed EHR system interfaces. - Data Management - Contribute to the Health IT Modernization Data Management Strategy and IHS program requirements, as required, for creating, managing, and delivering data across the Program and with external partners. - Systems engineering technical reviews (SETRs) - Participate in SETRs milestone reviews such as Preliminary Design Review (PDR) and Critical Design Review (CDR). Other reviews include IDR/FRR initial design review and final design review, IPR/Preliminary TRR In-Process Review/Preliminary Test Readiness Review, FDR/TRR Final Design Review/Test Readiness Review, IPR/Preliminary OTRR In-Process Review/Preliminary Operational Test Readiness Review, and ORR Operational Readiness Review. - Participate in enterprise performance life cycle reviews as needed for requirements analysis, system design, system development, and testing. - Participate in enterprise reviews for EHR system interoperability with focus on the Health Information Exchange Gateway. - Participate in reviews of infrastructure upgrades to support new system implementations and cybersecurity services. Engineering Tools and Processes: - Have experience with systems engineering technical management processes such as technical planning, requirements management, configuration management, technical assessment, decision analysis, technical risk management, interface management, and data management. - Have a working knowledge and familiarity with system engineering tools such as Atlassian Confluence, Atlassian Jira, HP ALM, and MS Project. - Engineering systems also include the MITRE Partner Network and the IBM Jazz Rational System Architect suite of products. - Have familiarity with other system engineering tools used in areas such as performance and capacity testing, change management for code repositories, test instrumentation and monitoring, security scanning, system emulation, system/application monitoring, user monitoring/application response measurement, availability measurement, and continuous monitoring. Work Experience, Knowledge, Skills & Abilities: - The ideal candidate has experience working with Healthcare Systems Engineering and/or Health Information Management in the federal government. - Strong written and verbal communications skills - Ability to present technical details to a non-technical audience (briefing). - Ability to facilitate reoccurring technical/non-technical meetings and working groups. - Have a technical background in implementing systems and tools. Microsoft Azure related systems, tools, and services are preferred. - Have experience performing system upgrades, managing backup and recovery, server monitoring and capacity planning, conducting version management, etc. - Have familiarity developing and reviewing system security standards. - BS in Computer Science or Information Systems related field. - 5+ minimum years’ leading projects that include software development and/or operations systems implementation including cloud implementations. - Microsoft Azure certification(s) are a strong plus. Security Requirement: - This position requires National Agency Check (NACI) background clearance. - A background investigation will be conducted to authorize basic public trust. Our Equal Employment Opportunity Policy The company is an equal opportunity employer. The company shall not discriminate against any employee or applicant because of race, color, religion, creed, ethnicity, sex, sexual orientation, gender or gender identity (except where gender is a bona fide occupational qualification), national origin or ancestry, age, disability, citizenship, military/veteran status, marital status, genetic information or any other characteristic protected by applicable federal, state, or local law. We are committed to equal employment opportunity in all decisions related to employment, promotion, wages, benefits, and all other privileges, terms, and conditions of employment. The company is dedicated to seeking all qualified applicants. If you require an accommodation to navigate or apply for a position on our website, please get in touch with Heaven Wood via e-mail at accommodations@koniag-gs.com or by calling 703-488-9377 to request accommodations. Koniag Government Services (KGS) is an Alaska Native Owned corporation supporting the values and traditions of our native communities through an agile employee and corporate culture that delivers Enterprise Solutions, Professional Services and Operational Management to Federal Government Agencies. As a wholly owned subsidiary of Koniag, we apply our proven commercial solutions to a deep knowledge of Defense and Civilian missions to provide forward leaning technical, professional, and operational solutions. KGS enables successful mission outcomes for our customers through solution-oriented business partnerships and a commitment to exceptional service delivery. We ensure long-term success with a continuous improvement approach while balancing the collective interests of our customers, employees, and native communities. For more information, please visit www.koniag-gs.com. Equal Opportunity Employer/Veterans/Disabled. Shareholder Preference in accordance with Public Law 88-352


