Working at Modular will enable you to grow quickly as you work alongside incredibly motivated and talented people who have high standards, possess a growth mindset, and a purpose to truly change the world. The estimated base salary range for this role to be performed in the US, regardless of the state, is $198,000.00 - $286,000.00 USD. The estimated base salary range for this role to be performed in Canada, regardless of the province, is $194,000.00 - $280,000.00 CAD. The salary for the successful applicant will depend on a variety of permissible, non-discriminatory job-related factors, which include but are not limited to education, training, work experience, business needs, or market demands. This range may be modified in the future. The total compensation for a candidate will also include annual target bonus, equity, and benefits, with equity making up a significant portion of your total compensation. For candidates who fall outside of the listed requirements, we nevertheless encourage you to apply as we may have openings that are lower/higher level than the ones advertised.
Cloud Inference Engineer
Location
United States | Canada
Posted
2 days ago
Salary
$166.5K - $273K / year
Seniority
Mid Level
Job Description
Modular
Role Description

In the Cloud Inference team, we are focused on building end-to-end distributed LLM inference deployments that are fully vertically integrated with the MAX stack. Our goal is to make inference both the fastest and the most scalable, while also building the easiest platform for deploying and scaling models for enterprises and developers alike. We're seeking engineers who are passionate about pushing the boundaries of distributed inference systems and enjoy working at the intersection of large-scale systems and machine learning. We are looking for candidates based on their breadth and depth of experience in backend engineering, AI inference, and distributed systems development. If this sounds exciting, we invite you to join our world-leading AI infrastructure team and help drive our industry forward!

What you will do:
- Build and ship an LLM-focused inference platform using best-in-class inference techniques (disaggregated inference, multi-node deployment of large models, high-performance networking, distributed KV-cache management, high-throughput batch processing, etc.).
- Push the envelope for operational excellence with request-to-kernel observability, multi-cloud deployments, clever autoscaling, cold-start optimizations, and more.
- Collaborate with our kernels and genAI teams to achieve SOTA application performance by integrating SOTA kernel and serving optimizations with SOTA cluster optimizations.
- Build Helm charts, Kubernetes operators, and more to create simple, effective, maintainable deployments.

Qualifications
- 5+ years of experience working in backend engineering.
- Experience with Kubernetes and operating your own services.
- Ability to create durable, reusable software tools and libraries that are leveraged across teams and functions.
- Experience in machine learning technologies and use cases.
- Creativity and curiosity for solving complex problems, a team-oriented attitude that enables you to work well with others, and alignment with our culture.
- Strong identification with our core company cultural values.

Helpful but not required:
- Experience with high-performance computing / networking.
- Experience working on high-scale ML inference infrastructure (traditional AI or genAI).
- Familiarity with Go.

Benefits
- Amazing Team: We are a progressive and agile team with some of the industry’s best engineering and product leaders.
- World-class Benefits: Premier insurance plans, up to 5% 401k matching, flexible paid time off, and more are available to you!
- Competitive Compensation: We offer very strong compensation packages, including stock options.
- Team Building Events: We organize regular team onsites and local meetups in Los Altos, CA as well as different cities. Traveling 2-4 times a year is expected for all roles.

Salary Information

The estimated base salary range for this role to be performed in the US, regardless of the state, is $166,500.00 - $273,000.00 USD. The estimated base salary range for this role to be performed in Canada, regardless of the province, is $158,000.00 - $258,000.00 CAD. The salary for the successful applicant will depend on a variety of permissible, non-discriminatory job-related factors, which include but are not limited to education, training, work experience, business needs, or market demands.

Equal Opportunity Employer

Modular is proud to emphasize an equal opportunity, safe environment for people to do their best work. Modular is an affirmative action employer. We are committed to providing equal employment opportunities regardless of race, color, ancestry, religion, sex, national origin, sexual orientation, age, citizenship, marital status, disability, gender identity, or Veteran status.
More Engineer Jobs
Role Description

The Hardware team is responsible for designing and improving our IoT devices, dealing with every step of the development process, ranging from PCB design, firmware programming, case design, validations and certifications, always aiming to build trustworthy products that can be produced at a mass scale. As a CAE Engineer at Tractian, you'll drive the innovation and development of our IIoT solutions. Collaborating across teams, you'll apply your expertise in mechanics and design to create high-performance products that redefine industrial asset management. From concept to execution, your role will shape Tractian's industry-leading technology, ensuring top-tier quality and functionality in every solution we deliver.

Responsibilities
- Set up and run FEM analyses: linear/nonlinear static, modal transient, random, frequency response, explicit, durability.
- Analyze vibrational behavior, derive FRFs, and interpret mode shapes to optimize performance.
- Apply DOE, response surfaces, optimization, and uncertainty analyses to explore design space and quantify risk.
- Plan and perform tests for model correlation and validation.
- Partner with test teams on accelerated life testing, translating field data into test requirements and simulation inputs.
- Script pre/post-processing (Python, MATLAB) to scale studies and improve modeling quality.
- Document standards, templates, and best practices; propose new methods to capture physics and speed up iteration.
- Work closely with product development, manufacturing, supplier quality, and production teams, guiding decisions with technical data.
- Deliver clear reports and presentations explaining assumptions, modeling choices, limitations, and design implications.

Qualifications
- Bachelor’s degree in Mechanical, Mechatronics, or related engineering field.
- Strong understanding of FEM fundamentals, with hands-on experience.
- Ability to explain model behavior and link results to underlying physics.
- Experience in test-based model calibration/validation and correlating simulation with experimental data.
- Knowledge of fatigue analysis and durability assessment.
- Proficiency in Python and/or MATLAB for data processing, automation, and custom tool development.
- Strong communication skills and ability to collaborate in a multidisciplinary environment.
- Proactive, self-directed learner who goes beyond the brief to resolve issues.
- Advanced English level.
- Live in or have availability to move to São Paulo.

Bonus points
- Expertise in vibration and structural dynamics.
- Experience in optimization, robust/stochastic design, and uncertainty quantification.
- Hands-on experience with FEA pre-processing tools and solvers (e.g., Abaqus, Ansys, OptiStruct, Nastran, Radioss, HyperMesh/ANSA).

Benefits
- Competitive salary and stock options.
- 30 days of paid annual leave.
- Education and courses stipend.
- Earn a trip anywhere in the world every 4 years.
- R$1.035/month for meals allowance.
- Health plan with national coverage and without coparticipation.
- Dental Insurance: we help you with dental treatment for a better quality of life.
- Wellhub and Sports Incentive: R$300/mo extra if you practice activities.
OSP Engineer III
Pearce Services
Providing mission-critical infrastructure solutions to create a more connected and sustainable future.
• Lead the end-to-end engineering of complex outside plant programs
• Set the technical direction for fiber/copper builds
• Mentor junior engineers and drive cross-functional coordination
• Ensure compliance with NESC, client specifications, and Pearce standards
• Build job budgets and forecasts
GIS Engineer
Essnova Solutions, Inc.
Federal contracting company specializing in technical, geospatial, healthcare, and administrative solutions.
Role Description

Essnova is seeking a GIS Engineer with expertise in Azure cloud services and ArcGIS Enterprise architecture to support a federal marine minerals and geospatial information program. The selected candidate will provide technical support for enterprise GIS infrastructure, cloud-hosted environments, system troubleshooting, service publishing, and security compliance activities. This role will work closely with federal stakeholders and technical teams to ensure the reliability, scalability, performance, and security of mission-critical geospatial systems.

Key Responsibilities
- Diagnose and resolve issues impacting GIS system uptime, performance, and accessibility
- Support ArcGIS Enterprise architecture, configuration, deployment, and maintenance activities
- Provide recommendations to improve system reliability, scalability, and security
- Troubleshoot database, infrastructure, and service publishing issues within Azure-hosted environments
- Support DOI security requirements, vulnerability remediation efforts, and compliance findings
- Collaborate with GIS, IT, and program stakeholders to maintain enterprise geospatial capabilities
- Assist with implementation of technical enhancements and infrastructure improvements
- Maintain documentation and support operational best practices for cloud-hosted GIS environments

Qualifications
- Bachelor's degree in Computer Science, GIS, Information Systems, Engineering, or a related field
- Demonstrated experience with Microsoft Azure cloud services, including storage, compute, networking, and security
- Experience supporting ArcGIS Enterprise deployment, administration, and maintenance
- Knowledge of geospatial service publishing and enterprise GIS infrastructure
- Experience troubleshooting performance issues in cloud-hosted environments
- Familiarity with federal security requirements and FISMA Moderate compliance
- Strong understanding of enterprise geospatial architectures and best practices
- Ability to obtain and maintain a NACI/Public Trust clearance

Preferred Qualifications
- Previous experience supporting BOEM, DOI, NOAA, USGS, or other federal geospatial programs
- Experience with Azure DevOps, infrastructure automation, or cloud modernization initiatives
- Familiarity with DOI cloud hosting requirements
- Relevant Microsoft Azure, Esri, or cloud certifications
- Experience supporting marine or environmental geospatial systems

Company Description

At Essnova Solutions, we're not just another IT services company; we're a catalyst for change in the federal contracting landscape. As a certified 8(a) and HUBZone small business, we've been recognized for our rapid growth and excellence, including being named Alabama's Minority-Owned Small Business of the Year and earning a spot on the Inc. 5000 list of fastest-growing companies. Our mission is clear: to empower those who serve by delivering bold, innovative solutions with unmatched efficiency. From geospatial analytics to healthcare IT, we tackle some of the most critical challenges across government, education, and healthcare sectors. We're seeking driven professionals ready to take ownership, drive meaningful change, and grow alongside a company that's scaling smart and fast.
• Work directly (remotely or on-site when necessary) with customer engineering teams to design and implement real-time applications built on LiveKit
• Design scalable architectures for voice AI, real-time media, and developer platform use cases
• Build prototype integrations, reference implementations, and sample applications to accelerate customer development
• Help customers navigate SDKs, APIs, infrastructure, and deployment strategies for production systems
• Debug and troubleshoot complex technical issues in production environments
• Lead technical workshops and architecture sessions with developers and product teams
• Translate customer needs into technical insights that inform product and platform development
• Partner with Product and Engineering to improve developer experience, documentation, and tooling
• Contribute code, examples, or improvements to LiveKit's open-source ecosystem when helpful
• Act as a trusted advisor helping customers successfully launch and scale applications on LiveKit



