TetraScience logo
TetraScience

TetraScience is a cloud-native technology company that develops software and hardware solutions for monitoring and managing research experiments, as well as clo

Senior Software Platform Engineer

Location

United States

Posted

32 days ago

Salary

0

Seniority

Senior

Bachelor Degree7 yrs expEnglishAWSCloudDockerPythonTypeScript

Job Description

Senior Software Platform Engineer

TetraScience

• Design, implement, and maintain cloud-native platform to support AI and data workloads, with a focus on AI and data platforms such as Databricks and AWS Bedrock. • Build and manage scalable data pipelines to ingest, transform, and serve data for ML and analytics. • Develop infrastructure-as-code using tools like Cloudformation, AWS CDK to ensure repeatable and secure deployments. • Collaborate with AI engineers, data engineers, and platform teams to improve the performance, reliability, and cost-efficiency of AI models in production. • Drive best practices for observability, including monitoring, alerting, and logging for AI platforms. • Contribute to the design and evolution of our AI platform to support new ML frameworks, workflows, and data types. • Stay current with new tools and technologies to recommend improvements to architecture and operations. • Integrate AI models and large language models (LLMs) into production systems to enable use cases using architectures like retrieval-augmented generation (RAG).

Job Requirements

  • 7+ years of professional experience in software engineering and infrastructure engineering.
  • Extensive experience building and maintaining AI/ML infrastructure in production, including model, deployment, and lifecycle management.
  • Strong knowledge of AWS and infrastructure-as-code frameworks, ideally with CDK.
  • Expert-level coding skills in TypeScript and Python building robust APIs and backend services.
  • Production-level experience with Databricks MLFlow, including model registration, versioning, asset bundles, and model serving workflows.
  • Expert level understanding of containerization (Docker), and hands on experience with CI/CD pipelines, orchestration tools (e.g., ECS) is a plus.
  • Proven ability to design reliable, secure, and scalable infrastructure for both real-time and batch ML workloads.
  • Ability to articulate ideas clearly, present findings persuasively, and build rapport with clients and team members.
  • Strong collaboration skills and the ability to partner effectively with cross-functional teams.
  • Nice to Have
  • Familiarity with emerging LLM frameworks such as DSPy for advanced prompt orchestration and programmatic LLM pipelines.
  • Understanding of LLM cost monitoring, latency optimization, and usage analytics in production environments.
  • Knowledge of vector databases / embeddings stores (e.g., OpenSearch) to support semantic search and RAG.

Benefits

  • 100% employer-paid benefits for all eligible employees and immediate family members
  • Unlimited paid time off (PTO)
  • 401K
  • Flexible working arrangements - Remote work
  • Company paid Life Insurance, LTD/STD
  • A culture of continuous improvement where you can grow your career and get coaching

Related Categories

Related Job Pages

More Platform Engineer Jobs

Strivacity logo

Platform Engineer

Strivacity

Customer identity and access management (CIAM)

Full TimeRemoteTeam 11-50Since 2019H1B No Sponsor

• Design, deploy, and operate Kubernetes clusters (EKS or self-managed) on AWS, ensuring high availability and security • Build and maintain CI/CD pipelines and internal developer tooling to improve engineering velocity • Automate infrastructure provisioning and operational tasks using Python and tools like Terraform, OpenTofu, and Ansible • Define and enforce platform standards around observability, cost management, and incident response • Partner with application teams to support containerized workloads and resolve infrastructure bottlenecks • Collaborate with Customer Success teams by providing reliable and scalable tooling that supports seamless customer onboarding, integrations, and service delivery

Hungary
Midnite logo

Lead Platform Engineer

Midnite

Building the future of betting & entertainment

Full TimeRemoteTeam 11-50Since 2020H1B No Sponsor

• Ensure Midnite's reliability, performance, and availability. • Shape the evolution of the core platform and set engineering standards. • Drive improvements across infrastructure and backend systems. • Work closely with the Head Of Platform Engineering and the global Platform team. • Manage a small team of engineers, providing direction and coaching. • Regularly contribute code, reviews, and production changes.

Canada
£115K - £130K / year
TechBiz Global logo

Azure Cloud Engineer

TechBiz Global

TechBiz Global is a leading IT recruitment and software development company

Full TimeRemoteTeam 51-200H1B No Sponsor

• Configure and wire Azure services within pre-provisioned environments to support application workloads • Implement data ingestion pipelines using Azure services • Configure and manage: o Blob Storage o Event-driven architecture o Cloud-based databases • Set up secure access using managed identities and role-based access control • Support application deployment and CI/CD workflows • Write lightweight Python scripts for automation and integration • Run AzCopy operations for large-scale data backfills from source systems into Blob Storage • Configure Application Insights for monitoring, logging, and alerts • Create clear documentation and operational runbooks • Collaborate with application engineers to ensure seamless platform integration

Mexico
Job Closed
Midnite logo

Lead Platform Engineer

Midnite

Building the future of betting & entertainment

Full TimeRemoteTeam 11-50Since 2020H1B No Sponsor

Role Description We’re looking for an experienced Lead Platform Software Engineer to ensure Midnite is reliable, performant, and always available, no matter the time of day. You’ll play a key role in shaping the evolution of our core platform, setting engineering standards, and driving improvements across our infrastructure and backend systems. - Work closely with our Head Of Platform Engineering and the global Platform team. - Take ownership of critical systems design, reliability engineering, and deployment practices. - This is a hands-on leadership role, spending most of your time in the code. - Manage a small team of engineers, helping set direction and coaching day-to-day. - Create a high-performing, engineering-led culture. - Expect meaningful ownership and to be challenged and rewarded in equal measure. Qualifications - 7+ years of engineering experience with deep expertise in Python or an equivalent modern language (e.g. Go, Rust), and cloud-based systems. - Proven experience leading or managing engineers in a platform, backend, or infrastructure-focused role. - Strong background in building resilient, scalable, and high-performance distributed systems. - Ability to break down ambiguous and complex problems into clear plans and well-defined milestones. - Demonstrated impact improving reliability, performance, and the developer experience. - Hands-on technical leadership - able to dive into code and architecture while guiding others. - Strong communication and cross-functional collaboration skills. - Experience integrating third-party systems at scale and acting as the technical lead for those relationships. - Familiarity with modern delivery practices and a deep appreciation for quality, testing, and stability. Requirements - Own and drive architectural decisions across the platform, focusing on uptime, resilience, scalability, and performance. - Lead technical design for complex cross-team projects, especially those touching core infrastructure or platform reliability. - Identify and drive improvements in deployment pipelines, observability, performance monitoring, and systems architecture. - Anticipate failure points, plan for scaling, and ensure the platform remains robust as usage grows. - Guide the adoption of best practices in platform engineering, reliability, testing, and operational excellence. - Work closely with partners and vendors to integrate their products into our systems in a safe, repeatable, and scalable way. - Support senior engineers in unblocking cross-team issues, acting as the technical escalation point. - Regularly contribute code, reviews, and production changes as part of the team’s daily delivery. Benefits - Private healthcare coverage to support your physical health and wellbeing. - Virtual GP access so you can get medical advice quickly and conveniently. - Pension scheme to help you plan and invest in your future. - Tenure holiday policy: After three years you receive an extra two days leave, increasing to 30 days annually after five years. - Flexible working and a fully supported home office setup so you can do your best work from home.

Canada
£115K - £130K / year