Staff Platform Engineer

Platform Engineer | Full Time | Remote | Lead | Team 49 | Since 2020

Location

Australia

Posted

9 days ago

Salary

0

Seniority

Lead

Job Description

Gridsight

Role Description

We're hiring a Staff Platform Engineer to build the foundations that let every product squad at Gridsight ship faster and with greater confidence. You'll work across the engineering organisation, identifying the cross-cutting problems that slow squads down and solving them once, well. That might be observability, deployment pipelines, environment management, test infrastructure, or emerging areas like AI-assisted developer tooling. Wherever the leverage is, that's where you'll be.

We're building an engineering culture where AI-assisted development is the norm, not the exception. We expect our engineers to be actively using agentic coding tools and LLM-assisted workflows to move faster, think bigger, and deliver more. If you're already working this way, or you're the kind of engineer who's itching to, you'll fit right in.

What You'll Do

- Identify and solve the cross-cutting platform problems that prevent product squads from shipping faster and with confidence
- Build and operate observability platforms (metrics, tracing, logging, alerting) that give product teams and on-call engineers clear, actionable visibility into system health
- Evolve our deployment automation, release management, and infrastructure-as-code; tackle environment management and test infrastructure head-on to reduce lead time and tighten CI feedback loops
- Architect infrastructure that's secure, cost-effective, and scalable across multi-tenant, multi-region deployments, and proactively pay down tech debt before it bites
- Bring platform engineering standards into our data layer (dbt, Databricks): CI/CD, automated testing, environment promotion, observability
- Act as a culture carrier for platform and reliability engineering: mentoring engineers across squads, sharing patterns, and shifting how the broader engineering organisation thinks about reliability and developer experience
- Treat internal developers as customers: seek feedback, measure satisfaction, iterate on tooling based on real friction

Qualifications

- 6+ years of software engineering experience with demonstrated impact at Staff level in platform, infrastructure, or SRE domains
- Deep cloud infrastructure experience at production scale (compute, networking, storage, IAM, cost management), with strong infrastructure-as-code and container orchestration skills (AWS and Terraform preferred)
- Proven track record building and operating observability platforms (metrics, tracing, logging, alerting) at meaningful scale
- Strong software engineering fundamentals: clean architecture, testing, version control, code review
- A track record of defining systems and practices, not just working within them: you set technical direction, form strong opinions grounded in experience, and drive outcomes without close direction
- Demonstrated experience as an enabling function within a product engineering organisation: reducing cognitive load on product teams, building capabilities they adopt willingly, and knowing when to consult, guide, or step back
- Fluency in AI-assisted development tooling: you're already using agentic coding tools or LLM-assisted workflows to accelerate your work

Requirements

- SRE background: SLOs, error budgets, incident management, chaos engineering, war games
- Experience building internal developer platforms or PaaS capabilities with a strong focus on DX
- Familiarity with data platform technologies (Databricks, Spark, dbt) and their infrastructure requirements
- Experience with authentication and identity patterns (Auth0, Azure AD, SAML/SSO, OIDC) in multi-tenant environments
- Contributions to open-source projects or technical communities
- Experience in energy, utilities, or infrastructure technology

Benefits

- Competitive salary and equity package
- Remote-first, with head office in Sydney
- A talented team of engineers, data scientists, and power systems specialists working on hard problems that matter

More Platform Engineer Jobs

Role Description

We are hiring a Senior Platform Engineer to own, design, and scale our Azure-based platform infrastructure. This is not a pure DevOps role. You will:

- Build the platform (hands-on)
- Define the architecture (strategy)
- Evolve into leadership (Team Lead → Director → Head)

We want someone who can think like a Head of infrastructure, but still ship like a senior engineer in startup mode.

What You Will Do

- Platform & Infrastructure
  - Design and operate multi-environment Azure infrastructure (dev / staging / production)
  - Build secure, scalable cloud architectures using:
    - VNet, Subnets, Private Endpoints
    - AKS (Azure Kubernetes Service)
    - Storage (Blob, SQL, CosmosDB if needed)
  - Ensure strong isolation and compliance boundaries (HIPAA-ready mindset)
- Infrastructure as Code
  - Lead infrastructure development using Terraform
  - Define reusable modules, environments, and deployment patterns
  - Implement infrastructure lifecycle management and drift control
- DevOps & CI/CD
  - Build and maintain Azure DevOps pipelines
  - Design secure, scalable CI/CD workflows
  - Automate:
    - Build
    - Test
    - Deployment
    - Rollback strategies
- Containers & Orchestration
  - Design and operate containerized workloads
  - Manage AKS clusters at scale
  - Implement:
    - Helm / deployment strategies
    - Autoscaling
    - Observability
- Security & Compliance
  - Implement secure-by-design infrastructure
  - Work with:
    - Identity & access (RBAC, Managed Identity)
    - Secrets management (Key Vault)
    - Network isolation
  - Build with HIPAA / regulated environment mindset
- Cost & Performance Engineering
  - Optimize cloud cost (FinOps mindset)
  - Monitor and improve:
    - Resource utilization
    - Scaling strategies
    - Cost vs performance trade-offs
- AI Infrastructure (Next Gen)
  - Deploy and operate agentic AI workflows using Azure AI Foundry
  - Integrate AI services into platform architecture
  - Build infrastructure supporting:
    - LLM-based workflows
    - Data pipelines for AI systems
- Leadership & Evolution
  - Act as a technical leader for platform
  - Define best practices, standards, and architecture
  - Mentor engineers and help build the platform team
  - Grow into:
    - Team Lead → Director → Head

Qualifications

- Strong experience in Azure cloud ecosystem
- Deep knowledge of:
  - Networking (VNet, DNS, private endpoints)
  - Compute (AKS, VMs, containers)
  - Storage systems
- Proven experience with Terraform in production
- Hands-on experience with:
  - Azure DevOps (pipelines, releases)
  - CI/CD design and automation
- Strong experience with:
  - Docker
  - Kubernetes / AKS
- Experience across:
  - On-premise environments
  - IaaS
- Ability to design end-to-end infrastructure systems
- Experience with regulated environments (HIPAA or similar is a strong plus)
- Understanding of:
  - Data protection
  - Access control
  - Auditability
- Experience with Azure AI / LLM infrastructure (Strong Plus)
- Interest in building AI-first platform systems

Nice to Have

- Experience with .NET / backend systems
- Experience in:
  - Healthcare
  - Fintech
  - Payments / regulated systems
- Experience with HashiCorp tools

Benefits

- Build greenfield + evolving platform in a complex, high-impact domain
- Work on real-world regulated systems (healthcare + payments-like complexity)
- Opportunity to shape the entire platform organization
- Competitive compensation
- Strong growth path into leadership
- Exposure to AI-native infrastructure from day one
- Meaningful Equity – Share in the success you help build.
- Comprehensive Healthcare – Medical, dental, and vision.
- Flexible Work – Fully remote with flexible hours.
- Generous PTO – Time to recharge, plus company holidays.
- Professional Development – Budget for certifications, courses, and conferences.

United States

Senior Manager – Platform Engineering

CVS Health

Bringing our heart to every moment of your health.

Full Time | Remote | Team 10,001+ | Since 1963 | H1B No Sponsor

• Leading a team of engineers, including hiring, coaching, career development, and performance management
• Setting technical direction and driving architectural decisions for your team's domain, balancing innovation with reliability and maintainability
• Partnering with Product and Architecture to define roadmaps, decompose initiatives into deliverables, and ensure alignment across teams
• Driving operational excellence through investment in observability, incident response, SLOs, and production readiness practices
• Establishing and evolving engineering processes that enable your team to deliver high-quality software at pace: CI/CD, code review, testing strategies, and release management
• Building a culture of ownership, psychological safety, and continuous learning within your team
• Managing delivery across multiple workstreams, removing blockers, managing dependencies, and communicating status and risk to leadership
• Contributing to the broader engineering organization through participation in technical governance, architecture reviews, and cross-team initiatives
• Championing developer experience and engineering productivity improvements that benefit teams beyond your own
• Staying current with industry trends and technologies, evaluating their applicability to our platform and advocating for strategic adoption where appropriate

Massachusetts
$130.3K - $260.6K / year

Role Description

We are seeking an experienced Kafka Platform Engineer to architect, deploy, and operate large-scale Apache Kafka and Confluent platform environments supporting mission-critical event-driven workloads. In this role you will own the Kafka platform end-to-end, including:

- Cluster sizing
- Configuration
- Security
- Automation
- Observability
- Developer enablement

The ideal candidate will combine deep Kafka internals knowledge with strong DevOps and SRE practices, and will partner with application teams to deliver a reliable, performant, and developer-friendly streaming platform.

You will work closely with cross-functional partners (product, design, engineering, operations, and business stakeholders) to translate ambiguous requirements into well-engineered solutions. Expectations include raising the bar through code review, design review, and mentorship of more junior engineers. The successful candidate brings strong engineering discipline, a clear communication style, and a track record of shipping meaningful work that holds up well in production.

Qualifications

- Bachelor’s degree in Computer Science, Engineering, or a related technical discipline.
- Five or more years of experience operating Apache Kafka or Confluent Platform in production.
- Deep, hands-on knowledge of Kafka internals (partitions, replication, ISRs, consumer groups).
- Strong experience with Kafka security (SASL, mTLS, ACLs, RBAC).
- Hands-on experience with Kafka Connect, Schema Registry, and either Kafka Streams or ksqlDB.
- Experience with HA/DR strategies for Kafka.
- Strong scripting skills in Python, Bash, or Go.
- Hands-on experience with infrastructure-as-code (Terraform, Ansible).
- Working knowledge of observability tooling for Kafka.
- Excellent troubleshooting, communication, and documentation skills.

Requirements

- Architect, deploy, and operate large-scale Apache Kafka or Confluent Platform clusters across on-prem and cloud environments.
- Design partitioning, replication, and topic strategies that balance throughput, durability, and operational simplicity.
- Implement strong security on Kafka clusters using SASL, mTLS, ACLs, RBAC, and integration with corporate IdPs.
- Operate Schema Registry, Kafka Connect, KSQL/ksqlDB, and Kafka Streams in production.
- Build and operate Kafka Connect pipelines integrating sources and sinks across enterprise systems.
- Design HA/DR strategies for Kafka, including MirrorMaker 2, Cluster Linking, and multi-region active-active patterns.
- Build CI/CD pipelines for Kafka topic, ACL, and connector configurations using GitOps patterns.
- Implement comprehensive observability using Prometheus, Grafana, Datadog, or Confluent Control Center.
- Drive Kafka cost and capacity optimization through right-sizing and storage tiering.
- Onboard application teams to Kafka with clear patterns, templates, and best practices.
- Lead incident response and post-incident reviews for streaming workloads.
- Mentor and coach junior and mid-level engineers through code review, design review, pair programming, and structured knowledge sharing.
- Maintain comprehensive, current technical documentation.
- Continuously evaluate emerging streaming technologies (Pulsar, Redpanda, AWS MSK, Azure Event Hubs).

Benefits

- Competitive base salary commensurate with experience, plus benefits.

How to Apply

For immediate consideration, please send your resume to [email protected] or contact us at +1 (908) 765-8199. Learn more about Bright Vision Technologies at www.bvteck.com.

United States

Role Description

We are seeking a PLM Platform Engineer with deep experience operating either PTC Windchill or Siemens Teamcenter (preferably both) in large enterprise environments. In this role you will own the technical operation of the PLM platform (installation, configuration, performance tuning, upgrades, integrations, and high availability) and partner with functional, engineering, and manufacturing teams to deliver a reliable, performant, and secure PLM ecosystem. The ideal candidate will bring strong PLM administration fundamentals, hands-on experience with PLM upgrades and migrations, and a measurement-driven approach to platform reliability.

Key Responsibilities

- Install, configure, and operate Windchill or Teamcenter environments across development, test, and production.
- Lead PLM upgrades, patches, and platform migrations with minimal disruption.
- Manage PLM application servers, web servers, database connectivity, and method servers.
- Operate file vaults, replication services, and CAD data management subsystems.
- Implement and tune HA/DR strategies for PLM environments.
- Optimize PLM performance through query tuning, caching, indexing, and JVM tuning.
- Manage user provisioning, security configurations, and audit support.
- Operate PLM integration brokers and middleware connectors.
- Develop automation scripts using shell, Python, or Ansible.
- Monitor PLM health using native tooling and integrated observability platforms.
- Provide hands-on post-go-live and hypercare support.
- Maintain comprehensive, current technical documentation.
- Mentor and coach junior and mid-level engineers.
- Drive continuous improvement of the PLM platform.

Qualifications

- Bachelor’s degree in Computer Science, Engineering, or a related technical discipline.
- Five or more years of PLM platform administration experience.
- Hands-on experience with either PTC Windchill or Siemens Teamcenter in production.
- Strong experience with PLM upgrades and migrations.
- Working knowledge of Oracle and SQL Server database administration.
- Strong Linux/Unix administration skills.
- Experience operating HA/DR for PLM environments.
- Familiarity with PLM integration brokers and middleware.
- Scripting skills in shell, Python, or Ansible.
- Excellent troubleshooting and documentation skills.

Preferred Qualifications

- Experience operating PLM on cloud platforms (AWS, Azure, OCI).
- Exposure to infrastructure-as-code for PLM environments.
- Familiarity with CI/CD patterns for PLM change management.
- PTC or Siemens PLM certifications.
- Experience with CAD integration patterns at scale.

How to Apply

Would you like to know more about this opportunity? For immediate consideration, please send your resume to [email protected] or contact us at +1 (908) 765-8199. Learn more about Bright Vision Technologies at www.bvteck.com.

Equal Employment Opportunity (EEO) Statement

Bright Vision Technologies (BV Teck) is committed to equal employment opportunity (EEO) for all employees and applicants without regard to race, color, religion, sex, sexual orientation, gender identity or expression, national origin, age, genetic information, disability, veteran status, or any other protected status as defined by applicable federal, state, or local laws. This commitment extends to all aspects of employment, including recruitment, hiring, training, compensation, promotion, transfer, leaves of absence, termination, layoffs, and recall.

United States