Job Closed

This listing is no longer active.

AI Platform Engineer

Location

Australia

Posted

49 days ago

Salary

0

Seniority

Senior

Job Description

AI Platform Engineer

Budgetly

• Design agentic workflows that take a requirement from spec to production • Review agent output with a critical eye to catch the quality issues that automated checks miss • Identify failure patterns and turn them into system-level improvements • Maintain and evolve the platform (TypeScript, serverless architecture, shared codebase) • Work closely with product to reduce the ambiguity agents struggle with

Job Requirements

  • Strong TypeScript and React experience in production environments
  • You’ve shipped real software to real users, not just prototypes
  • You can read a codebase and quickly identify its patterns, conventions, and architecture
  • Comfort working in ambiguity, as you’ll help define what “good” looks like in this space
  • Nice to have: AWS serverless experience (CDK, Lambda, DynamoDB)

Benefits

  • 5 weeks annual leave and flexible working - we believe in balance
  • Monthly “Wellness Budget” to help you stay healthy, mentally & physically
  • Employee share options (ESOP) for all team members - we truly all “own it”

Related Categories

Related Job Pages

More Platform Engineer Jobs

Role Description We are seeking a highly skilled AI Infrastructure and Kubernetes Platform Architect with deep expertise in managing GPU-accelerated workloads on NVIDIA DGX systems. The ideal candidate will have hands-on experience with Kubernetes at the administrator, application developer, and security levels (CKA, CKAD, CKS), and will be responsible for designing, deploying, securing, and maintaining large-scale AI infrastructure powered by DGX BasePODs and SuperPODs. This role involves optimizing AI workloads, managing high-performance networking (InfiniBand), and ensuring operational excellence across NVIDIA AI systems and BlueField DPU environments. Key Responsibilities - Kubernetes and AI Platform Orchestration - Architect and maintain containerized AI/ML platforms using Kubernetes on DGX systems. - Integrate NVIDIA Base Command Manager with Kubernetes for workload scheduling and GPU resource optimization. - Design multi-tenant GPU resource partitioning strategies using MIG (Multi-Instance GPU) to maximize hardware utilization across concurrent AI workloads. - Implement and manage Helm charts, custom controllers, and GPU operators for scalable ML infrastructure. - DGX Infrastructure Administration - Administer and optimize NVIDIA DGX BasePODs and SuperPODs. - Ensure optimal GPU, CPU, and storage performance across AI clusters. - Leverage DGX System Administration best practices for lifecycle management and updates. - Coordinate capacity planning for DGX cluster expansion including rack power, cooling, and storage integration with NVIDIA AI Enterprise software stack. - High-Performance Networking & DPU - Deploy, monitor, and manage InfiniBand networks using Unified Fabric Manager (UFM). - Integrate BlueField DPUs for offloaded security, networking, and storage tasks. - Optimize end-to-end data pipelines from storage to GPUs. - Security and Compliance - Apply best practices from the CKS certification to harden Kubernetes clusters and AI workloads. - Implement secure service mesh and microsegmentation with BlueField DPU integration. - Conduct regular audits, vulnerability scanning, and security policy enforcement. - Automation & Monitoring - Automate deployment pipelines and infrastructure provisioning with IaC tools (Terraform, Ansible). - Monitor performance metrics using GPU telemetry, Prometheus/Grafana, and NVIDIA DCGM. - Troubleshoot and resolve complex system issues across hardware and software layers. - Implement MLOps workflows integrating KubeFlow Pipelines, NVIDIA Triton Inference Server, and model registry tooling to support end-to-end model training and production deployment. Qualifications - CKA, CKAD, CKS certifications – demonstrating full-stack Kubernetes expertise. - Proven experience with NVIDIA DGX systems and AI workload orchestration. - Hands-on expertise in InfiniBand networking, UFM, and BlueField DPU administration. - Strong scripting and automation skills in Python, Bash, YAML. - Familiarity with Base Command Manager, NVIDIA GPU Operator, and KubeFlow is a plus. - Ability to work across teams to support ML researchers, DevOps engineers, and infrastructure teams.

United States
$110 - $150 / hour
Job Closed
Tucows logo

Principal Engineer, Platform Engineering

Tucows

Making the Internet better since 1993. We're in the business of building platforms that keep people connected.

Full TimeRemoteTeam 1,001-5,000H1B No Sponsor

• Gain a deep understanding of the current platform landscape including infrastructure, platform services, and application layers. • Identify key constraints that are slowing engineering delivery. • Build context around the environment platform strategy, including declarative environments, generated artifacts, and self-service developer experiences. • Identify current platform gaps such as manual steps or fragile integrations. • Establish strong working relationships across Infrastructure/CI, Operations, Security, and product engineering teams. • Begin shaping shared standards through design reviews and code reviews. • Identify 1–2 high-leverage improvements.

Canada
$164.5K - $192.8K / year
Job Closed
Airwallex logo

Staff Data Platform Engineer

Airwallex

Airwallex is a financial services company that has developed a “global financial platform for modern businesses.” As an employer, the company strives to cul

Full TimeRemoteTeam 2,200Since 2015

About Airwallex Airwallex is the only unified payments and financial platform for global businesses. Powered by our unique combination of proprietary infrastructure and software, we empower over 200,000 businesses worldwide - including Brex, Rippling, Navan, Qantas, SHEIN and many more - with fully integrated solutions to manage everything from business accounts, payments, spend management and treasury, to embedded finance at a global scale. Proudly founded in Melbourne, we have a team of over 2,000 of the brightest and most innovative people in tech across 26 offices around the globe. Valued at US$8 billion and backed by world-leading investors including T. Rowe Price, Visa, Mastercard, Robinhood Ventures, Sequoia, Salesforce Ventures, DST Global, and Lone Pine Capital, Airwallex is leading the charge in building the global payments and financial platform of the future. If you're ready to do the most ambitious work of your career, join us. Attributes We Value We hire successful builders with founder-like energy who want real impact, accelerated learning, and true ownership. You bring strong role-related expertise and sharp thinking, and you're motivated by our mission and operating principles. You move fast with good judgment, dig deep with curiosity, and make decisions from first principles, balancing speed and rigor. You're humble and collaborative; turn zero-to-one ideas into real products, and you "get stuff done" end-to-end. You use AI to work smarter and solve problems faster. Here, you'll tackle complex, high-visibility problems with exceptional teammates and grow your career as we build the future of global banking. If that sounds like you, let's build what's next. Staff Data Platform Engineer (Real-time Data Platform) Knowledge Platform Team Hiring Location: Singapore Airwallex is the leading financial technology platform for modern businesses growing beyond borders. With one of the world's most powerful payments and banking infrastructures, our technology empowers businesses of all sizes to accept payments, move money globally, and simplify their financial operations, all in one single platform. Established in 2015 in Melbourne, we aim to connect entrepreneurs, business builders, makers, and creators with opportunities in every corner of the world. Today, Airwallex has a global footprint across Asia-Pacific, Europe, and North America. Who We Are? Team Introductions: The Knowledge Platform team is at the heart of our company's data and AI strategy. We are building the foundational infrastructure that empowers the entire company to leverage data, AI, and ML into business impact. We accomplish this by creating platforms that handle the entire data and AI/ML lifecycle, simplifying the interface while providing proper safety and governance. This includes managing our data infrastructure (Databricks, Spark, Kafka, etc.), the technology to serve that data to our users (RAG, MCP, etc.), and the platform to host and govern these AI/ML models. In 2026, our team's overarching mission is to evolve our full data ecosystem-encompassing both platform and models-into a fully AI agent-ready infrastructure; we will empower customers to engage directly with the data platform to extract actionable value through capabilities like analytics and natural language querying, while also upgrading the platform to deliver robust, real-time performance for instant, data-driven decision-making. Who You Are? Role Overview: As a high level architect (staff engineer), you will oversee the strategy, architecture, development, and operation of Airwallex's data and AI platforms. You will play a pivotal role in influencing data-driven and AI-powered decision-making processes. A key aspect of your role will also include building a high-performing team that excels in a fast-paced environment. Your leadership will be critical in mentoring the team, promoting a culture of innovation, and driving technical excellence. Responsibilities: - Spearheaded the identification and resolution of Airwallex-wide challenges using cutting-edge data platform solutions. - Provide visionary technical direction, fostering a community within Airwallex's data realm, and actively leading in solving complex problems hands-on. - Advocate for best practices across the data platform, instilling a culture of craftsmanship and innovation. - Mentor the data platform team, nurturing both technical and professional development. Minimum Requirements: - A minimum of 8 years of experience in Data Platform or an equivalent combination of work and academic exposure in a quantitative field. - Proven experience leading company-wide initiatives across multiple teams or influencing tech roadmap planning. - Effective collaboration with diverse teams and stakeholders to drive tangible business outcomes. - Demonstrated ability to balance execution and velocity with in-depth research, statistical understanding, and scalable design. - Track record of mentoring and investing in the development of scientists, engineers, and peers. - Experience providing technical leadership on significant projects, covering ETL frameworks, metrics stores, infrastructure management, and data security. - Proficiency in building, deploying, and maintaining reliable multi-geographical data pipelines at scale. - Familiarity with workflow or orchestration frameworks such as Airflow, DBT, etc. Preferred Qualifications: - Hands-on design experience in crafting data processing patterns for a modern Lakehouse architecture. - Contribute to the design and development of standard framework modules, high-performance services, and client libraries for big data using tools like GCP, Databricks, BigQuery, DataProc, Kafka, Kubernetes, Spark, DataFlow, Google Cloud Storage, and Airflow. - Excellent written and verbal communication skills tailored for diverse audiences (leadership, users, company-wide). - Ability to rapidly evaluate various technologies and conduct proof of concepts to drive architecture design. - Experience thriving in a complex environment. At Airwallex, you can make an impact in a rapidly growing, global fintech. We want you to share in our success, which is why you'll be offered a competitive salary plus valuable equity within Airwallex. We also like to ensure we create the best environment for our people by providing collaborative open office space with a fully stocked kitchen. We organise regular team-building events and we give our people the freedom to be creative. Applicant Safety Policy: Fraud and Third-Party Recruiters To protect you from recruitment scams, please be aware that Airwallex will not ask for bank details, sensitive ID numbers (i.e. passport), or any form of payment during the application or interview process. All official communication will come from an @airwallex.com email address. Please apply only through careers.airwallex.com or our official LinkedIn page. Airwallex does not accept unsolicited resumes from search firms/recruiters. Airwallex will not pay any fees to search firms/recruiters if a candidate is submitted by a search firm/recruiter unless an agreement has been entered into with respect to specific open position(s). Search firms/recruiters submitting resumes to Airwallex on an unsolicited basis shall be deemed to accept this condition, regardless of any other provision to the contrary. Equal opportunity Airwallex is proud to be an equal opportunity employer. We value diversity and anyone seeking employment at Airwallex is considered based on merit, qualifications, competence and talent. We don't regard color, religion, race, national origin, sexual orientation, ancestry, citizenship, sex, marital or family status, disability, gender, or any other legally protected status when making our hiring decisions. If you have a disability or special need that requires accommodation, please let us know.

Singapore
Job Closed
People Inc. logo

Internal Tools Platform Intern

People Inc.

People Inc. is the largest print and digital publisher in America. Nearly 200 million people trust us each month to help them make decisions, take action, and find inspiration. People Inc.'s over 40 iconic brands include PEOPLE, Better Homes & Gardens, Verywell, Food & Wine, Travel + Leisure, Allrecipes, REAL SIMPLE, Investopedia, and Southern Living. Please be aware of fraudulent recruiters offering opportunities at People Inc. If you are in conversations about a job opportunity and wish to confirm its validity, please reach out directly to hrconcerns@people.inc.

InternshipRemoteTeam 1,001-5,000Since 1996

Job Title Internal Tools Platform Intern Job Description This internship will start May 2026 and wrap up June 2027. About People Inc: People Inc. is America’s largest digital and print publisher. Our 40+ iconic and fast-growing brands harness the best intent-driven content, the fastest sites, and the fewest ads to help nearly 200 million people every month, including 95 percent of US women, make decisions, take action, and find inspiration. People Inc. brands include PEOPLE, Better Homes & Gardens, Verywell, FOOD & WINE, The Spruce, Allrecipes, Byrdie, REAL SIMPLE, Investopedia, Southern Living and more. About Your Team: The Internal Tools Platform team builds and maintains internal application and service frameworks used by engineering teams across People Inc. The team works on a variety of projects including CLI tooling, frontend component libraries, and cutting edge AI-native workflows. About the Intern Role: People Inc. is looking for an eager 3rd or 4th year student looking to learn and grow in a fast-paced environment. You will have the opportunity to build out new functionality within our tooling platform under the guidance of highly experienced and talented Senior and Principal engineers, exposing you to and deepening your knowledge of JavaScript/TypeScript, Vue, Node, Deno, and more. Internship Responsibilities will Include: - Contribute to the development and maintenance of our internal CLI tooling and plugin ecosystem, implementing new commands and improving developer experience - Build and enhance reusable frontend components within our Vue-based component library, ensuring consistency and accessibility across internal applications - Assist in developing full stack features across our internal app framework, spanning both API endpoints and frontend interfaces - Write clean, well-documented TypeScript/JavaScript code and participate in code reviews to learn best practices from senior and principal engineers - Engage in agile ceremonies (standups, sprint planning, retrospectives) and contribute to a collaborative, fast-paced engineering team About You: You are a curious and driven engineering student with a genuine passion for learning and building. You're comfortable diving into unfamiliar problems, asking thoughtful questions, and iterating quickly based on feedback. You take ownership of your work, communicate openly with your team, and care about the quality of what you ship. You're as comfortable collaborating with experienced engineers as you are working independently to figure things out. Most importantly, you bring a growth mindset — you see every challenge as an opportunity to deepen your skills and make a meaningful contribution. Candidates for this role should have: - Currently enrolled in a 3rd or 4th year Computer Science, Software Engineering, or related post-secondary program - Foundational knowledge of JavaScript/TypeScript and experience building with Node.js or similar server-side runtimes - Familiarity with component-based frontend frameworks such as Vue, React, or Angular - Basic understanding of RESTful API design and consumption - Exposure to version control workflows using Git, including branching, pull requests, and code reviews - A collaborative mindset with strong communication skills and a willingness to ask questions and learn from experienced engineers It is the policy of People Inc. to provide employment opportunities regardless of age, physical or mental disability, race, gender, sex, sexual orientation, or any other characteristic protected by applicable laws. In addition, the Company will provide reasonable accommodations for qualified individuals with disabilities. Accommodation requests can be made by emailing hr@people.inc. #NMG#

Canada
Job Closed