CrowdStrike has redefined security with the world’s most advanced cloud-native platform that protects and enables the people, processes and technologies that drive modern enterprise. Tested and proven, the world's largest organizations trust CrowdStrike to stop breaches with unparalleled protection against the most sophisticated cyberattacks. The CrowdStrike culture has been built upon our Core Values since the day we began. We are Fanatical About the Customer, Relentlessly Focused on Innovation and believe that our Limitless Passion drives Unlimited Potential for every CrowdStriker. As a purpose-built remote-first company, we believe cultivating a connected culture for every employee, no matter where they are in the world, is a key ingredient in building a high-performing, diverse team. We don’t have a mission statement. We’re on a mission—to stop breaches. Ready to join a mission that matters?
Principal Data Engineer, LLM/AI Platforms
Location
United States
Posted
1 day ago
Salary
$195K - $290K / year
Seniority
Lead
Job Description
Principal Data Engineer, LLM/AI Platforms
CrowdStrike
• Architect, implement, and optimize data platforms and pipelines specifically designed to support LLMs, Retrieval-Augmented Generation (RAG), and sophisticated AI agentic systems at Exabyte scale • Drive the adoption and deployment of agentic workflows and agent harnessing techniques to create autonomous, data-driven security features • Design and implement highly scalable, fault-tolerant, and cost-effective data solutions, emphasizing rapid iteration and high-quality deployment • Write elegant, production-ready code with a focus on performance, maintainability, and testing rigor, ensuring the ability to ship fast without compromising quality • Provide technical leadership and deep expertise in data modeling, normalization, and semantic cataloging for AI/ML workloads • Establish best practices for MLOps/DataOps surrounding LLMs, including monitoring, observability, and zero-touch recovery mechanisms for AI services • Actively mentor engineers, conducting technical workshops, leading design reviews, and strengthening the team's knowledge in cutting-edge AI platform technologies • Collaborate across the organization with Data Scientists, Product Managers, and other engineering teams to transform research prototypes into robust, production-grade services • Own the end-to-end lifecycle of critical data services: development, testing, deployment, and monitoring
Job Requirements
- Master’s degree or PhD in Computer Science, Data Engineering, or a related STEM field, or equivalent practical experience
- 10+ years of progressive experience in Data Engineering/Platform Engineering, with at least 3 years focused on architecting and building platforms for AI/ML or Data Science at massive scale
- Demonstrable hands-on experience in LLM engineering (fine-tuning, prompt engineering, deployment), RAG, and developing agentic workflows
- Proven track record of designing and delivering large-scale distributed systems (sharding, partitioning, concurrency)
- Exceptional ability to write clean, elegant, performant, and well-tested code, coupled with a proactive mindset for delivering results quickly
- A thorough understanding of engineering practices, including effective peer code reviews, resilient architecture design, and comprehensive testing paradigms
- Prior experience in a Principal or Staff level engineering role, demonstrating technical leadership and mentorship capabilities
Benefits
- Market leader in compensation and equity awards
- Comprehensive physical and mental wellness programs
- Competitive vacation and holidays for recharge
- Paid parental and adoption leaves
- Professional development opportunities for all employees regardless of level or role
- Employee Networks, geographic neighborhood groups, and volunteer opportunities to build connections
- Vibrant office culture with world class amenities
- Great Place to Work Certified™ across the globe
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Senior Project Manager, Streaming Intelligence & Data Platform
PlayOn! SportsThe nation's leading high school media company providing live streaming and digital ticketing services.
• Serve as the primary delivery partner for the Streaming Intelligence product line and DPT, converting roadmap priorities into actionable project plans with clear milestones, owners, and dependencies. • Co-own quarterly and annual planning with the Principal PM—from ideation through go-to-market—ensuring engineering capacity, vendor timelines, and business priorities are aligned. • Track and report on product line KPIs defined by the Principal PM (accuracy benchmarks, processing latency, content coverage, downstream adoption) and surface risks or deviations early. • Partner with the data team on agentic AI enablement—coordinating efforts to surface data models and semantic views to the broader business through AI-powered interfaces, including usage tracking and accuracy assurance. • Manage delivery cadence for platform infrastructure initiatives including data governance rollouts, real-time streaming capabilities, and pipeline migration efforts—tracking phased delivery across multiple months with clear metrics of success. • Lead end-to-end program management for AI pipeline initiatives, including the game film analysis benchmarking program that evaluates vision models (e.g., Gemini Flash, Gemini Pro) against human-tagged ground truth across 28+ annotated fields per play. • Manage the cadence of model evaluation cycles: coordinating game imports, prompt engineering iterations, accuracy scoring, and cost analysis across multiple AI models. • Drive vendor integration workstreams—tracking deliverables, SLAs, and roadmap alignment with external computer vision and AI partners alongside the Principal PM. • Build and maintain project documentation for hybrid human-AI workflows, including escalation paths for ambiguous plays, quality review processes, and fallback handling. • Facilitate cross-team visibility across product, engineering, data, design, and QA to ensure aligned execution mapping back to PlayOn’s unified product objectives: Engaged Communities, Resilient Services, and New Customer Markets. • Own dependency mapping, risk management, and blocker resolution for strategic AI initiatives spanning multiple engineering teams. • Develop and maintain project dashboards, status reports, and executive-level communications that increase transparency of engineering progress to the broader business. • Provide governance and optimization for Atlassian tools (Jira, Confluence) to ensure standardized workflows, consistent estimation practices, and clear traceability from strategy to execution. • Champion operational improvements to planning cycles, stakeholder coordination, cross-org meetings, and feedback loops—creating transparency through standardized tooling and repeatable frameworks. • Actively incorporate AI tools (e.g., Claude, automation platforms) into your own delivery workflows: automating status reports, synthesizing meeting notes, generating risk analyses, and accelerating documentation. • Contribute to PlayOn’s broader AI maturity journey by demonstrating AI-augmented delivery practices that can be adopted across the PMO and engineering organization.
• Design, implement, and maintain enterprise data warehouse solutions following Data Vault 2.0 methodology • Develop and optimise Hubs, Links, Satellites, PIT, and Bridge tables • Lead conceptual, logical, and physical data modelling activities across multiple initiatives • Define and promote data architecture standards, modelling guidelines, naming conventions, and documentation practices • Ensure scalability, performance, reliability, and maintainability of data warehouse architectures • Provide technical leadership and architectural guidance to Data Engineers and development teams • Drive the standardisation and automation of ELT processes, metadata management, and deployment practices • Collaborate closely with Business Analysts, Product Owners, Data Governance teams, and other stakeholders to translate business requirements into robust data solutions • Support reporting and analytics teams by delivering optimised and business-ready data structures
Data Engineer, Data Modeler
Essnova Solutions, Inc.Federal contracting company specializing in technical, geospatial, healthcare, and administrative solutions.
• Designing and maintaining scalable database architectures for analytical workloads, ensuring optimal performance through indexing, partitioning, caching, and query tuning. • Developing, automating, and maintaining data ingestion, transformation, validation, and loading pipelines to support high-quality data flows. • Building reusable data connection modules and backend infrastructure components to streamline data operations and support future scalability. • Supporting data preprocessing, normalization, and rigorous data quality validation processes. • Contributing to cloud migration planning, governance, and evaluation of cloud solutions for security, compliance, interoperability, and performance. • Producing comprehensive data models, architecture diagrams, flowcharts, and technical documentation for internal and external stakeholders. • Utilizing PowerShell scripting and automation to enhance data engineering workflows and system integration. • Collaborating with technical, security, and stakeholder teams throughout the project lifecycle to deliver high-impact data solutions.
• Design the vision and blueprint of the organization's data framework to support public health informatics objectives. • Design and manage data systems, establishing policies for how data is stored, accessed, and governed. • Coordinate various data sources within the organization to ensure consistency and integrity across the data environment. • Integrate new data technologies into existing IT infrastructures. • Perform data modeling and design activities to support reporting, analytics, and operational needs. • Develop and maintain ETL pipelines to support data movement and transformation. • Write and optimize SQL queries in support of data operations and system requirements. • Administer database environments in coordination with application and infrastructure teams. • Submit weekly status reports detailing completed and ongoing project activities, including risks as applicable. • Maintain accurate records of work performed and hours expended in accordance with OTS Task Order requirements.




