Zscaler logo
Zscaler

We make it easy to secure your cloud transformation. Get fast, secure, and direct access to apps without appliances.

Principal GenAI Data Engineer

Data EngineerData EngineerFull TimeRemoteLeadTeam 5,001-10,000Since 2008H1B SponsorCompany SiteLinkedIn

Location

United States

Posted

3 days ago

Salary

$182K - $260K / year

Seniority

Lead

Bachelor DegreeEnglishPython

Job Description

Principal GenAI Data Engineer

Zscaler

• Architect enterprise-scale GenAI data platforms for ingestion, transformation, enrichment, and serving of structured and unstructured data • Design scalable pipelines for enterprise knowledge ingestion from diverse data sources including documents, SaaS platforms, knowledge bases, collaboration tools, and databases • Define architecture for metadata extraction, chunking, enrichment, embeddings generation, and knowledge preparation workflows • Design AI-ready data models and storage strategies for vector, graph, and hybrid knowledge systems • Architect scalable unstructured data processing pipelines for text, images, PDFs, tables, and multimodal content

Job Requirements

  • Expert-level Python programming and software engineering capabilities
  • Experience building distributed/scalable data pipelines for AI workloads
  • Strong understanding of unstructured data extraction and processing pipelines
  • Experience with vector databases, graph databases, and metadata/knowledge storage systems
  • Hands-on experience with clustering, entity recognition algorithms, and modern retrieval strategies (including RAG, search, and agentic AI workflows)

Benefits

  • Various health plans
  • Time off plans for vacation and sick time
  • Parental leave options
  • Retirement options
  • Education reimbursement
  • In-office perks, and more!

Related Categories

Related Job Pages

More Data Engineer Jobs

Full TimeRemoteTeam 11-50

Role Description We are seeking a detail-oriented and dependable Remote Data Entry Specialist to join our growing team. In this role, you will be responsible for accurately entering, updating, and maintaining information within company databases and systems. This is a fully remote position offering flexibility, paid training, and opportunities for professional growth. Salary: $25–$30 per hour Weekly pay available depending on employer policies. Responsibilities - Enter and update data accurately into company databases and spreadsheets - Review records for errors, inconsistencies, and missing information - Organize and maintain digital files and documentation - Verify data accuracy by cross-checking source materials - Communicate with internal teams to resolve discrepancies - Follow confidentiality and company data protection procedures - Meet daily and weekly productivity goals Qualifications - High school diploma or equivalent - Strong attention to detail and organizational skills - Basic computer knowledge and typing proficiency - Ability to work independently in a remote environment - Strong communication and time-management skills - Reliable internet connection and computer/laptop access Preferred Qualifications - Previous data entry, administrative, or customer support experience preferred but not required - Familiarity with Microsoft Excel, Google Sheets, and online databases is a plus Benefits - Fully remote position - Flexible scheduling options - Paid training provided - Career advancement opportunities - Supportive and collaborative work environment - Work-life balance with remote flexibility

United States
$25 - $30 / hour
Leadfeeder logo

Senior Data Engineer, Platform Data

Leadfeeder

Identify visitors, qualify prospects, connect with decision makers.

Data Engineer3 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

• Design, build, and operate production data pipelines that power Leadfeeder's product features — from ingestion through enrichment, processing, and serving. • Build and maintain streaming and real-time ingestion systems that move event data through the platform at scale and with low latency. • Own the cloud infrastructure underpinning the pipelines — compute, storage, networking, security, observability — designed and managed as code. • Collaborate with product and ML engineers to deliver datasets and pipelines that power product-facing features and AI/ML workflows. • Implement data quality, observability, and reliability controls across the pipelines so issues are caught early, incidents are short, and downstream teams can trust the data. • Drive engineering practices across the team: code review, testing, CI/CD for data, infrastructure-as-code, performance tuning, and cost discipline. • Partner with engineering, product, and ML teams to translate product requirements into scalable, well-documented data systems.

Germany
Full TimeRemoteTeam 201-500Since 2014H1B Sponsor

• Deliver cutting-edge services and solutions • Help global enterprises overcome their toughest data challenges • Collaborate with major cloud data platforms like Snowflake, AWS, Azure, GCP.

Brazil
Lumen Technologies logo

Senior Lead Data Architect

Lumen Technologies

Lumen Technologies is self-described as a global company of 40,000+ professionals empowering businesses, government, and communities to “produce amazing things.” Driven by the

Data Engineer3 days ago
Full TimeRemoteTeam 10,001

Role Description Lumen is seeking a strategic and technically adept Senior Lead Data Architect to lead high-impact analytics initiatives across the Product organization. This role goes beyond traditional business intelligence — the ideal candidate will bring deep strengths in automation, advanced analytics, and scalable reporting, with working knowledge of data science methods and the versatility to operate across both code and no-code environments. This individual will drive data discipline, enable predictable delivery, and support innovation by transforming complex datasets into actionable insights. They will serve as a thought partner to product leadership, influencing decisions through rigorous analysis, automated workflows, and enterprise-grade reporting frameworks. You’ll be at the heart of Lumen’s transformation, enabling data-driven decision-making across product innovation, delivery, and customer experience. This role offers visibility to executive leadership and the opportunity to shape how data informs our future. Main Responsibilities - Strategic Analytics Leadership: Partner with Product Ops and Product Houses to define and measure innovation vs. predictability trade-offs, surfacing gaps in current metrics and proposing new KPIs aligned to business goals. - Data Architecture & Governance: Lead efforts to unify data sources across legacy systems and modern platforms (e.g., CDW, Palantir Foundry), ensuring consistency, auditability, and scalability of analytics solutions. - Product Performance Measurement: Develop frameworks to assess product delivery velocity, backlog health, and customer impact using tools like Power BI, Salesforce, and internal product layer data. - Stakeholder Engagement: Collaborate with Product Managers, Engineering, Finance, and Sales to align data definitions and reporting logic, ensuring transparency and trust in shared dashboards and executive summaries. Design and maintain standardized, automated reporting pipelines that reduce manual effort and deliver consistent, on-demand insights to stakeholders at all levels. - Advanced Modeling & Forecasting: Build predictive models and apply data science techniques — including regression, clustering, and time-series analysis — to support funnel analysis, ARPU forecasting, churn prediction, and incremental sales tracking for high-bandwidth services (e.g., Ethernet, IPVPN, NAS). Translate model outputs into business-ready narratives for non-technical audiences. - Mentorship & Enablement: Guide junior team members and cross-functional teams in best practices for data handling, visualization, and storytelling. Champion upskilling and bi-directional data literacy across the Product organization. - Automation & Workflow Engineering: Design and implement automated data pipelines, scheduled reports, and alert-driven workflows that reduce manual processing and increase the speed and reliability of analytics delivery. Leverage scripting (Python, SQL) alongside automation platforms to operationalize recurring analytics at scale. - Code & No-Code Environment Fluency: Operate effectively in both programmatic (Python, SQL, Jupyter) and no-code/low-code environments (Power BI, Tableau, Alteryx, or similar), selecting the right tool for each audience and use case. Empower business users through self-serve analytics while maintaining rigor in code-based workflows for complex analysis. Qualifications - Bachelor’s or Master’s degree in Data Science, Statistics, Computer Science, or related field. - 8+ years of experience in data analytics, with at least 3 years in a senior or principal role. - Proficiency in SQL, Python, and enterprise data systems (e.g., CDW, SAP ECC); hands-on experience building automated workflows, scheduled pipelines, and data transformation scripts. - Strong understanding of telecom product structures, service definitions, and port-based connectivity models. - Experience with audit preparation and investor-facing reporting frameworks. Preferred Skills - Familiarity with Palantir Foundry and ontology-driven data modeling. - Experience in product operations, backlog management, and agile delivery metrics. - Ability to translate business strategy into measurable outcomes and scalable dashboards. - Data science exposure preferred — familiarity with machine learning concepts, statistical modeling, and libraries such as scikit-learn, statsmodels, or similar; ability to collaborate with or direct data science teammates. - Experience with no-code and low-code analytics tools (e.g., Power BI, Tableau, Alteryx, Dataiku, or similar), with the ability to serve both technical and non-technical users within the same workflow. - Experience designing and managing automated reporting systems, including scheduled delivery, exception alerting, and self-serve analytics portals. - Comfort working across the full analytics stack — from raw data extraction and transformation to polished executive-facing deliverables — without requiring handoffs between teams. Compensation This information reflects the anticipated base salary range for this position based on current national data. Minimums and maximums may vary based on location. Individual pay is based on skills, experience and other relevant factors. - $132,232 - $176,310 in these states: AL, AR, AZ, FL, GA, IA, ID, IN, KS, KY, LA, ME, MO, MS, MT, ND, NE, NM, OH, OK, PA, SC, SD, TN, UT, VT, WI, WV, WY - $138,844 - $185,124 in these states: CO, HI, MI, MN, NC, NH, NV, OR, RI - $145,456 - $193,940 in these states: AK, CA, CT, DC, DE, IL, MA, MD, NJ, NY, TX, VA, WA Benefits - Lumen offers a comprehensive package featuring a broad range of Health, Life, Voluntary Lifestyle benefits and other perks that enhance your physical, mental, emotional and financial wellbeing.

United States
$132.2K - $193.9K / year