Job Closed
This listing is no longer active.
The AI-Native Platform Rewriting the Architecture of Modern Acquisitions.
Senior Data Engineer
Location
California
Posted
101 days ago
Salary
0
Seniority
Senior
Job Description
Senior Data Engineer
Rohirrim
• Design, build, and optimize data pipelines and infrastructure for AI products • Collaborate closely with AI/ML teams, product teams, and security/compliance partners • Develop and operate ETL/ELT workflows • Implement and optimize vector database systems and embeddings pipelines • Architect and manage Azure-based data infrastructure • Build internal tools for metadata extraction and document parsing • Monitor and improve pipeline performance and reliability
Job Requirements
- 10+ years in Data Engineering, Software Engineering, or ML/Data Infrastructure roles
- Strong experience with Python, SQL, and modern data engineering tools (Airflow, Dagster, dbt, Prefect, etc.)
- Experience building large-scale document extraction ETL pipelines (OCR, PDF parsing, metadata extraction, NLP preprocessing)
- Proficiency with Kubernetes, Docker, and containerized data pipelines deployed on Azure, AWS and/or Google Cloud
- Hands-on experience with relational databases (Postgres, SQL Server, MySQL) and non-relational systems such as Elasticsearch, Redis, and graph databases
- Experience with document-heavy or text-heavy data processing (OCR, parsing, NLP preprocessing)
- Strong data quality, governance, lineage, and validation mindset
- Excellent communicator who can align with ML, engineering, and product teams.
Benefits
- Dynamic environment
- Leadership opportunities
- Technical direction
- Mentorship roles
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Assist with an ongoing effort to converge legacy systems onto existing system’s Azure PaaS cloud environment. • Help architect a common data model and establish data pipelines. • Implement ETL solutions and create SQL views, stored procedures, and functions as needed. • Work with a team that follows the Scrum Agile framework. • Perform data engineering work including implementing ETL solutions and creating data architecture documentation. • Document architecture and SOPs, processes, data flows, and technical decisions for internal and client use. • Assist in designing and implementing ETL pipelines using Azure cloud tools including Data Factory and Logic Apps. • Support data ingestion from structured and unstructured sources.
Database/Data Warehouse Developer
Decision FoundryA Global, Salesforce Marketing Cloud Implementation Partner.
Welcome to Decision Foundry! Decision Foundry, an advisory-led, premier Salesforce Data Cloud delivery partner, bridges the gap between data access, platform adoption, and business impact. As a certified ISV and award-winning Salesforce integration partner, we offer global consulting services, integrating Data Cloud, Account, Engagement, Personalization, Sales, and Service solutions. We win as an organization through our core tenets. They include: · One Team. One Theme. · We sign it. We deliver it. · Be Accountable and Expect Accountability. · Raise Your Hand or Be Willing to Extend it About the role: We are seeking an experienced Database/Data Warehouse Developer for our client. The ideal candidate will be responsible for developing, and maintaining database solutions and data warehouse architectures to support our business operations and analytics needs. This role will involve collaborating closely with stakeholders to understand requirements, implementing robust ETL processes, optimizing database performance, and ensuring the integrity and security of our data infrastructure. The candidate will also be responsible for troubleshooting issues, fine-tuning database systems, and providing technical support as needed. Location: - US remote Responsibilities - Develop, and maintain database solutions and data warehouse architectures using MSSQL 2019/2022 and MySQL 8. - Collaborate with stakeholders to gather requirements and understand business needs for data storage and processing. - Develop and maintain ETL processes using SSIS, BIML, and DBT to ensure efficient data integration and transformation. - Implement and manage solutions within Power BI Fabric for enterprise-scale data analytics. - Write complex SQL queries, stored procedures, functions, and triggers to support application and reporting requirements. - Optimize query performance through indexing strategies, execution plan analysis, and database tuning techniques. - Troubleshoot and resolve database performance issues, data integrity problems, and system bottlenecks. - Monitor database health, perform capacity planning, and implement proactive maintenance strategies. - Develop automation scripts using PowerShell and Python to streamline database operations and maintenance tasks. - Ensure data security, backup and recovery procedures, and compliance with organizational standards. - Utilize version control tools such as Git and project management tools like Jira and Confluence or similar platforms as required. - Work effectively in a team environment, collaborating on concurrent developments and contributing to team success. - Document database architectures, ETL processes, standards, and procedures. - Stay current with database technologies, best practices, and emerging trends in data warehousing. - Provide technical guidance and support to team members and end-users as needed. - Contribute to continuous improvement initiatives related to data management and analytics infrastructure.
• Design efficient architectural recommendations, gain buy-in, and implement them in a reliable manner • Improve, oversee, and coach standards for code+infrastructure maintainability and performance through written documentation and peer review • Work closely with the analytics and business intelligence teams, as well as other stakeholders from finance, sales, marketing, and product, to identify the data needs of the business and produce processes that enable a better product and support growth decision-making • Champion and ensure data governance is a core component of engineering workflows • Help to evolve and scale our data platform, with an eye towards growth of our business and stability
• Design, configure, and implement data replication between SAP HCM and other SAP systems (e.g., SuccessFactors, EC Payroll, SAP S/4HANA). • Develop and maintain integration interfaces using SAP CPI, including iFlows, adapters, and mappings. • Implement, configure, and support both BIB (Business Integration Builder) and PTP (Point to Point) replication scenarios. • Lead and execute data migration and transformation activities for legacy-to-SAP and SAP-to-SAP HCM transitions. • Lead work in collaborating with other project team members on necessary data transformations to/from SAP SuccessFactors necessary for replication and data migration scenarios. • Analyze, map, and transform data between source and target systems ensuring accuracy, consistency, and completeness. • Work with stakeholders to understand functional requirements and translate them into scalable technical solutions for the project lifecycle as related to both replications and data. • Perform data validation, testing, and quality assurance in coordination with functional teams. • Provide integration and data support during cutover, go-live, and post-production phases. • Work with other project team members to execute cutover items related to data, replications, and migration per project phase. Including project planning with the project managers. • Document integration architecture, interface designs, and migration plans. • Ensure compliance with SAP best practices, security standards, and data governance policies.




