Job Closed

This listing is no longer active.

Rimini Street logo
Rimini Street

Extraordinary technology solutions powered by extraordinary people

AI Data Engineer

Data EngineerData EngineerFull TimeRemoteMid LevelTeam 1,001-5,000Since 2005H1B SponsorCompany SiteLinkedIn

Location

India

Posted

32 days ago

Salary

0

Seniority

Mid Level

Job Description

AI Data Engineer

Rimini Street

Role Description The AI Data Engineer is responsible for building the knowledge layer of Rimini Street’s Agentic ERP Platform—the data pipelines, RAG (Retrieval-Augmented Generation) systems, and embedding infrastructure that give AI agents access to the right information at the right time. This role owns how knowledge is ingested, processed, indexed, and retrieved to support intelligent agent behavior. Reporting to the Sr. Director, Engineering, this engineer designs the data architecture that powers agent intelligence—from extracting knowledge from Rimini Street’s 15+ years of support case history to building real-time retrieval systems for customer-specific context. The ideal candidate combines strong data engineering fundamentals with modern AI/ML knowledge, particularly in embeddings, vector search, and retrieval optimization. Qualifications - 2-3 years of data engineering experience, with at least 1-2 years focused on AI/ML data pipelines or RAG systems. - Hands-on experience building and optimizing RAG pipelines in production environments. - Strong experience with vector databases, embeddings, and similarity search. - Experience with ETL/ELT pipelines and data integration from diverse source systems. - Production experience with PostgreSQL and SQL-based data processing. - Background in Python for data processing and pipeline development. - Experience with enterprise data platforms (Snowflake, Databricks, or similar) preferred. - Exposure to enterprise software, ERP systems, or support/ticketing systems preferred. Requirements - Implement and manage vector storage using PostgreSQL with pgvector extension, including index optimization for search performance. - Evaluate and select embedding models appropriate for enterprise content (technical documentation, business processes, ERP terminology). - Build embedding pipelines that process documents at scale with appropriate batching and error handling. - Implement incremental indexing strategies for real-time updates without full reprocessing. - Design multi-tenant vector architectures that isolate customer data while enabling efficient search. - Monitor and optimize vector search performance: latency, accuracy, and resource utilization. - Build data pipelines to ingest knowledge from diverse sources: Salesforce support tickets, ServiceNow cases, documentation repositories, email archives, and ERP transaction logs. - Implement ETL processes that clean, normalize, and enrich raw data for AI consumption. - Develop document processing pipelines: PDF extraction, HTML parsing, structured data normalization. - Implement data quality monitoring and alerting for ingestion pipelines. - Design the knowledge architecture that organizes information across the Four-Spoke model: Policy Intelligence, Institutional Memory, Rimini Collective Intelligence, and Intelligent Escalation. - Integrate with Snowflake for large-scale data processing, leveraging Cortex AI capabilities where applicable. Benefits - Compensation, bonuses, and benefits to match the skills of top-performing team members. - Opportunity to work in a fast-paced environment and grow together. - Challenging and meaningful work with a sense of achievement and purpose. Company Description Rimini Street, Inc. (Nasdaq: RMNI) is a global provider of end-to-end, mission-critical enterprise software support, managed services, and innovative Agentic AI ERP solutions. The company has signed thousands of contracts with Fortune Global 100, Fortune 500, midmarket, public sector, and government organizations. - Established operations in Hyderabad in 2013 with Global Client Onboarding Services, IT shared services, and Global Service Development. - Gained valuable share in bringing the reputation to Rimini Street Inc of being a global provider of unified support and managed service solutions for enterprise software. - Today, Rimini Street, India GCC is a family of about 800+ full-time talented individuals.

Related Categories

Related Job Pages

More Data Engineer Jobs

Cormac Corporation logo

Data Lead/SME

Cormac Corporation

At CORMAC, we leverage the power of data management and analytics to enable our customers to achieve their strategic goals. With over 20 years of experience in health information technology (HIT), human-centered design principles, and Agile development methodologies, CORMAC delivers complex digital solutions to solve some of the most challenging problems facing public healthcare programs today.

Data Engineer32 days ago

Role Description CORMAC is seeking a Data Lead / SME to be responsible for providing deep functional and technical expertise on Centers for Medicare & Medicaid Services (CMS) data domains, business processes, and system relationships across the enterprise data platform. Serves as the primary expert for understanding how CMS data sources, programs, workflows, and reporting requirements connect across claims, enrollment, provider, beneficiary, and operational systems to support analytics, compliance, and decision-making. - Serve as the primary SME for CMS data domains, including claims, eligibility, enrollment, provider, beneficiary, financial, quality, and operational datasets. - Provide expertise on how CMS systems, programs, and data flows interact across enterprise platforms and downstream reporting environments. - Support data mapping, source-to-target analysis, and business rule validation for data ingestion, transformation, and reporting initiatives. - Work closely with data architects, system architects, business analysts, and engineering teams to ensure accurate interpretation and use of CMS data. - Define and validate business logic, data lineage, and cross-system relationships for enterprise analytics and reporting solutions. - Assist with metadata management, data catalog governance, and stewardship activities to improve data discovery and usability. - Support data quality assessments, reconciliation efforts, and root cause analysis for data discrepancies and reporting issues. - Provide guidance on CMS program requirements, policy impacts, and operational dependencies affecting data architecture and analytics. - Collaborate with stakeholders to translate complex business and policy requirements into actionable technical and reporting requirements. - Support user adoption of data platforms such as Databricks, reporting portals, and metadata catalog solutions. - Participate in governance reviews, audits, compliance activities, and documentation efforts related to federal healthcare data management. - Develop business process documentation, data dictionaries, source system documentation, and knowledge transfer materials. Qualifications - Bachelor’s degree in Health Informatics, Information Systems, Public Health, Business, or related field. - Must be U.S. Citizen. - Must be eligible to obtain a Public Trust (PT) Clearance. - 5+ years of experience working with CMS programs, operations, and enterprise healthcare data environments. - Prior experience supporting Centers for Medicare & Medicaid Services programs, federal healthcare contracts, or Medicare/Medicaid operations. - Deep understanding of claims, provider, enrollment, beneficiary, and healthcare financial data structures. - Experience working with federal healthcare systems, reporting environments, and regulatory requirements. - Strong understanding of data lineage, metadata, business rules, and enterprise data governance. - Ability to interpret policy, operational processes, and system dependencies across multiple CMS domains. - Experience supporting analytics platforms such as Databricks, data warehouses, and reporting tools. - Strong stakeholder engagement, communication, and cross-functional collaboration skills. - Ability to bridge business operations and technical implementation teams. Preferred Skills & Experience - Master’s degree in Health Informatics, Information Systems, Public Health, Business, or related field. - Familiarity with metadata catalog tools, enterprise data governance platforms, and reporting solutions such as Amazon QuickSight, Power BI, or Tableau. Location Leesburg, VA Work Arrangement 100% Remote Why CORMAC? At CORMAC, we leverage the power of data management and analytics to enable our customers to achieve their strategic goals. With over 20 years of experience in health information technology (HIT), human-centered design principles, and Agile development methodologies, CORMAC delivers complex digital solutions to solve some of the most challenging problems facing public healthcare programs today. As a US Federal Government contractor in the public healthcare sector, our work is impactful and cutting-edge while being performed in a supportive, collaborative, and welcoming environment. We offer flexible work schedules with remote, hybrid, or fully in-person workplace options to empower our employees to decide the workplace most suitable for them. At CORMAC, we have a highly diverse workforce and believe a work environment is a place where creativity, collaboration, enthusiasm, and innovation happen, regardless of location. Equal Employment Opportunity As an Equal Employment Opportunity employer, CORMAC provides equal employment opportunity to all employees and applicants without regard to an individual's protected status, including race/ethnicity, color, national origin, ancestry, religion, creed, age, gender, gender identity/expression, sexual orientation, marital status, parental status, including pregnancy, childbirth, or related conditions, disability, military service, veteran status, genetic information, or any other protected status.

United States
Job Closed
Data Engineer32 days ago
Full TimeRemoteTeam 5,001-10,000Since 1995H1B No Sponsor

• Design and implement a Data Lake and Lakehouse architecture from scratch, defining the layer structure (Bronze, Silver and Gold / Medallion Architecture), the logical and physical organization of data, and ensuring the solution’s scalability and resilience; • Assess and recommend the most appropriate technology stack for the client context (including Microsoft Fabric, Databricks and/or Snowflake), taking into account volume, complexity and governance requirements; • Define and implement the data governance framework — covering cataloging, classification and lineage — establishing a unified data catalog as the central management element of the platform; • Plan integration strategies to ingest data from unstructured sources (spreadsheets, SharePoint, legacy systems), defining ETL/ELT standards and loading strategies (full load or incremental as required); • Establish technical standards for data modeling, data contracts and development best practices to guide the engineering team; • Evaluate data volume and characteristics to recommend the most suitable processing platform; • Guide and direct the team in implementing the defined solutions, conducting technical reviews and ensuring adherence to architectural decisions; • Translate technical decisions into clear, accessible documentation.

Brazil
Job Closed
Capital Bank, N.A. logo

Senior Data Engineer

Capital Bank, N.A.

Capital Bank, N.A. is an Affirmative Action, E-Verify, and Equal Opportunity Employer.

Data Engineer32 days ago

Role Description As part of the Technology team, the Senior Data Engineer will act as a subject matter expert (SME) responsible for managing, designing, and optimizing enterprise data pipelines and database solutions that support all lines of business at Capital Bank, including our Commercial Bank, Consumer Card division, and Mortgage division. This role owns end-to-end delivery and operations across our primary technology stack (Azure, SQL Server, and Snowflake), ensuring scalable, secure, and high-performance data platforms, reliable integrations with enterprise systems, and strong data governance. The Senior Data Engineer will build and optimize ETL/ELT processes, data lake and data warehouse solutions, and data transfer/file processing workflows to enable trusted analytics and reporting across the bank. Qualifications - Bachelor’s degree or higher in Computer Science, Information Systems, or a related field. - 6+ years of experience in data engineering, ETL, database management with experience in cloud-based databases and financial services preferred. - Experience in Database Administration (DBA), managing and optimizing databases for performance and security. - 3+ years of experience designing and building data lakes and data warehouses using platforms like Azure Fabrik, Snowflake, Amazon Redshift, or Google BigQuery. - 2+ years of experience using data visualization tools like Power BI, Sisense, Google Looker, Tableau, or similar platforms. - Experience with managing data transfers and file processes, including SFTP, secure data pipelines, and real-time or batch data movement. - Experience in proactively identifying, addressing, and monitoring data quality issues, adhering to established data quality standards. - Excellent communication skills and the ability to collaborate effectively across teams and stakeholders, explain technical concepts to non-technical stakeholders. Requirements - Expertise in cloud-based database platforms, such as Azure SQL, Amazon RDS, Google Cloud Spanner, and Snowflake. - Strong knowledge of data lake and data warehouse architectures, including designing efficient schemas, partitioning strategies, and optimizing storage. - Proficiency with data integration tools and technologies, such as Apache Kafka, Apache Spark, Talend, or Informatica. - Hands-on experience building and maintaining ETL pipelines to support large-scale data environments. - Advanced SQL skills and familiarity with programming languages like Python or Java for data manipulation and automation. - Experience with data visualization platforms, including building and optimizing dashboards using Sisense, Power BI, Google Looker, Tableau, or similar tools. - Experience with CI/CD tools (e.g., GitLab, Azure DevOps, Jenkins) and data pipeline monitoring tools (e.g., Airflow, Apache NiFi, Azure Data Factory). - Strong understanding of database security best practices, including encryption, access controls, and compliance with regulatory standards. - Ability to manage data file transfers and processing workflows effectively. - Experience with database monitoring and performance tuning in cloud and hybrid environments. - Preferred experience in understanding data relationships, as well as developing and optimizing SQL queries, stored procedures, and database functions in both OLTP and OLAP systems. - Strong organizational and problem-solving skills in Agile or fast-paced environments. Benefits - Base Salary Range: $115,000 - $130,000 annually. - This role will include a yearly annual target bonus based on individual performance. - This is a 100% remote role. - Comprehensive benefits package including Medical, Dental, Vision, Company Paid Life Insurance, Disability Insurance, and more. - Company Contributions to your 401k - Regardless of your contribution. - Employee Perks: Paid Parental Leave, Employee Recognition Program, Leadership Program, Tuition Reimbursement Program, Employee Bank Checking Account, and much more! - Generous Paid Time Off and Paid Holidays - Including Paid Charity Hours to support volunteer opportunities.

Maryland + 9 moreAll locations: Maryland | District Of Columbia | Virginia | Pennsylvania | Delaware | Indiana | Illinois | South Carolina | North Carolina | Florida
$115K - $130K / year
Jusbrasil logo

Senior Data Engineer – Vaga Afirmativa para PCD

Jusbrasil

💻 Descomplicamos o acesso à informação jurídica por meio da tecnologia

Data Engineer32 days ago
Full TimeRemoteTeam 201-500H1B No Sponsor

• Projetar e manter pipelines de ingestão e transformação de dados em larga escala (batch e streaming). • Garantir a qualidade, confiabilidade e integridade dos dados ao longo de todo o ciclo (testes, versionamento, monitoramento). • Implementar e evoluir arquitetura de dados em GCP (BigQuery, Cloud Storage, Pub/Sub, Composer). • Colaborar com equipes de Analytics, Produto e Engenharia para definir necessidades de dados e traduzir requisitos de negócio em soluções técnicas. • Participar da evolução das boas práticas de engenharia de dados e padronização interna de modelos, nomenclaturas e monitoramentos.

Brazil