Job Closed

This listing is no longer active.

Growth Acceleration Partners logo
Growth Acceleration Partners

Consult • Design • Build • Modernize

Senior Data Platform Engineer (ClickHouse | OLAP)

Data EngineerData EngineerOtherRemoteTeam 501-1,000H1B No SponsorCompany SiteLinkedIn

Location

United States + 2 moreAll locations: United States | Colombia | Costa Rica

Posted

90 days ago

Salary

0

Job Description

Senior Data Platform Engineer (ClickHouse | OLAP)

Growth Acceleration Partners

WHAT WE DO Founded in 2007, Growth Acceleration Partners (GAP) is a consulting and technology services company. We consult, design, build and modernize revenue-generating software and data engineering solutions for clients. With modernization services and AI tools, we help businesses achieve a competitive advantage through technology. GAP’s remote, integrated engineering teams use end-to-end solutions to innovate and align with your business goals. We have 600+ English-speaking engineers based in Latin America and approximately 20 U.S.-based engineers. With some of the highest customer satisfaction scores in the industry, GAP’s focus is customer and employee success. GAP is a woman-owned company headquartered in Austin Texas. We are a values-based company focused on growing our people by investing in education, onsite English classes and training in the latest technologies, including AI, data analytics and machine learning. Our goal is to provide solutions for our customers that help them achieve critical business outcomes, while enabling our GAPSters and our communities to attain long-term success. Summary We are looking for a Senior Data Platform Engineer with deep expertise in large-scale analytical databases and real-time data platforms. In this role, you will lead the design, optimization, and scalability of high-performance OLAP systems that power analytics, AI pipelines, and intelligent connectivity platforms. You will architect and optimize distributed analytical databases such as ClickHouse while ensuring the platform can process multi-terabyte datasets efficiently. This role combines data architecture, performance engineering, and platform design, with a strong focus on real-time analytics and large-scale data processing. As a Senior Engineer, you will also guide engineering teams on best practices for data modeling, distributed query optimization, and system scalability, ensuring that data systems deliver reliable insights and support advanced AI-driven workloads. Education - Bachelor’s or Master’s Degree in Computer Science, Data Science, Engineering, or related field, or equivalent practical experience. Professional Experience - 5–10 years of experience building and operating data engineering or analytics platforms - Strong experience working with OLAP databases or analytical data systems - Experience designing distributed data systems handling multi-terabyte datasets - Proven experience optimizing high-performance analytical queries and data pipelines - Experience mentoring engineers and guiding data architecture decisions Key Responsibilities Data Platform Architecture - Architect and optimize large-scale ClickHouse or OLAP databases - Design distributed data systems capable of handling multi-terabyte datasets - Define best practices for analytical data models and query performance - Ensure high availability, scalability, and reliability of data systems Performance Optimization - Tune analytical queries for maximum performance and efficiency - Optimize storage structures, indexing strategies, and partitioning - Improve query latency for real-time analytics workloads - Monitor and address performance bottlenecks in distributed data systems Data Pipeline & Platform Integration - Integrate ClickHouse with analytics platforms, dashboards, and AI pipelines - Design efficient data ingestion and transformation pipelines - Support real-time analytics use cases across distributed systems - Enable downstream consumption by analytics and product teams Engineering Leadership - Mentor engineers on distributed data system design and performance optimization - Guide teams in building scalable analytics infrastructure - Contribute to architectural decisions across the data platform - Promote best practices for data architecture and data reliability Cross-Team Collaboration - Work closely with data scientists, analytics teams, and backend engineers - Ensure the data platform supports real-time analytics and AI-driven features - Communicate complex data architecture decisions clearly to technical teams Required Technical Skills Data Platforms & Databases - Deep experience with ClickHouse or similar OLAP databases - Experience with analytical databases such as Druid, DuckDB, or similar systems - Strong experience managing multi-terabyte data environments - Deep understanding of distributed data system architecture Data Engineering & Analytics Systems - Experience building and operating analytics platforms and data pipelines - Knowledge of high-performance query optimization - Experience designing analytical schemas for fast querying - Experience integrating data platforms with analytics or AI systems Nice to Have - Experience with real-time data streaming architectures - Experience working with vector databases or AI data pipelines - Experience with cloud-based data infrastructure - Experience supporting large-scale analytics platforms Soft Skills - Advanced English proficiency - Strong communication and collaboration skills - Builder mindset with strong curiosity and experimentation attitude - Ability to explain complex technical concepts clearly At Growth Acceleration Partners, we're an equal opportunity employer committed to building a diverse and inclusive team. We value everyone's unique background, and we provide equal opportunities regardless of race, color, creed, religion, sexual orientation, gender identity, age, national origin, disability, marital status, veteran status or any other personal right protected by law. We foster a culture of belonging and strive to provide a welcoming environment where everyone feels safe to contribute and grow.

Related Categories

Related Job Pages

More Data Engineer Jobs

Data Engineer Intern (Internship)

Abacus Insights

Abascus Insights is a technology company working to improve people's lives by "harnessing the healthcare data explosion" through intelligent integration software. Founded in 2017 b

Data Engineer90 days ago

About Us Abacus Insights is transforming how data works for health plans. Our mission is simple: make healthcare data usable, so the people responsible for care and cost decisions can act faster, with confidence. We help health plans break down data silos to create a single, trusted data foundation. That foundation powers better decisions —so plans can improve outcomes, reduce waste, and deliver better experiences for members and providers alike. Backed by $100M from top investors, we’re tackling big challenges in an industry that’s ready for change. Our platform enables GenAI use cases by delivering clean, connected, and reliable healthcare data that can support automation, prioritization, and decision workflows—and it’s why we are leading the way. Our innovation begins with people. We are bold, curious, and collaborative—because the best ideas come from working together. Ready to make an impact? Join us and let's build the future together. About the Role Our Data Distribution Engineering team is looking to bring on a Data Engineering Intern (post-grad) to be part of our fast-growing team. We see massive growth in our future and would love for you to be a part of it! You will partner with customers and several internal teams and extract data from the industry leading data platform which is built on highly scalable, flexible and resilient cloud architectures. As a trusted customer advocate, you will have the opportunity to guide a client’s successful adoption of Abacus’s core data management offerings. Your day to day: - Work with clients and implementation teams to understand the data distribution requirements. - Perform data analysis and data mapping required to produce client output. - Build the data distribution extracts and scripts. - Optimize the performance of the data extract scripts. - Foster a dynamic, high-preforming, and collaborative culture. - Guide team members to solve issues and help them to understand any challenges. - Spearhead clients’ projects and handle client calls. - Constantly think about process enhancements and bring different ideas into table. What you bring to the team: - 2+ years of experience working with healthcare data. - Understanding and passion for healthcare data and systems – knowledge on Membership, Claims, Provider and other domains. - Strong experience writing queries, performing data analysis, data profiling and reporting. - Performance optimization experience with understanding of how to fine tune the queries. - Experience handling a large volume of data. - Ability to communicate with technical and non-technical stakeholders. - Ability to seek better analytical solutions to abstract business problems. - Experience performing root cause analysis on internal and external data and processes to answer specific business questions and identify opportunities for improvement. - Experience, coupled with a willingness, to learn new things - Desire to contribute individually, with the ability both to self-direct work and to collaborate with others. - Strong SQL Programming skills. - Strong project management and organizational skills. What we would like to see, but not required: - Experience working with an AWS, Azure, Databricks, Spark. - Experience working with Snowflake. - Understanding or experience working for a cloud-based data services provider to large healthcare clients. Compensation: Compensation for this role is $30/hour or $62,400 per annum. What you’ll get in return: - Unlimited paid time off – recharge when you need it - Work from anywhere – flexibility to fit your life - Comprehensive health coverage – multiple plan options to choose from - Growth-focused environment – your development matters here #LI-RF1 #LI-Remote Our Commitment as an Equal Opportunity Employer As a mission-led technology company helping to drive better healthcare outcomes, Abacus Insights believes that the best innovation and value we can bring to our customers comes from diverse ideas, thoughts, experiences, and perspectives. Therefore, we dedicate resources to building diverse teams and providing equal employment opportunities to all applicants. Abacus prohibits discrimination and harassment regarding race, color, religion, age, sex, national origin, disability status, genetics, protected veteran status, sexual orientation, gender identity or expression, or any other characteristic protected by federal, state, or local laws. At the heart of who we are is a commitment to continuously and intentionally building an inclusive culture—one that empowers every team member across the globe to do their best work and bring their authentic selves. We carry that same commitment into our hiring process, aiming to create an interview experience where you feel comfortable and confident showcasing your strengths. If there’s anything we can do to support that—big or small—please let us know.

United States
Job Closed
OtherRemoteTeam 201-500

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description Genius is looking for a remote Senior Data Engineer to help build the ultimate music companion. Overseeing millions of pages of lyrics, annotations, and music metadata, your work will power Genius.com, our public API, and our partner integrations, with no shortage of interesting data challenges to solve. We're looking for engineers who thrive in complex backend systems: folks who can build robust data pipelines, optimize query performance at scale, and keep critical services running smoothly. As a Senior Data Engineer for Genius, you will play a key role in strengthening our data infrastructure, helping us process, transform, and serve the music knowledge that powers everything we do. You will work across our Rails backend, PostgreSQL databases, and Python-based data pipelines to improve data reliability, throughput, and accuracy. Additionally, you'll work to improve the stability, performance, and scalability of Genius' backend services, which is no small feat for a platform serving millions of users daily! Location Requirement: Candidates must be based in Los Angeles, California, or Seattle, Washington to be considered for this role. What You’ll Do - Build and maintain data pipelines using Python and Airflow to ingest, transform, and enrich music metadata from internal and external sources - Proactively identify and fix infrastructure bottlenecks to scale backend services to tens of thousands of requests per minute - Architect database query patterns and migrations in PostgreSQL, Clickhouse and BigQuery that scale to large tables with 1B+ rows - Design and implement backend APIs in Ruby on Rails that serve data reliably and performantly to our frontend and partner integrations - Take ownership over the systems you build, proactively identifying and surfacing performance, reliability, and maintainability improvements - During your on-call rotation, be the backstop for backend quality, stability, and performance. Triage incoming issues to find the most urgent problems, and handle emergent incidents to keep services online - Work directly with stakeholders to uncover and address business needs, including product owners, data analysts, and other engineers across the company Qualifications - Hands-on experience building and maintaining ETL/ELT pipelines using Python and workflow orchestration tools like Apache Airflow - Deep proficiency with PostgreSQL and relational databases - Strong experience with Ruby on Rails (or a similar "batteries-included" framework) - Brings a product-first mindset, able to drive projects from ideation to launch while focusing on data quality, system reliability, and cross-functional alignment - Comfortable working cross-functionally with frontend engineers, data analysts, product, and other teams - Demonstrates a passion for staying current with new technologies, frameworks, and industry best practices Requirements - At least 4 years of hands-on experience in a backend or data engineering capacity, preferably in a product-driven environment Education (Preferred) - Bachelor's degree in Computer Science, Engineering, or a related technical field

United States
Job Closed
Growe Talents logo

Data Engineer

Growe Talents

Your growth starts here.

Data Engineer90 days ago
Full TimeRemoteTeam 11-50H1B No Sponsor

• Design, develop, and maintain scalable data pipelines using Airflow; • Create and optimize tables in Athena, ensuring efficient query performance and cost-effectiveness; • Write and manage SQL transformations in DBT, building reusable and well-documented models; • Optimize data workflows for performance, reliability, and cost-efficiency; • Automate infrastructure provisioning using Terraform for data processing environments; • Ensure data integrity and consistency through monitoring, validation, and error handling; • Collaborate with analytics, BI, and engineering teams to understand data needs and deliver solutions; • Troubleshoot and resolve data pipeline issues in a timely manner.

Worldwide
Job Closed
CACI International Inc logo

Data Engineer

CACI International Inc

Expertise and Technology for National Security

Data Engineer91 days ago
OtherRemoteTeam 10,001+Since 1962H1B No Sponsor

Job Title: Data Engineer Job Category: Information Technology Time Type: Full time Minimum Clearance Required to Start: Public Trust Employee Type: Regular Percentage of Travel Required: None Type of Travel: None * * * CACI seeks a data engineer that will be responsible for designing, building, validating, and maintaining scalable data pipelines and analytics solutions with a strong emphasis on Databricks and data quality. This role partners closely with our product owners, analysts, data scientists and software engineers to translate business and technical requirements into reliable, testable, and high-quality data solutions to support programmatic goals. Responsibilities: Design, develop, optimize and maintain scalable data pipelines and transformations using Databricks, Apache Spark and SQL. Impelement data ingestion, transformation, and orchestration workflows to support back and where applicable real-time processing. Perform data quality assurance activities, including identifying and resolving any inconsistencies in data flow, data outside legitimate ranges, and illogical data responses by developing data quality reports and investigation and resolution of data anomalies or errors by using a combination of software packages including SAS, Excel, and other software as warranted. Use technical expertise, initiative, creativity, critical thinking, and strong communication and interpersonal skills daily to solve data quality problems in support of technical development efforts. Implement data quality controls to ensure accuracy, completeness, and reliability of datasets. Document data pipelines, transforms, business rules and data dependencies using appropriate technical documentation methods (e.g., data flow diagrams, data dictionaries, etc.). Serve as liaison and coordinate with a multi-disciplinary team. Collaborate with the program team to identify opportunities for process improvements, making strategic adjustments, and exploit opportunities focused on maximizing programmatic impact. Communicate data issues, risks, and remediation approaches clearly to technical and non-technical team members. Qualifications: Required: Must be able to obtain a Public Trust clearance. Bachelor’s degree in Computer Science, Data Engineering, Information Systems, or a related technical field (or equivalent experience) Demonstrated experience as a Data Engineer in a production environment Strong hands-on experience with Databricks, including Spark-based data processing Proficiency in SQL and at least one programming language such as Python Excellent communication skills: listening, writing, and experience interacting comfortably with scientists, epidemiologists, informaticians and developers. Experience supporting analytics, reporting or machine learning workloads. These Qualifications Would Be Nice to Have - Experience supporting public health, healthcare, or government data systems - Knowledge of data governance, data quality frameworks, or metadata management - Experience working with large-scale analytics or reporting environments - Familiarity with Power BI or other business intelligence tools - Prior experience supporting multiple teams or programs simultaneously (the “steady hand” type) - Exposure to Agile or iterative delivery environments. - Prefer candidate to be in the Atlanta, Georgia area. - What You Can Expect: A culture of integrity. At CACI, we place character and innovation at the center of everything we do. As a valued team member, you’ll be part of a high-performing group dedicated to our customer’s missions and driven by a higher purpose – to ensure the safety of our nation. An environment of trust. CACI values the unique contributions that every employee brings to our company and our customers - every day. You’ll have the autonomy to take the time you need through a unique flexible time off benefit and have access to robust learning resources to make your ambitions a reality. A focus on continuous growth. Together, we will advance our nation's most critical missions, build on our lengthy track record of business success, and find opportunities to break new ground — in your career and in our legacy. Pay Range: There are a host of factors that can influence final salary including, but not limited to, geographic location, Federal Government contract labor categories and contract wage rates, relevant prior work experience, specific skills and competencies, education, and certifications. Our employees value the flexibility at CACI that allows them to balance quality work and their personal lives. We offer competitive compensation, benefits and learning and development opportunities. Our broad and competitive mix of benefits options is designed to support and protect employees and their families. At CACI, you will receive comprehensive benefits such as; healthcare, wellness, financial, retirement, family support, continuing education, and time off benefits. Since this position can be worked in more than one location, the range shown is the national average for the position. The proposed salary range for this position is: $63,300-$129,700 CACI is an Equal Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, pregnancy, sexual orientation, age, national origin, disability, status as a protected veteran, or any other protected characteristic.

United States
$63.3K - $129K / year
Job Closed