Job Closed

This listing is no longer active.

CreatorIQ

The most trusted software to unify and power advanced influencer marketing for the world’s most innovative enterprises

Senior Data Engineer, Reporting

Data EngineerData EngineerFull Time Remote SeniorTeam 201-500Since 2014H1B SponsorCompany Site LinkedIn

Location

Poland

Posted

92 days ago

Salary

zł289K - zł325K / year

Seniority

Senior

Bachelor Degree5 yrs expEnglishAirflow Apache ETL Numpy Pandas Python SQL

Job Description

• Design and implement ETL pipelines migrating from transactional databases to analytical data warehouses • Create real-time data ingestion systems processing campaign data, user metrics, and business intelligence. • Build multi-tenant data models with proper partitioning strategies for enterprise-scale clients. • Develop data quality frameworks with comprehensive validation, monitoring, and alerting. • Implement Row-Level Security (RLS) and Role-Based Access Control (RBAC) in analytical databases • Design dynamic permission models supporting organization-level and division-level data access • Build session-based context management for secure multi-tenant queries • Create comprehensive audit trails and access logging for compliance requirements • Design database schemas with advanced partitioning and indexing strategies • Build materialized views and aggregated tables for real-time analytics • Implement query optimization, data skipping, and compression techniques • Handle high-concurrency embedded dashboard usage with sub-second query performance • Build dashboard data sources with optimized SQL transformations • Handle complex data structures and parsing requirements • Create flat, denormalized tables optimized for embedded analytics consumption • Implement custom field handling for tenant-specific metadata requirements

Job Requirements

5+ years of data engineering experience with production-scale systems
Expert-level SQL skills with analytical databases (columnar databases preferred)
Strong Python programming with data libraries: pandas, numpy, pyarrow
Experience with ETL orchestration tools: Apache Airflow, Prefect, dbt, or similar
Deep understanding of analytical databases, partitioning strategies, and OLAP optimization
Experience building SaaS data platforms with tenant isolation requirements
Knowledge of Row-Level Security (RLS) implementation in analytical databases
Understanding of RBAC patterns and session-based access control
Experience with authentication flows in data systems
Familiarity with compliance requirements (SOC2, GDPR) for multi-tenant data.

Benefits

26 days vacation
Floating and set holidays
Wellness allowance
Paid parental leave
Medical insurance
Life insurance
Business travel insurance
Stock options as part of our equity-sharing program.
Comprehensive perks program providing stipends for cell phone and internet, home office setup, mental wellness, professional development and tuition reimbursement, plus occasional company-funded meal opportunities throughout the year.

Related Categories

Data Engineer

Related Job Pages

Remote Full-time Jobs (US)Remote Python Jobs (US)More Remote Jobs

More Data Engineer Jobs

Ingeniero de Datos

Talycap Global

Data Engineer92 days ago

Full Time RemoteTeam 51-200

🧠 ¡Buscamos Ingeniero de Datos! 🧠 ¿Te apasiona trabajar con grandes volúmenes de datos y proyectos en la nube? 🎯 Objetivo del rol: Participar en proyectos de migración y desarrollo en entornos cloud, utilizando Azure Synapse y PySpark para garantizar soluciones eficientes y escalables. 🎓 Requisitos del perfil - Formación académica: Ingeniero de Sistemas o carreras afines. - Experiencia: Mínimo 2 años en proyectos en la nube con Azure Synapse y desarrollo de notebooks con PySpark. - Conocimientos indispensables: - PySpark (desarrollo y migración de código desde otras plataformas). - Azure Synapse. - SQL avanzado. - Git (control de versiones). - Creación y gestión de pipelines. 💡 Competencias personales - Trabajo en equipo. - Comunicación efectiva. - Adaptabilidad. - Orientación a resultados. - Capacidad analítica. ⚙️ Responsabilidades principales - Desarrollar y mantener notebooks en PySpark. - Ejecutar proyectos de migración de código hacia PySpark. - Implementar y optimizar pipelines de datos. - Colaborar con equipos multidisciplinarios para garantizar la calidad del proyecto. - Documentar procesos y buenas prácticas. 🕒 Condiciones laborales - 📍 Ubicación: Colombia (Remoto). - ⏰ Horario: Tiempo completo. - 💰 Salario: A convenir 🌐 TALYCAP GLOBAL – Conectamos el mejor talento IT con proyectos de alto impacto. #IngenieroDeDatos #AzureSynapse #PySpark #TalycapGlobal

Azure PySpark SQL Git

View details: Ingeniero de Datos

Colombia

Apply

Senior Data Engineer

Certifid, Inc

Data Engineer92 days ago

Full Time Remote

Role Description Cybercrime is rising, reaching record highs in 2024. According to the FBI's IC3 report, total losses exceeded $16 billion. With investment fraud and BEC scams at the forefront, the message is clear: the real estate sector remains a lucrative target for cybercriminals. At CertifID, we take this threat seriously and provide a secure platform that verifies the identities of parties involved in transactions, authenticates wire transfer instructions, and detects potential fraud attempts. Our technology is designed to mitigate risks and ensure that every transaction is conducted with confidence and peace of mind. We know we couldn’t take on this challenge without our incredible team. We have been recognized as one of the Best Startups to Work for in Austin, made the Inc. 5000 list, and won Best Culture by Purpose Jobs two years in a row. We are guided by our core values and our vision of a world without wire fraud. We offer a dynamic work environment where you can contribute to meaningful impact and be part of a team dedicated to enhancing security and fighting fraud. CertifID is the wire fraud prevention platform protecting real estate closings. Every transaction we secure generates data: identity signals, verification events, behavioral patterns, payment flows. That data is how we detect fraud, how our customers measure risk, and how the business operates. You will own the systems that make it trustworthy, fast, and useful. What You'll Do - Data platform and pipeline engineering - Design, build, and operate the core data infrastructure: data lake, warehouse, orchestration, observability, and governance, using declarative configuration and infrastructure as code (Terraform or equivalent) so the platform is reproducible and auditable. - Partner with platform and domain teams to design ingestion pipelines and implement declarative configuration for data sources across the stack. - Architect the transformation layer: dimensional models, aggregation strategies, and incremental materialization patterns that balance query performance against pipeline cost at scale. - Own streaming and near-real-time data flows for fraud signal propagation, transaction status events, and verification webhooks, with the reliability expectations those require. - Build for scale: partition strategies, clustering, late-arriving data handling, and backfill patterns that hold up when data volume doubles. - Business outcome ownership - Own the source-of-truth models for the metrics the business runs on: ARR, NRR, churn, transaction volume, fraud detection rates, customer health scores, and operational throughput. - Make the numbers defensible: when a business leader challenges a metric, you can walk them through exactly how it is calculated, what is excluded, and why. - Partner with Product, Finance, CS, and GTM to translate business questions into data models and help teams measure what actually matters. - Engineering craft and standards - Write production-grade Python and SQL: modular, tested, version-controlled, and reviewable by someone who was not in the room when you wrote it. - Implement CI/CD pipelines for data systems: automated testing, schema change detection, data contract validation, deployment gates, and cost optimization and performance tuning as ongoing practice, not one-time projects. Qualifications - 6+ years in data engineering with primary, end-to-end ownership of a production data platform, not a supporting role on a large team. - Direct experience designing and operating streaming or near-real-time pipelines (Kafka, Kinesis, Pub/Sub, Flink, or equivalent) at production scale, including debugging failures under load. - Hands-on production experience with cloud-based data platforms (Snowflake, BigQuery, Redshift, Databricks, or equivalent) and a production-grade orchestrator (Airflow, Dagster, Prefect, or equivalent). Requirements - Expert SQL and distributed systems: window functions, recursive CTEs, query plan analysis, query concurrency management, and optimization strategies that go beyond adding an index. - Strong Python for data engineering: production-quality pipeline code with error handling, idempotency, retry logic, and test coverage; Go is a meaningful plus. - Dimensional modeling mastery: you understand the tradeoffs between normalized and denormalized designs, when SCDs are the right tool, and how incremental strategies affect downstream query semantics. - Event-driven architecture fundamentals: exactly-once semantics, consumer group management, backpressure handling, offset management, and the operational realities of keeping a streaming pipeline healthy. - Warehouse internals: clustering keys, materialized views, partition pruning, and cost optimization strategies that keep query costs from compounding as data volume grows. What Sets You Apart - You instrument, measure, and verify that your work produced the outcome it was supposed to. - You make architectural decisions independently, communicate outwardly, and document the reasoning so the decision survives you. - You have joined teams where the data was a mess, and you shipped before the situation was fully resolved, because waiting for perfection was not an option. Benefits - Flexible vacation. - 12 company-paid holidays. - 10 paid sick days. - No work on your birthday. - Health, dental, and vision Insurance (including a $0 option). - 401(k) with matching, and no waiting period. - Equity. - Life insurance. - Generous parental paid leave. - Wellness reimbursement of $300/year. - Remote worker reimbursement of $300/year. - Professional development reimbursement. - Competitive pay. - An award-winning culture.

Observability/Monitoring Infrastructure as Code Terraform Webhooks Google Tag Manager Python SQL CI/CD Data Engineering Apache Kafka Amazon Kinesis Apache Flink Snowflake BigQuery Amazon Redshift Databricks Airflow Distributed Systems

View details: Senior Data Engineer

United States

Apply

Job Closed

Senior AI Data Engineer

Sunrise Robotics Corporation

Data Engineer92 days ago

Full Time Remote

Role Description At Sunrise Robotics, this role owns the data platform that powers our robotics, AI, and deployment systems. You’ll design and build the infrastructure that handles large-scale, multi-modal data - from ingestion and storage to processing and access. This includes video, sensor data, and robot telemetry used across training, simulation, and production systems. Your work will enable teams to reliably collect, version, and use data to improve system performance and accelerate development. This is a hands-on role focused on building robust, scalable systems that support real-world robotics deployments. What You’ll Do - Design and build data pipelines for ingesting, processing, and serving multi-modal data (video, images, sensor data) - Own and evolve our data platform, ensuring reliability, scalability, and ease of use across teams - Implement data storage and processing systems on AWS (e.g. S3, serverless pipelines, data lake architecture) - Build systems for data versioning, lineage, and reproducibility - Optimise data handling for large-scale, memory-intensive workloads - Enable efficient data access for AI, simulation, and deployment teams - Collaborate with AI, robotics, and infrastructure teams to align data systems with real-world use cases - Contribute to monitoring, observability, and tooling for data quality and system performance Qualifications - Strong Python engineering skills, including data structures and efficient data handling - Experience working with large, multi-modal datasets (e.g. video, images, sensor data) - Experience building and operating data infrastructure on AWS (e.g. S3, serverless processing, data lakes) - Experience with data versioning and lineage systems (e.g. DVC, LakeFS, Pachyderm, or similar) - Experience with asynchronous programming and memory-efficient processing - Experience with data pipeline orchestration tools (e.g. Airflow, Dagster, Prefect, or similar) - Familiarity with MLOps workflows and tools (e.g. MLflow, ClearML, ZenML, or similar) - Experience with infrastructure-as-code tools (e.g. Terraform, Pulumi, CloudFormation) - Experience working with databases (e.g. SQLite, MongoDB, Cassandra) Requirements - Experience working with robotics or other real-time, sensor-driven systems - Experience with streaming data and edge data processing (e.g. Jetson, Coral, NXP) - Experience with synthetic data generation (e.g. Unity, Isaac Sim) - Experience with data visualisation and monitoring tools (e.g. Grafana, Plotly, Rerun) - Familiarity with ROS/ROS2 or robotics frameworks - Experience working with telemetry data (e.g. logs from physical systems) Benefits - High exposure: Work closely with robotics, AI, and infrastructure teams on core systems - Career acceleration: Build and own the data platform in a fast-scaling robotics company - Real impact: Your work will directly enable how data is used to train, evaluate, and improve systems running in production

AI AWS Amazon S3 Observability/Monitoring Python Airflow MLflow Terraform Pulumi SQLite MongoDB Cassandra Unity Grafana

View details: Senior AI Data Engineer

Slovenia

Apply

Data Engineer

CenterWell Senior Primary Care

Data Engineer92 days ago

Full Time RemoteTeam 1,001-5,000H1B No Sponsor

Company Site LinkedIn

• develop scalable data processing architectures and solutions for data transformation • ingest data from internal and external sources using cloud-native platforms • apply industry-standard software development practices and patterns • use AI, machine learning, and big data to enhance service delivery • help shape departmental strategy and decisions regarding technical approaches

Cloud Python SQL

View details: Data Engineer

North America

$97.9K - $133.5K / year

Apply

Job Closed

Senior Data Engineer, Reporting

Job Description

Job Requirements

Benefits

Related Guides

Related Categories

Related Job Pages

More Data Engineer Jobs

Ingeniero de Datos

Senior Data Engineer

Senior AI Data Engineer

Data Engineer