Craft.co is self-described as an enterprise intelligence company working to help organizations make data-informed business decisions. A growing company founded

Sr. Data Engineer (Poland)

Data EngineerData EngineerOther Remote Senior Company Site

Location

United States

Posted

152 days ago

Salary

Seniority

Senior

Bachelor Degree9 yrs expEnglishAI Airflow AWS CircleCI Databricks Docker Elasticsearch Git Pandas PostgreSQL PySpark Python SQL Terraform

Job Description

About Craft: Craft is the leader in supplier risk intelligence, enabling enterprises to discover, evaluate, and continuously monitor their suppliers at scale. Our unique, proprietary data platform tracks real-time signals on millions of companies globally, delivering best-in-class monitoring and insight into global supply chains. Our customers include Fortune 500 companies, government agencies, SMEs, and global service platforms. Through our configurable Software-as-a-Service portal, our customers can monitor any company they work with and execute critical actions in real-time. We’ve developed distribution partnerships with some of the largest integrators and software platforms globally. We are a post-Series B high-growth technology company backed by top-tier investors in Silicon Valley and Europe, headquartered in San Francisco with hubs in Seattle, London, and Warsaw. We support remote and hybrid work, with team members across North America and Europe. We're looking for innovative and driven people passionate about building the future of Enterprise Intelligence to join our growing team! About the Role: We’re growing quickly and looking to hire several senior-level data engineers for multiple teams. Each team is responsible for a key product within the organization. As a core member of the team, you will have great say in how solutions are engineered and delivered. Craft gives engineers a lot of responsibility and authority, which is matched by our investment in their growth and development. Our data engineers carry a lot of software engineering responsibilities, so we're looking for engineers who have strong Python coding experience, Pandas expertise, and solid software engineering practices. What You'll Do: Build and optimize data pipelines (batch and streaming). Extracting, analyzing and modeling rich and diverse datasets of structured and unstructured data Design software that is easily testable and maintainable. Support in setting data strategies and our vision. Keep track of emerging technologies and trends in the Data Engineering world, incorporating modern tooling and best practices at Craft. Work on extendable data processing systems that allows to add and scale pipelines. Apply machine learning techniques such as anomaly detection, clustering, regression classification, and summarization to extract value from our data sets. Leverage AI-powered development tools (e.g. Cursor) to accelerate development, refactoring, and code generation. Who You Are: Competitive salary at a well-funded, fast-growing startup PTO days so you can take the time you need to refresh! Full-time employees: 28 PTO days allotted + paid public holidays B2B contractors: 15 PTO days allotted + paid public holidays 100% remote work (or hybrid if you prefer! We have coworking space in center of Warsaw.) A Note to Candidates: We are an equal opportunity employer who values and encourages diversity, equity and belonging at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, caste, or disability status. Don’t meet every requirement? Studies have shown that women, communities of color and historically underrepresented talent are less likely to apply to jobs unless they meet every single qualification. At Craft, we are dedicated to building a diverse, inclusive and authentic workplace, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we strongly encourage you to apply. You may be just the right candidate for this or other roles! #BI-Remote

Job Requirements

4+ years of experience in Data Engineering.
4+ years of experience with Python.
Experience in developing, maintaining, and ensuring the reliability, scalability, fault tolerance, and observability of data pipelines in a production environment.
Have fundamental knowledge of data engineering techniques: ETL/ELT, batch and streaming, DWH, Data Lakes, distributed processing.
Strong knowledge of SDLC and solid software engineering practices.
Familiar with infrastructure-as-code approach.
Demonstrated curiosity through asking questions, digging into new technologies, and always trying to grow.
Strong problem solving and the ability to communicate ideas effectively.
Self-starter, independent, likes to take initiative.
Familiarity with at least some of the technologies in our current tech stack:
Python, PySpark, Pandas, SQL (PostgreSQL), ElasticSearch, Airflow, Docker
Databricks, AWS (S3, Batch, Athena, RDS, DynamoDB, Glue, ECS, Amazon Neptune)
CircleCI, GitHub, Terraform
Knowledge surrounding AI-assisted coding and experience with Cursor, Co-Pilot, or Codex
A strong track record of leveraging AI IDEs like Cursor to:
Rapidly scaffold components and APIs
Refactor legacy codebases efficiently
Reduce context-switching and accelerate documentation
Experiment and prototype with near-instant feedback
What We Offer:
Option to work as a
B2B contractor
or
full-time employee

Related Categories

Data Engineer

Related Job Pages

Remote Python Jobs (US)More Remote Jobs

More Data Engineer Jobs

Senior Data Engineer

Loopback Analytics

Where Health Systems & Life Sciences Connect.

Data Engineer153 days ago

Other RemoteTeam 51-200Since 2009H1B No Sponsor

Company Site LinkedIn

• Design and build data pipelines for collecting, processing, and storing healthcare clinical data, and continuously improve them. • Collaborate with data scientists, data integration specialists, analysts, and other stakeholders to understand data requirements and translate them into scalable solutions. • Implement data validation, cleansing, and transformation processes to ensure data accuracy and consistency. • Proactively monitor and troubleshoot pipeline performance, identifying and resolving issues before they become problems. • Ensure data security and compliance with industry regulations, including HIPAA. • Partner with the DevOps team to deploy and manage data pipeline infrastructure in our Azure cloud environment. • Evaluate and adopt emerging tools, frameworks, and techniques that can improve our data platform: bring recommendations, not just questions. • Leverage AI-assisted development tools and modern engineering workflows to accelerate delivery and improve code quality.

Apache HTTP Server AWS Azure ETL Hadoop Java Apache Kafka Python Scala Apache Spark SQL

View details: Senior Data Engineer

Texas

Apply

Job Closed

Senior Data Engineer – Machine Learning

Gorilla Logic

Agile. Unstoppable.

Data Engineer153 days ago

Full Time RemoteTeam 501-1,000H1B No Sponsor

Company Site LinkedIn

• Design and build end-to-end ML pipelines using GCP services (Vertex AI, BigQuery, Dataform) • Develop and productionize tabular ML models (e.g., XGBoost or similar) • Implement robust feature engineering pipelines with point-in-time correctness • Ensure reliable batch scoring workflows and production deployment • Partner with engineering and product stakeholders to translate business needs into ML solutions • Optimize data workflows for performance, scalability, and cost efficiency • Contribute to model evaluation, monitoring, and continuous improvement • Collaborate within a distributed team, ensuring clear communication and delivery alignment

BigQuery Google Cloud Platform Pandas Python Scikit-Learn

View details: Senior Data Engineer – Machine Learning

Colombia

Apply

Job Closed

Data Architect

KSM (Katz, Sapper & Miller)

Advisory, tax, and audit firm providing visionary people with inspiration and insight to achieve great things.

Data Engineer153 days ago

Other RemoteTeam 201-500Since 1923H1B No Sponsor

Company Site LinkedIn

• Define and maintain KSM’s Databricks lakehouse architecture, including ingestion, storage, transformation, modeling, and access patterns • Establish clear, repeatable design standards for Bronze, Silver, and Gold data layers to ensure consistency and reuse • Design architecture that supports structured, semi-structured, and unstructured data across enterprise systems • Design and implement data governance, lineage, and quality frameworks that ensure data is trusted, auditable, and scalable • Enable discoverability, access control, and metadata management using Databricks Unity Catalog and related tooling • Partner with data and analytics teams to define validation standards, reconciliation processes, and ownership models • Partner closely with the Senior Data Engineer to translate architectural standards into production-ready pipelines • Review and validate data pipelines, models, and workflows to ensure alignment with architectural best practices • Support teams by providing guidance, patterns, and examples rather than one-off solutions • Define best practices for Databricks compute and storage optimization, balancing performance and cost • Establish architectural patterns that promote reliability, scalability, and operational simplicity • Collaborate on monitoring, alerting, and operational standards to ensure platform health • Define and support CI/CD standards for data pipelines and lakehouse infrastructure • Partner with engineering teams to implement Infrastructure as Code (IaC) using tools such as Terraform • Ensure consistent, automated deployments across development, test, and production environments • Align data architecture decisions with firm-wide analytics, automation, and AI strategy • Help prioritize architectural investments that deliver near-term value while supporting long-term growth • Evaluate emerging data platform capabilities thoughtfully, focusing on practical adoption rather than experimentation for its own sake.

PySpark Python SQL Terraform Unity

View details: Data Architect

United States

Apply

Job Closed

Data Migration Engineer

Vineskills

We streamline your law firm with Filevine, so you can focus on what really matters.

Data Engineer153 days ago

Other RemoteTeam 1-10H1B No Sponsor

Company Site LinkedIn

• Lead end-to-end, non-recurring data and document migrations from legacy case management systems into Filevine. • Partner directly with clients to assess source systems, define migration strategy, and set clear expectations. • Design and build SQL queries and ETL processes to extract, transform, and load structured data. • Develop and document detailed data mappings between source databases and Filevine. • Cleanse, normalize, and validate data to ensure accuracy and integrity before and after migration. • Collaborate with clients to understand their data structures and migration needs. • Troubleshoot and resolve migration-related issues, escalating when necessary. • Conduct post-migration testing and quality assurance. • Maintain detailed documentation of migration processes, issues, and resolutions. • Provide support and training to clients and internal teams on data migration best practices. • Participate in code reviews and continuous improvement initiatives.

AWS ETL MySQL PostgreSQL Python SQL SSIS

View details: Data Migration Engineer

United States

$74.4K - $120K / year

Apply

Job Closed

Sr. Data Engineer (Poland)

Job Description

Job Requirements

Related Guides

Related Categories

Related Job Pages

More Data Engineer Jobs

Senior Data Engineer

Senior Data Engineer – Machine Learning

Data Architect

Data Migration Engineer