Craft.co is self-described as an enterprise intelligence company working to help organizations make data-informed business decisions. A growing company founded in 2015, Craft.co is
Sr. Data Engineer (Poland)
Location
United States
Posted
104 days ago
Salary
$0
Seniority
Senior
Job Description
Sr. Data Engineer (Poland)
Craft.co
About Craft: Craft is the leader in supplier risk intelligence, enabling enterprises to discover, evaluate, and continuously monitor their suppliers at scale. Our unique, proprietary data platform tracks real-time signals on millions of companies globally, delivering best-in-class monitoring and insight into global supply chains. Our customers include Fortune 500 companies, government agencies, SMEs, and global service platforms. Through our configurable Software-as-a-Service portal, our customers can monitor any company they work with and execute critical actions in real-time. We’ve developed distribution partnerships with some of the largest integrators and software platforms globally. We are a post-Series B high-growth technology company backed by top-tier investors in Silicon Valley and Europe, headquartered in San Francisco with hubs in Seattle, London, and Warsaw. We support remote and hybrid work, with team members across North America and Europe. We're looking for innovative and driven people passionate about building the future of Enterprise Intelligence to join our growing team! About the Role: We’re growing quickly and looking to hire several senior-level data engineers for multiple teams. Each team is responsible for a key product within the organization. As a core member of the team, you will have great say in how solutions are engineered and delivered. Craft gives engineers a lot of responsibility and authority, which is matched by our investment in their growth and development. Our data engineers carry a lot of software engineering responsibilities, so we're looking for engineers who have strong Python coding experience, Pandas expertise, and solid software engineering practices. What You'll Do: Build and optimize data pipelines (batch and streaming). Extracting, analyzing and modeling rich and diverse datasets of structured and unstructured data Design software that is easily testable and maintainable. Support in setting data strategies and our vision. Keep track of emerging technologies and trends in the Data Engineering world, incorporating modern tooling and best practices at Craft. Work on extendable data processing systems that allows to add and scale pipelines. Apply machine learning techniques such as anomaly detection, clustering, regression classification, and summarization to extract value from our data sets. Leverage AI-powered development tools (e.g. Cursor) to accelerate development, refactoring, and code generation. Who You Are: Competitive salary at a well-funded, fast-growing startup PTO days so you can take the time you need to refresh! Full-time employees: 28 PTO days allotted + paid public holidays B2B contractors: 15 PTO days allotted + paid public holidays 100% remote work (or hybrid if you prefer! We have coworking space in center of Warsaw.) A Note to Candidates: We are an equal opportunity employer who values and encourages diversity, equity and belonging at our company. We do not discriminate on the basis of race, religion, color, national origin, gender, sexual orientation, age, marital status, veteran status, caste, or disability status. Don’t meet every requirement? Studies have shown that women, communities of color and historically underrepresented talent are less likely to apply to jobs unless they meet every single qualification. At Craft, we are dedicated to building a diverse, inclusive and authentic workplace, so if you’re excited about this role but your past experience doesn’t align perfectly with every qualification in the job description, we strongly encourage you to apply. You may be just the right candidate for this or other roles! #BI-Remote
Job Requirements
- 4+ years of experience in Data Engineering.
- 4+ years of experience with Python.
- Experience in developing, maintaining, and ensuring the reliability, scalability, fault tolerance, and observability of data pipelines in a production environment.
- Have fundamental knowledge of data engineering techniques: ETL/ELT, batch and streaming, DWH, Data Lakes, distributed processing.
- Strong knowledge of SDLC and solid software engineering practices.
- Familiar with infrastructure-as-code approach.
- Demonstrated curiosity through asking questions, digging into new technologies, and always trying to grow.
- Strong problem solving and the ability to communicate ideas effectively.
- Self-starter, independent, likes to take initiative.
- Familiarity with at least some of the technologies in our current tech stack:
- Python, PySpark, Pandas, SQL (PostgreSQL), ElasticSearch, Airflow, Docker
- Databricks, AWS (S3, Batch, Athena, RDS, DynamoDB, Glue, ECS, Amazon Neptune)
- CircleCI, GitHub, Terraform
- Knowledge surrounding AI-assisted coding and experience with Cursor, Co-Pilot, or Codex
- A strong track record of leveraging AI IDEs like Cursor to:
- Rapidly scaffold components and APIs
- Refactor legacy codebases efficiently
- Reduce context-switching and accelerate documentation
- Experiment and prototype with near-instant feedback
- What We Offer:
- Option to work as a
- B2B contractor
- or
- full-time employee
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Design and build data pipelines for collecting, processing, and storing healthcare clinical data, and continuously improve them. • Collaborate with data scientists, data integration specialists, analysts, and other stakeholders to understand data requirements and translate them into scalable solutions. • Implement data validation, cleansing, and transformation processes to ensure data accuracy and consistency. • Proactively monitor and troubleshoot pipeline performance, identifying and resolving issues before they become problems. • Ensure data security and compliance with industry regulations, including HIPAA. • Partner with the DevOps team to deploy and manage data pipeline infrastructure in our Azure cloud environment. • Evaluate and adopt emerging tools, frameworks, and techniques that can improve our data platform: bring recommendations, not just questions. • Leverage AI-assisted development tools and modern engineering workflows to accelerate delivery and improve code quality.
• Design and build end-to-end ML pipelines using GCP services (Vertex AI, BigQuery, Dataform) • Develop and productionize tabular ML models (e.g., XGBoost or similar) • Implement robust feature engineering pipelines with point-in-time correctness • Ensure reliable batch scoring workflows and production deployment • Partner with engineering and product stakeholders to translate business needs into ML solutions • Optimize data workflows for performance, scalability, and cost efficiency • Contribute to model evaluation, monitoring, and continuous improvement • Collaborate within a distributed team, ensuring clear communication and delivery alignment
Data Architect
KSM (Katz, Sapper & Miller)Advisory, tax, and audit firm providing visionary people with inspiration and insight to achieve great things.
• Define and maintain KSM’s Databricks lakehouse architecture, including ingestion, storage, transformation, modeling, and access patterns • Establish clear, repeatable design standards for Bronze, Silver, and Gold data layers to ensure consistency and reuse • Design architecture that supports structured, semi-structured, and unstructured data across enterprise systems • Design and implement data governance, lineage, and quality frameworks that ensure data is trusted, auditable, and scalable • Enable discoverability, access control, and metadata management using Databricks Unity Catalog and related tooling • Partner with data and analytics teams to define validation standards, reconciliation processes, and ownership models • Partner closely with the Senior Data Engineer to translate architectural standards into production-ready pipelines • Review and validate data pipelines, models, and workflows to ensure alignment with architectural best practices • Support teams by providing guidance, patterns, and examples rather than one-off solutions • Define best practices for Databricks compute and storage optimization, balancing performance and cost • Establish architectural patterns that promote reliability, scalability, and operational simplicity • Collaborate on monitoring, alerting, and operational standards to ensure platform health • Define and support CI/CD standards for data pipelines and lakehouse infrastructure • Partner with engineering teams to implement Infrastructure as Code (IaC) using tools such as Terraform • Ensure consistent, automated deployments across development, test, and production environments • Align data architecture decisions with firm-wide analytics, automation, and AI strategy • Help prioritize architectural investments that deliver near-term value while supporting long-term growth • Evaluate emerging data platform capabilities thoughtfully, focusing on practical adoption rather than experimentation for its own sake.
Data Migration Engineer
VineskillsWe streamline your law firm with Filevine, so you can focus on what really matters.
• Lead end-to-end, non-recurring data and document migrations from legacy case management systems into Filevine. • Partner directly with clients to assess source systems, define migration strategy, and set clear expectations. • Design and build SQL queries and ETL processes to extract, transform, and load structured data. • Develop and document detailed data mappings between source databases and Filevine. • Cleanse, normalize, and validate data to ensure accuracy and integrity before and after migration. • Collaborate with clients to understand their data structures and migration needs. • Troubleshoot and resolve migration-related issues, escalating when necessary. • Conduct post-migration testing and quality assurance. • Maintain detailed documentation of migration processes, issues, and resolutions. • Provide support and training to clients and internal teams on data migration best practices. • Participate in code reviews and continuous improvement initiatives.




