Data Engineer
Location
Poland
Posted
31 days ago
Salary
0
Seniority
Mid Level
Job Description
Data Engineer
Vattenfall
Role Description As our new Data Engineer, you will join a collaborative and skilled organization working with data at scale to enable automation of real business cases through data-driven and Machine Learning-based intelligence. We are hiring two Data Engineers for two closely collaborating teams with different focus areas: - The Data Science team, working directly with forecasting, analytics, and ML-based decision support. - The Data Streaming Platform (DSP) team, a newly formed team building and operating a central platform for real-time and event-driven data across the organization. We work in a Scrum setup and have in recent years been very successful in forecasting both electricity prices and electricity consumption, while at the same time scaling our real-time data capabilities. Your work will have direct and measurable impact. What will you do? - Design, build, and operate batch and real-time data pipelines using Azure, Databricks, and Python. - Ingest and process external and internal data such as weather data, market prices, grid telemetry, and event streams. - Work with streaming technologies including Kafka, Spark Structured Streaming, and Flink. - Ensure high data quality through validation, monitoring, and lineage. - Develop and improve data workflows, standards, and reliability in production systems. - Dependent on your area of expertise and interest, you will work in either the Data Science- or Data Streaming Platform team. Data Science team focus: - Enable forecasting, analytics, and ML use cases with high-quality, well-modelled data. - Collaborate closely with Data Scientists, ML Engineers, and other engineering teams. DSP team focus: - Build Kafka based central streaming data platform from scratch to enable traders, analysts and asset managers to publish and consume live and historical data at scale. - Operate and continuously enhance the components once the platform is in production. Location: Katowice or remotely in Poland Qualifications - Strong Python and SQL skills. - Practical experience with cloud data platforms (Azure, Databricks) and with different data storage solutions, including data lakes, lake houses and analytical databases. - Hands-on experience with streaming systems (Kafka, Spark or Flink). - Familiarity with Airflow, CI/CD, DevOps practices. - Experience with Docker, Kubernetes, and Infrastructure-as-Code. - Understanding of data contracts, time-series data, and event-driven systems. - Fluency in English and interest in the energy domain. Benefits - Good remuneration. - A challenging and international work environment. - The possibility to work with some of the best in the field. - Working in interdisciplinary teams with support from committed colleagues. - Attractive employment conditions and opportunities for personal and professional development.
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Freelance Spanish Data Annotation Specialist
DocPlannerAt Docplanner Group, we’re on a mission to help people live longer, healthier lives. As the world’s largest healthcare platform, each month, we connect 24 million patients with 280k doctors across 13 countries. Our marketplaces, SaaS and AI tools simplify daily tasks and help doctors, clinics and hospitals work more efficiently. Real impact – We help doctors help patients. Your work truly makes a difference. At scale, yet agile – 3,000+ employees, but still fast, flexible, and hands-on. Shape the future, sustain growth – Make a difference now and build for long-term success.
Role Description As a Freelance Data Annotation Specialist at Docplanner, your main focus will be preparing and curating linguistically and medically oriented datasets in Spanish to power our machine learning and natural language processing (NLP) initiatives. You will work independently while maintaining clear communication with our NLP experts, ensuring the accuracy and relevance of our data for the market, and contributing directly to the quality and reliability of our AI-driven products in the medical domain. This role is ideal for candidates with a strong linguistic background in Spanish who are passionate about working in the healthcare and technology space. The ideal candidate has experience in data annotation, transcription, and text analysis, combined with a basic understanding of medical terminology and concepts. This is not a full-time job; it's for freelancers only. - Prepare and maintain medically oriented datasets for machine learning purposes, ensuring high-quality data annotation and consistency. - Follow and provide feedback on annotation and labeling conventions, as well as perform audio transcriptions in Italian (aligning audio with text and correcting text based on audio). - Implement text patterns to detect and extract named entities in Portuguese, such as names, addresses, organization names, ID numbers, and medical terms. - Contribute to dictionaries and taxonomies in Spanish for use in NLP systems. - Evaluate the correctness and accuracy of words, sentences, and text structures, ensuring linguistic quality and consistency. - Deliver high-quality work within established deadlines and communicate proactively about project progress and challenges. Qualifications - Languages: Native Spanish speaker and Full Professional Level of English - Linguistic Expertise: A degree or strong background in linguistics, computational linguistics, or a related field. - Medical Knowledge: Familiarity with medical terminology or prior experience in the healthcare domain. - Data Annotation: Hands-on experience with data annotation, transcription, and labeling tools. - Text Analysis: Proficiency in working with text patterns and dictionaries for NLP purposes. - Attention to Detail: Exceptional ability to assess and improve data quality. - Self-Management: Strong ability to work independently, manage time effectively, and consistently meet deadlines. Requirements - Previous experience in AI, machine learning, or data preparation projects. - Knowledge of multiple languages or dialects. - Exposure to medical coding systems (e.g., ICD, CPT) or electronic health records (EHR). - Experience working as a freelancer on similar annotation or linguistic projects. Benefits - Remuneration: 15 euros / h
Principal Data Engineer
NasstarFrom cloud optimisation and application modernisation to connectivity and collaboration, we are Nasstar.
• Serve as the bridge between complex technical architecture and strategic business value. • Act as the primary strategic partner to client business owners, aligning technical data solutions with corporate goals while championing reliability and "clean data" for all consumers. • Drive the development of complex, large-scale data architectures and multi-year technology roadmaps. • Oversee "team-of-teams" delivery performance, ensuring the quality and resilience of high-impact data pipelines. • Lead the triage and resolution of critical platform incidents, maintaining transparent communication with clients. • Implement robust governance frameworks and compliance controls. • Champion an organisational culture of knowledge sharing and represent the firm at industry events. • Influence long-term organizational health by driving platform efficiency and leading strategic hiring and retention initiatives.
Title: Data Team Leader Location: United Kingdom Employees can work remotely Full-time Job Description: Please note this role has the flexibility to be hybrid working or fully remote based on your location. Working within the Healthcare (registries) division of NEC Software Solutions you will oversee a team of data scientists and data analysts (DSA) who’s duties feature a mix of maintaining existing applications, regular and bespoke report generation, dashboard creation, documenting code bases, cleansing datasets, identifying data quality issues in large databases and support the analytical requirements of clinical steering committees. Your primary responsibilities will be to manage the workload of the team, identify opportunities for team members to upskill, coordinate/standardise the production of reports, contribute to the statistical oversight and data efficacy boards, explore new opportunities for the application of machine learning and advanced statistical methodologies to health data and support the development of the DSA team. The focus of your work will be on data that is held in clinical registries relating to various performance outcomes for patients who have undergone surgical or medical interventions. This role involves working closely with the data science and analytics team, as well as clinicians, to develop new registries and health data reporting procedures to support the work of the NHS and private UK healthcare providers. This role has the flexibility to be hybrid working or fully remote based on your location and may require occasional trips to the head office in Hemel Hempstead or other NEC/NHSE locations in the UK to facilitate in-person meetings. Key responsibilities include: - Oversee a team of data scientists and data analysts - Prioritise and schedule the competing workloads of the team to meet delivery timelines - Lead regular update and 1-to-1 meetings with team members - Liaise with industry and academic partners to align and improve our statistical methodology and reporting strategy - Identify and provide appropriate training opportunities for members of the data science and analytics team - Explore real-world-evidence data sets for registry-specific analytical opportunities - Coordinate and assist with the generation of regularly scheduled and bespoke reports, including the plotting of charts using standard libraries - Take charge of data cleansing, quality assurance and supporting integration. - Contribute ideas to improve productivity and delivery. - Liaise with members of the data quality, development, business analyst and business consultant teams to deliver collaborative outputs - Build an extensive clinical knowledge base through meeting regularly with clinical steering committees - Work closely with clinicians to understand and support their reporting and analytical needs Qualifications Mandatory qualifications and experience: - A postgraduate degree in computer science, statistics, biology or a related quantitative discipline. - Expert R (R Studio) programming skills for manipulation/cleaning, report generation, visualisation of data, statistical analysis and modelling - Experience leading a team of data scientists or analysts - Proficiency with SQL (Azure) for data manipulation, extract creation and aggregation of data from disparate sources - Proven experience cleansing, processing and managing large data sets - Proven experience responding creatively to analytical briefs - Experience with data warehouses and open database connectivity - Statistical expertise (including survival analysis and predictive modelling) - The ability to deliver under pressure and without regular supervision - Proven problem-solving skills and ability to work to deadlines - Self-motivation and a flexible approach to work - The ability to thrive in a collaborative team environment - Desirable qualifications and experience: - Experience working with health data including clinical data registries - Experience working with ‘unclean’ data and free text - An interest in the application of machine learning, NLP and AI to solve problems related to health data and patient safety - Experience using Snowflake/DBT - Prompt Engineering for Github Copilot or equivalent Additional Information We pride ourselves in offering an excellent benefits package, including an above average pension scheme. When you join the team at NEC Software Solutions, you are provided with the following: - Private Medical Cover funded by NEC for Employees (with the option to add family members at an additional cost) - 25 days paid holiday with the option to buy/sell - 4 x basic salary life assurance cover funded by NEC (with the option to increase cover at an additional cost) - A Group Pension Plan with fantastic employer contributions up to a maximum of 8.5% - A selection of flexible benefits to suit your individual needs Candidates must be able to demonstrate a pre-existing right to work and travel within the UK. Documentary evidence will be required. All offers are subject to satisfactory vetting and reference checks. Depending on the nature of the role a Disclosure Barring Service (DBS) check may also be required. NEC Software Solutions is an equal opportunities employer, welcoming applications from all communities. If you require any reasonable adjustments or have specific accessibility needs during the recruitment or interview process, please feel free to share these with us. We are committed to ensuring an inclusive and accommodating experience for all candidates. Candidates must be able to demonstrate a pre-existing right to work and travel within the UK. Documentary evidence will be required. All offers are subject to satisfactory vetting and reference checks. This role will require UK Security Clearance. NEC Software Solutions is an equal opportunities employer, welcoming applications from all communities. If you require any reasonable adjustments or have specific accessibility needs during the recruitment or interview process, please feel free to share these with us. We are committed to ensuring an inclusive and accommodating experience for all candidates. Who We Are: We’re NEC Software Solutions (part of global tech giant NEC Corporation). While you read this ad, our software is helping to dispatch ambulances, support families, keep trains on the move, locate missing people and even test the hearing of newborn babies. Working with us, you’ll be helping our 3,000+ employees push the boundaries of what’s possible and support amazing public services. We work with governments, hospitals, police forces, housing providers, local authorities and more. We help them pay financial support faster, speed up treatments for patients and respond to emergencies in the right way. The more we do, the more our customers can do for others. And together, we make a world of difference. We’d love your help. And we’ll support you all the way. NEC’s Healthcare division specialises in medical registries and health screening. The company’s software has been used to screen millions of babies for hearing loss and other inherited conditions. It also supports national and local diabetic eye screening programmes. NEC is the largest clinical registry supplier in the UK and its customers include NHS England’s UK-wide Outcomes and Registries Programme (which will become one of the largest clinical registries in the world), as well as specialist registries such as for robotic and spinal surgery.
Role Description Enable Data Incorporated is seeking a highly skilled Senior Data Engineer with expertise in cloud technologies, Spark, and Databricks to join our dynamic team. As a Senior Data Engineer, you will play a crucial role in designing, developing, and maintaining scalable and efficient data solutions in the cloud. - Design, develop, and maintain scalable and robust data solutions in the cloud using Apache Spark and Databricks. - Gather and analyze data requirements from business stakeholders and identify opportunities for data-driven insights. - Build and optimize data pipelines for data ingestion, processing, and integration using Spark and Databricks. - Ensure data quality, integrity, and security throughout all stages of the data lifecycle. - Collaborate with cross-functional teams to design and implement data models, schemas, and storage solutions. - Optimize data processing and analytics performance by tuning Spark jobs and leveraging Databricks features. - Provide technical guidance and expertise to junior data engineers and developers. - Stay up-to-date with emerging trends and technologies in cloud computing, big data, and data engineering. - Contribute to the continuous improvement of data engineering processes, tools, and best practices. Qualifications - Bachelor's or Master's degree in computer science, engineering, or a related field. - 10+ years of experience as a Data Engineer, Software Engineer, or similar role, with a focus on building cloud-based data solutions. - Strong experience with cloud platforms such as Azure. - Proficiency in Apache Spark and Databricks for large-scale data processing and analytics. - Experience in designing and implementing data processing pipelines using Spark and Databricks. - Strong knowledge of SQL and experience with relational and NoSQL databases. - Experience with data integration and ETL processes using tools like Apache Airflow or cloud-native orchestration services. - Good understanding of data modeling and schema design principles. - Experience with data governance and compliance frameworks. - Excellent problem-solving and troubleshooting skills. - Strong communication and collaboration skills to work effectively in a cross-functional team. - Relevant certifications in cloud platforms, Spark, or Databricks are a plus.


