Senior Software Engineer- Data Engineering
Location
United States
Posted
2 days ago
Salary
$142.6K - $192.9K / year
Seniority
Senior
Job Description
Senior Software Engineer- Data Engineering
Noctua Technology
Role Description The Data Engineering & AI/ML capability serves as the backbone of our organization's data-driven operations. We are seeking a talented and motivated Senior Data Engineer to join our dynamic Data Engineering team. As a key member of our engineering team, you will play a crucial role in: - Constructing and optimizing data pipelines - Implementing efficient storage solutions - Orchestrating the infrastructure necessary to support our customer’s data-driven initiatives - Leading engineering engagements by collaborating with cross-functional teams including customers, partners, and internal engineers Location: Primarily Remote. Candidates must be based in CA or DC Metro Area for proximity to project and client teams. Security Clearance Requirement: Applicants must be US citizens and eligible to obtain and maintain an active Secret security clearance or above. Key Responsibilities - Data Collection and Processing - Acquire, clean, and preprocess diverse datasets from various sources, establishing best practices and ensuring data quality standards. - Design and build required infrastructure for optimal extraction, transformation and loading of data from various data sources using CSP managed services and SQL technologies, focusing on scalability and cost efficiency. - Develop, maintain, and optimize mission-critical data pipelines to ensure a continuous flow of high-quality data, implementing robust monitoring and alerting. - Data Migrations & Optimization - Architect and develop data migration strategies and schemas to lead complex customer migrations from on-prem to cloud technologies. - Oversee and execute high-volume data migration activities, ensuring data integrity and minimal downtime. - Optimize databases and data warehouses for efficient querying and data storage, implementing advanced partitioning, indexing, and tuning techniques. - Data Analysis and Visualization - Perform exploratory data analysis to uncover patterns, trends, and insights, providing actionable recommendations to technical and business stakeholders. - Create visualizations and reports to communicate findings effectively to stakeholders both internally and externally, driving data-driven decision making. - Collaboration and Documentation - Collaborate with cross-functional teams, including software engineers, domain experts, and business analysts, to understand requirements and deliver integrated solutions, end-to-end data solutions. - Create and maintain comprehensive documentation for data architectures, code, algorithms, and models. Ensure that the knowledge is shared and accessible within the team. - Customer Engagement - Act on client feedback constructively to improve services and outcome, serving as a key technical contact for data-related discussions. - Continuously seek ways to enhance the overall customer experience, proactively identifying and addressing complex data challenges. - Continuous Learning and Innovation - Stay updated on the latest developments in cloud data services, machine learning, data science, and analytics. - Drive innovation by evaluating, proposing, and implementing cutting-edge techniques and technologies to address client challenges. Qualifications - Expert-level understanding and experience with SQL and relational database concepts - Deep understanding of database technologies, data warehouses, and ETL tools (e.g., MySQL, PostgreSQL, Beam, Airflow, and Kafka) - Proven track record of designing and implementing scalable data pipelines - Advanced experience with data analysis tools (e.g., Jupyter, Colab, Pandas) - Extensive experience with data visualization tools (e.g., Tableau, Looker, PowerBI, Qlik, and SuperSet) - Demonstrated experience developing comprehensive data strategies and facilitating data migrations into production systems - Expertise with cloud platforms (e.g., AWS, Azure, GCP) - Proficiency in programming languages such as Python, Java, or C++ - Strong software engineering skills with an emphasis on writing clean, modular, and maintainable code - Experience with version control systems (e.g., Git) and collaborative development workflows - Exceptional problem-solving and critical-thinking skills - Effective communication skills and ability to work in a collaborative team environment Preferred Qualifications - Bachelor's or advanced degree in Computer Science, Data Science, Machine Learning, or a related field - Experience with other database technologies (e.g., NoSQL, Graph) - Any of the below cloud certifications: - Google Cloud Professional Cloud Architect - Google Cloud Professional Database Engineer certification - Google Cloud Professional Data Engineer - Experience with additional data processing tools and technologies (e.g., Spark, Hadoop) - Knowledge of containerization and orchestration tools (e.g., Docker, Kubernetes) Salary Range $142,550 - $192,900
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Data ingestion for the Databricks environment. • Development and optimization of data pipelines using PySpark and SQL. • Integration with Microsoft Fabric for data management and governance. • Building semantic models in Power BI to support advanced analytics.
Data & AI Engineer – Microsoft Fabric
ReplyReply designs and implements innovative solutions in the areas: Digital Services, Technology and Consulting.
• Looking for a Senior Data Engineer with deep expertise in Microsoft Fabric and AI integration. • Hands-on experience architecting and delivering end-to-end data solutions within the Microsoft Fabric ecosystem, including Data Factory, Lakehouse, Warehouses, Notebooks, and Real-Time Intelligence. • Comfortable working with large-scale data pipelines, semantic models, and AI-powered workflows using Azure OpenAI or similar tools.
Staff Data Engineer
PayabliThe next-generation payments infrastructure for software companies to quickly and easily embed and monetize payments.
• Architect the platform. Set our warehouse/lakehouse direction and stand up the data lake and layered architecture that turns our raw system of record into trustworthy, queryable, intelligence-ready data. • Build the pipelines. Design and run batch and streaming pipelines that move data reliably out of our production systems - CDC, ELT, and real-time where it matters. • Model the data. Define the canonical datasets and models the whole company depends on, getting the grain, semantics, and contracts right. • Own reliability and accuracy. This is financial data, so correctness is non-negotiable. You'll own data quality, observability, integrity checks, and the testing and monitoring that let us trust it. • Build for a regulated environment. Design in role-based access, masking, lineage, and auditability from day one, and keep sensitive financial data out of places it doesn't belong. • Enable AI/ML and analytics. Build the feature pipelines and trustworthy data foundation our intelligence work relies on, moving us from systems of record toward systems of intelligence and action. • Set the standard. Establish the practices, tooling, and CI/CD for data that the future team inherits. You're setting the bar, not just clearing it.
Role Description Dein täglicher Verantwortungsbereich: - Entwicklung und Verantwortung der Architektur sowie Implementierung skalierbarer ETL‑Pipelines auf Basis von Microsoft Fabric. - Effiziente, automatisierte Integration heterogener Datenquellen, darunter MySQL‑Datenbanken, APIs und weitere. - Konzeption, Planung und Umsetzung von KI- und Automatisierungsprojekten. - Weiterentwicklung der Data‑Warehouse‑Architektur durch kontinuierliche Anpassung von Datenmodellen und Strukturen an neue Anforderungen. - Entwicklung robuster Prozesse für Datenmodellierung, -gewinnung und -produktion zur Schaffung einer stabilen Grundlage für Reporting, Analytics und KI‑Anwendungen. - Hohe Standards der Datenqualität, Sicherheit und Zuverlässigkeit im Blick behalten und Integrität, Verfügbarkeit und Vertraulichkeit entlang der gesamten Pipeline sicherstellen. - Aktives Verfolgen von Trends und technologischen Entwicklungen im Bereich Data Engineering, KI und Automatisierung. Qualifications - Abgeschlossenes Bachelor- oder Masterstudium in Informatik, Informationstechnologie oder einem vergleichbaren Fachgebiet. - Mindestens 5 Jahre Berufserfahrung mit tiefem Verständnis für die Entwicklung und Implementierung moderner ETL-Pipelines. - Praktische Erfahrung im Einsatz von Automatisierungs-Tools wie Azure Logic Apps, n8n oder Make. - Kenntnisse im Umgang mit Large Language Models sowie gängigen Patterns rund um generative KI. - Fundierte Kenntnisse in Microsoft Fabric oder Microsoft Data Factory, Apache Spark und Notebooks. - Ausgeprägtes Know-how in MySQL und Routine im Umgang mit unterschiedlichen Datenbanken und Data‑Warehousing-Lösungen. - Nachweisbare Erfolgsbilanz in der Stapelverarbeitung und der Arbeit mit strukturierten Daten (idealerweise im Immobilienumfeld). - Freude an der Weiterentwicklung im Bereich Power BI. - Fließende Deutsch- und sehr gute Englischkenntnisse. Benefits - Remote Work: Flexibles Arbeitsmodell – Arbeite bis zu 100% remote innerhalb Deutschlands oder nutze unser modernes Office in Hamburg nach Bedarf. - Workation: Arbeitsumfeld mit Urlaubsfeeling – genieße die perfekte Symbiose aus produktiven Schaffen und entspannter Atmosphäre im europäischen Raum. - Wellbeing: Betriebliches Gesundheitsmanagement – ein jährliches Gesundheitsbudget sowie eine bezuschusste Mitgliedschaft beim Urban Sports Club. - Altersvorsorge: Attraktive Bezuschussung zur betrieblichen Altersvorsorge. - Hardware: Modernste Technik – damit wir gemeinsam innovativ bleiben. - Rabatte: Attraktive Angebote bei über 1.500 Anbietern aus den Bereichen Sport, Mobilität, Mode und Reisen. - Knowledge Base: Vielfältige, individuelle Entwicklungs- und Weiterbildungsmöglichkeiten. - Networking: Regelmäßige Teamevents. - Kostenlose Limonade und Obstkorb.



