Data Engineer
Location
Washington
Posted
3 days ago
Salary
$125K - $150K / year
Seniority
Senior
Job Description
Data Engineer
Beyondsoft
• Support scalable data operations through development of ETL processes, SQL-based integrations, Power Platform solutions, and Power BI reporting capabilities. • Design, build, maintain, monitor, and troubleshoot data-processing automations. • Develop and maintain data ingestion pipelines from external sources into SQL databases. • Manage automated flows to trigger Logic Apps and handle lightweight processes. • Perform full-stack BI development including data modeling, DAX development, and report publishing. • Leverage Microsoft Fabric as the unified access platform. • Ensure alignment with security and compliance requirements. • Conduct root-cause analysis to reconcile discrepancies between systems.
Job Requirements
- 5+ years of experience in data engineering, BI development, automation engineering, or related technical roles.
- Strong hands-on experience with Azure Data Factory (ADF).
- Experience developing and maintaining Logic Apps and Power Automate workflows.
- Advanced SQL development skills including query optimization and ETL development.
- Strong experience with Power BI development including data modeling.
- Experience with Microsoft Fabric and enterprise BI/reporting platforms.
- Experience administering Azure resources including RBAC.
- Ability to troubleshoot and resolve issues across ETL pipelines.
- Strong analytical and problem-solving skills.
Benefits
- 15 days per year of Paid Time Off (PTO).
- 8 paid holidays + 1 personal floating holiday.
- 401(k) retirement plan with company match.
- Medical, dental, and vision insurance.
- Health savings account (HSA).
- Short-term and long-term disability.
- Employee assistance plan (EAP).
- Basic life and AD&D insurance.
- Health care flexible spending account.
- Dependent care flexible spending account.
- Commuter benefits.
- Voluntary accident & critical injury coverage.
- Voluntary long-term care coverage.
- Voluntary life and AD&D insurance.
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Role Description We are looking for a Senior Data Engineer to join the Innovation team as a core member of the PF-LLM programme — our initiative to build a from-scratch multivariate time-series foundation model across a fleet of ~1,000 wind and PV sites. You will be the connective tissue of the entire programme: - Owning the data foundation that makes state-of-the-art model training possible. - Managing the inference service that makes model outputs usable. - Overseeing platform integration that puts those outputs in front of pilot customers. - From production ETL through to shadow-mode validation pipelines, you will be the engineer who keeps every track moving. This role is critical-path from day one. Qualifications - 6+ years of back-end and data engineering experience, with a proven track record of shipping production systems. - Production-grade ETL/ELT pipeline design at scale: idempotency, retry logic, backfill jobs, incremental loading, and cost-controlled warehouse compute. - Schema design and data modelling across heterogeneous sources — experience reconciling signals from disparate systems into a canonical, queryable format. - Data quality engineering: automated quality gates (sparsity, flatline detection, outlier flagging, freshness checks), alerting pipelines, and dataset versioning for ML reproducibility. - API design and development: RESTful inference services with contract testing, latency and throughput budgeting, and structured observability (logs, metrics, traces). - Experience integrating ML model outputs into SaaS product surfaces: auth and authorisation, customer isolation, and feature flag management. - Cloud infrastructure proficiency (AWS preferred), containerisation (Docker, Kubernetes), and CI/CD pipeline ownership. - Python and SQL as core tools; hands-on experience with modern warehouse technologies (Snowflake, BigQuery, or Databricks). - Pipeline orchestration with Airflow, Prefect, Dagster, or equivalent. - Excellent written and verbal communication skills in English. Requirements - Design and build the production ETL pipeline from source systems to warehouse and feature store at fleet scale, covering thousands of wind and PV sites across multiple OEMs. - Own canonical signal schema design across wind and PV asset classes and OEMs — the deepest technical unknown in the programme and the foundation everything else depends on. - Implement automated data quality gates: sparsity and missingness checks, flatline detection, outlier flagging, and freshness validation, with alerting that generates tickets automatically. - Implement dataset versioning sufficient to reproduce every trained model from scratch. - Build and maintain backfill jobs, idempotency guarantees, and retry logic that survive mid-run failure without duplicating data. - Govern storage and compute costs on the warehouse from day one. - Build the batch and on-demand inference API with contract tests, sized for fleet-wide daily runs. - Establish latency and throughput baselines; own the cold-start and model-loading strategy. - Instrument the service with structured logs and metrics from the outset. - Integrate forecasts into the Power Factors product platform: auth and authorisation with customer isolation, observability hooked into the existing stack, and feature flags per customer and per site. - Build and maintain the shadow validation pipeline: run live inference in parallel with the existing forecast path, log predictions and actuals, and produce weekly validation reports broken down by asset class, OEM, and region. - Support the pilot customer rollout: enable the product for friendly customers behind flags and own incoming data and integration tickets throughout the pilot window. - Work closely with the ML Engineer to align on data quality requirements, feature store interfaces, and the handoff between the data platform and training pipeline. - Partner with the Tech Lead and Frontend Engineer during platform integration to ensure a clean, maintainable integration surface. - Contribute to architectural decisions across the programme and document data flows, schemas, and pipeline runbooks to a standard that supports the broader team. Benefits - Comprehensive benefits package including health, dental, and vision coverage, plus dedicated wellness support. - Generous paid vacation policy. - Employer RRSP matching program. - Work-from-abroad opportunities with manager approval. - Exposure to a global team operating across multiple countries and time zones. - A humble cause with a clear purpose — you will help us fight climate change with code every day at work.
Senior Data Engineer – Enterprise B2B Marketplace
Truelogic SoftwarePremium boutique software development company that helps brands with big ideas to make a difference in people’s lives.
• Data Platform Evolution: Guide the foundational architecture, scaling strategies, and long-term roadmap of the enterprise data platform. • Pipeline Engineering: Design and lead the development of highly scalable data pipelines using Airflow, dbt, and Python. • Modern Stack Integration: Build and maintain high-throughput integrations across core modern data stack tools, including Fivetran, Redshift, and Sigma. • Serverless Architecture: Develop and optimize serverless data services and ingestion layers leveraging AWS infrastructure (e.g., AWS Lambda). • Advanced Data Modeling: Partner with cross-functional stakeholders to define reliable, performant data warehouse architectures and analytical datasets. • Observability & Reliability: Implement automated testing, rigorous monitoring frameworks, and tracing to maximize pipeline reliability and minimize operational downtime. • Technical Leadership & Governance: Mentor data engineers and analysts on engineering best practices, while driving continuous improvements in data governance and documentation.
Senior Data Engineer – Enterprise B2B Marketplace
Truelogic SoftwarePremium boutique software development company that helps brands with big ideas to make a difference in people’s lives.
• Guide the foundational architecture, scaling strategies, and long-term roadmap of the enterprise data platform. • Design and lead the development of highly scalable data pipelines using Airflow, dbt, and Python. • Build and maintain high-throughput integrations across core modern data stack tools, including Fivetran, Redshift, and Sigma. • Develop and optimize serverless data services and ingestion layers leveraging AWS infrastructure (e.g., AWS Lambda). • Partner with cross-functional stakeholders to define reliable, performant data warehouse architectures and analytical datasets. • Implement automated testing, rigorous monitoring frameworks, and tracing to maximize pipeline reliability and minimize operational downtime. • Mentor data engineers and analysts on engineering best practices, while driving continuous improvements in data governance and documentation.
Senior Data Engineer – Enterprise B2B Marketplace
Truelogic SoftwarePremium boutique software development company that helps brands with big ideas to make a difference in people’s lives.
• Data Platform Evolution: Guide the foundational architecture, scaling strategies, and long-term roadmap of the enterprise data platform. • Pipeline Engineering: Design and lead the development of highly scalable data pipelines using Airflow, dbt, and Python. • Modern Stack Integration: Build and maintain high-throughput integrations across core modern data stack tools, including Fivetran, Redshift, and Sigma. • Serverless Architecture: Develop and optimize serverless data services and ingestion layers leveraging AWS infrastructure (e.g., AWS Lambda). • Advanced Data Modeling: Partner with cross-functional stakeholders to define reliable, performant data warehouse architectures and analytical datasets. • Observability & Reliability: Implement automated testing, rigorous monitoring frameworks, and tracing to maximize pipeline reliability and minimize operational downtime. • Technical Leadership & Governance: Mentor data engineers and analysts on engineering best practices, while driving continuous improvements in data governance and documentation.

