Mission-driven engineering firm helping government teams innovate.
Data Scientist / Data Engineer
Location
United States
Posted
13 days ago
Salary
0
Seniority
Senior
Job Description
Data Scientist / Data Engineer
Game Plan Tech
• Join a cross-functional product team supporting Government Logistics • Develop, validate, and deploy statistical and machine learning models • Design and build data pipelines that integrate authoritative Government systems • Translate ambiguous mission questions from government leaders into analytical problems • Productionize models within government environments complying with DoD RMF and controls • Document methodology for government stakeholders to defend model-informed decisions
Job Requirements
- U.S. citizenship with active DoD Secret clearance
- Bachelor’s or higher in a quantitative field (statistics, mathematics, computer science, etc.)
- 3+ years (mid-level) or 6+ years (senior) building and shipping data products in a production environment
- Strong proficiency in Python (pandas, NumPy, scikit-learn, etc.) and SQL
- Experience with at least one major cloud platform (AWS GovCloud or Azure Government preferred)
- Experience working with messy, real-world enterprise data
- Customer-facing role experience
Benefits
- Equal employment opportunities to all individuals
- Reasonable accommodation available during the application process
- Fostering a diverse and inclusive workplace
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Deploy new data pipelines • Design & build data observability platforms and metrics • Build metadata driven pipeline solutions • Monitor and optimize data transformations • Build third party data integrations
• Perform migration activities of Alteryx workflows to AWS; • Develop and refactor data pipelines using AWS Glue; • Adapt processes and business rules to a cloud-native architecture; • Ensure the quality, performance, and scalability of pipelines; • Provide technical support to the team with specialized Alteryx expertise; • Participate in the analysis, documentation, and optimization of ETL processes.
• Develop and maintain batch and streaming data pipelines in Databricks using PySpark and Spark SQL; • Implement pipelines following the established Medallion architecture (Bronze, Silver, and Gold); • Assist in migrating legacy pipelines and jobs from tools such as IBM DataStage, Azure Data Factory, and Azure Synapse Analytics to Databricks Workflows; • Help migrate routines and notebooks from Databricks on Azure to AWS; • Develop and version notebooks and code using Databricks Repos and Git; • Implement basic tests and data quality routines in pipelines; • Support dimensional modeling of the Data Warehouse (Star Schema) with the architecture team; • Collaborate with the DevOps team to automate deployments and CI/CD; • Participate in implementing data governance with Unity Catalog; • Monitor and provide support for production data pipelines.
• Build the new corporate Lakehouse using Databricks on AWS; • Develop batch and streaming data pipelines with PySpark and Spark SQL; • Create and maintain the Medallion architecture (Bronze, Silver, Gold) in Delta Lake; • Participate in designing the new Data Warehouse data model; • Migrate data, pipelines, and routines from the current Databricks on Azure environment to AWS; • Migrate jobs and integrations from IBM DataStage, Azure Data Factory, and Azure Synapse Analytics; • Migrate Databricks workspaces between Azure and AWS environments; • Configure and operate data governance using Unity Catalog and Odin; • Implement monitoring, testing, and data quality; • Work alongside DevOps on deployment automation and CI/CD; • Version notebooks, pipelines, and code using Git and Databricks Repos.


