Job Closed
This listing is no longer active.
The University of Maryland, founded in 1856, is the state’s flagship public research university and one of the nation’s preeminent institutions of higher ed
Data Architect (Assistant Research Engineer )
Location
United States
Posted
81 days ago
Salary
$165K - $177K / year
Job Description
Data Architect (Assistant Research Engineer )
University of Maryland
Job Description Summary Organization's Summary Statement: The A. James Clark School of Engineering at the University of Maryland serves as the catalyst for high-quality research, innovation, and learning, delivering on a promise that all graduates will leave ready to impact the Grand Challenges (e.g., energy, environment, security, and human health) of the 21st century. The Clark School is dedicated to leading and transforming the engineering discipline and profession, to accelerating entrepreneurship, and to transforming research and learning activities into new innovations that benefit millions. The Center for Advanced Transportation Technology (CATT) Laboratory is the industry leader for transportation information analysis, visualization, and user interface design. We provide cutting-edge analytics products and an integrated suite of situational awareness tools for transportation practitioners. These products and services are rapidly changing the way governments operate and make decisions. You can learn more about our products at https://ritis.org/. We receive hundreds of gigabytes of transportation data daily, making our petabytes of archived data likely the largest collection of traffic data in the world. Our clients use our software to monitor real-time operations and analyze historical data to generate valuable insights. Our work saves taxpayers money, improves the environment, and saves lives! We’re as passionate about transportation as we are about building great software. We care about building usable, stable, and secure software to analyze massive amounts of data. We use cutting-edge tech to build and maintain our software. We have a mature development process and use industry best practices to build the best software possible. Our team is composed of application developers, analysts, UX designers, data scientists, IT, quality assurance specialists, and customer support operating in an Agile environment. Our office is in College Park near the University of Maryland, easily accessible by DC Metro, MARC train, bus, car, and bike. Local employees are welcome to work in our office, or other locations, with a flexible schedule around our core hours. We also have many employees who are fully remote and work from different states. UMD requires all employees to live in the US, and we periodically bring remote employees to work with colleagues on-site. We believe varied perspectives build better products, are proud to have a diverse team, and encourage people of all backgrounds to apply. When you join our team, you will work to define, document, and test a wide variety of transportation data analytics and operations applications. You will learn new skills and stay current with industry best practices and emerging technologies. Position Summary The CATT Lab is seeking an experienced data engineer to architect and deliver advanced technical solutions to complex transportation data problems. This individual will lead cross functional developers to design and implement large-scale data processing pipelines. They will design geospatial data processing solutions for critical infrastructure planning and operations managers. If you’re looking for complex data problems in the GIS domain to collaboratively solve with a brilliant research and development team, please apply! Essential duties and responsibilities: ● Design, develop, deploy, and manage data storage and processing solutions. ● Develop data models and schemas for new data sources for both relational and non-relational distributed platforms. ● Write and maintain data pipelines to ingest large datasets from APIs and cloud storage to local storage instances. ● Write data retrieval, statistical calculation, and geospatial analysis queries and stored procedures. ● Assist with analysis and discovery of new datasets for integration into existing internal platforms. ● Collaborate with data platform team to optimize data processing pipeline performance. ● Estimate effort for work tasks and report on work progress to the technical team lead. ● Participate in our Tier II Support team rotation Minimum Qualifications Education: Doctoral Degree in Computer Science or Engineering. Experience: 20+ years experience architecting and leading the implementation of complex, high demand and volume data platform solutions. 10+ years experience in a principal or lead role, contributing directly to enterprise data services and product development. 5+ years experience in a technical team leadership role, mentoring less experienced developers in delivering robust, performant data features. Experience writing ETL scripts, APIs and standalone applications in both Python and Java with advanced knowledge of SQL. Knowledge, Skills, and Abilities: Experience designing and implementing large-scale data processing pipelines and analytics systems using Apache Spark and Apache Beam. Experience with data analytics, investigating new data sources and architecting storage and compute solutions that maximize the value of the dataset. Familiarity with data science best practices. Knowledge of frontend user interface design and development, including REACT, typescript and javascript. Ability to architect a full stack feature including backend and frontend development. Strong communication skills; ability to present technical plans to a non-technical audience efficiently. Preferences Experience with non-relational data storage and processing solutions like Cassandra, Hadoop, Spark, Iceberg, Sedona etc. Experience working with geospatial datasets, using geospatial extensions (such as PostGIS) and querying features. Experience with Linux (RHEL), including writing robust Bash scripts. Experience architecting and implementing backup and recovery strategies. Experience with Git, Jira, Confluence, and Bitbucket. Experience with cloud platforms including AWS and GCP. Physical Demands: Sedentary work performed in an office environment. Regularly required to communicate and exchange information and to use technology/devices. Position can be 100% remote (US based only) Licenses/ Certifications: NA Additional Job Details Required Application Materials: - CV/Resume - References upon request. - Professional Statement-required upon completion of the interview process and before an offer - 3 External Letters-required upon completion of the interview process and before an offer Best Consideration Date: N/A Posting Close Date: 04/17/2026 Open Until Filled: NO Financial Disclosure Required NoFor more information on Financial Disclosure, please visit Maryland's State Ethics Commission website. Department ENGR-Civil-Center for Advanced Transportation Technology Worker Sub-Type Faculty Regular Salary Range $165,000-$177,968.79 Benefits Summary For more information on Regular Faculty benefits, select this link. Background Checks Offers of employment are contingent on completion of a background check. Information reported by the background check will not automatically disqualify anyone from employment. Before any adverse decision, the finalist will have an opportunity to provide information to the University regarding disclosable background check information. The University reserves the right to rescind the offer of employment or otherwise decline or terminate employment if the information reported by the background check is deemed incompatible with the position, regardless of when the background check is completed. Employment Eligibility The successful candidate must complete employment eligibility verification (on Form I-9) by presenting documents that establish identity and work authorization within the timeframe required by federal immigration law, and where applicable, to demonstrate renewed employment authorization. Failure to complete employment eligibility verification or reverification within the timeframe set forth by law may result in suspension or termination of employment. EEO Statement The University of Maryland, College Park is an Equal Opportunity Employer. All qualified applicants will receive equal consideration for employment. Please read the University’s Equal Employment Opportunity Statement of Policy. Title IX Non-Discrimination Notice Resources - Learn how military skills translate to civilian opportunities with O*Net Online Search Firm Managed Recruitment There are some positions that are not advertised on this career site as the search is being managed by a Search Firm. Please visit the link below to see these available opportunities: Search Firm Managed Vacancies
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Overview CommIT Enterprises, Inc. is seeking a highly skilled Lead Data Engineer to join a team in Charleston, SC, but this role can be REMOTE, to design, develop, and maintain advanced data ingestion pipelines and models that support the United States Marine Corps (USMC) logistics situational awareness and decision-making. This role requires expertise in ETL/ELT processes, CI/CD pipeline development, and data modeling to ensure reliable, scalable, and secure data solutions. This role directly supports the USMC by enabling data-driven situational awareness, ensuring leaders have timely, accurate, and actionable insights to inform operational decisions. Established in 2001, CommIT is a Certified Veteran-Owned Small Business (CVOSB) providing innovative technical engineering and data science services. Our enterprise systems support includes the Department of Defense’s (DoD) GCSS-MC, CAC2S, TBMCS-MC, and the Department of Veteran’s Affairs’ (VA) telehealth communications. We offer acquisition management, systems engineering, Agile software development, cloud management, IT modernization, data analytics, cybersecurity, and training, including leading-edge DevSecOps, automated testing, and mobile application development. Responsibilities Your essential job functions will include but may not be limited to- - Lead the design, development, and maintenance of CI/CD pipelines that enable ingestion teams to seamlessly move ETL/ELT processes through Development, Testing, and Production environments. - Build and optimize ETL/ELT workflows to ingest, validate, and transform data from diverse sources. - Apply data modeling techniques to create Silver and Gold Tier data assets that enhance situational awareness and support decision-making. - Collaborate with cross-functional teams to ensure data pipelines and models align with operational requirements and reporting needs. - Drive best practices in data engineering, automation, and pipeline reliability to support mission-critical analytics. Qualifications Required Experience and Education: - Master’s degree with 6 years of experience (or Bachelors with 8 years of experience) in Computer Science, Software Engineering, Computer Engineering, Mathematics or relevant field. Degree may be substituted with additional relevant industry experience and / or industry accepted training and certification. - 5+ years of experience in data engineering with a focus on ETL/ELT and CI/CD. - Hands-on experience with Databricks, Delta Lake, and GitLab Enterprise. - Strong proficiency in Python and SQL. - Experience supporting logistics, defense, or mission-critical environments is a plus. - Proven experience with ETL and ELT processes for ingesting structured, semi-structured, and unstructured data sources. - Deep understanding of CI/CD practices, with the ability to design and implement pipelines using Databricks and GitLab Enterprise for Data Ingestion teams. - Strong background in data modeling, with the ability to build Silver and Gold Tier Tables and Views to support operational dashboards and reporting. Technical Requirements: - ETL and ELT methods - Databricks and Delta Tables - Python and SQL programming - CI/CD pipeline development and automation - GitLab Enterprise for version control and deployment Security Requirements: - Secret Clearance - Security+ Certification Equal Opportunity Employer: CommIT Enterprises, Inc. is an Equal Opportunity Employer. Employment decisions are made without regard to race, color, religion, national origin, gender, sexual orientation, gender identity, age, physical or mental disability, genetic factors, military/veteran status or other characteristics protected by law.
Senior Data Platform Engineer II (Databricks)
AledadeSelf-described as "a new company with an old-fashioned goal," Aledade aims to put healthcare control back into the hands of doctors. Headquartered in Bethesda, Maryland, the compan
As a Senior Data Platform Engineer II, you will architect and manage the high-performance, distributed data environments that power our healthcare analytics. You will move beyond traditional maintenance to ensure our Databricks Lakehouse and Snowflake environments scale indefinitely. You will be responsible for the health, optimization, and security of our data platforms, making complex data accessible and expressive for web applications and AI. Primary Duties: - Develop and implement scalable and performant solutions. - Partner, as a peer, with Engineering Managers, Product Managers, and stakeholders throughout Aledade to develop and execute technical roadmaps using Agile processes. - Mentor and coach more junior engineers including thorough pull request reviews for other developers and be receptive to critical feedback on your own work. Minimum Qualifications: - BS/BTech (or higher) in Computer Science, Engineering or a related field or equivalent experience. - 6+ years experience as an engineer building and optimizing highly scalable distributed data systems (e.g., Databricks, Spark, or Snowflake). - 3+ years of experience working with SQL and data modeling on large multi-table data sets. - 3+ years of experience acting as a trusted technical decision-maker in a team setting, solving for short-term and long-term business value. - 3+ years of experience coaching other engineers. Preferred KSA’s: - Platform & Infrastructure (The "Databricks/Cloud" Core) - Databricks & Lakehouse Architecture: Deep expertise in managing Databricks workspaces, including Unity Catalog for data governance, lineage, and fine-grained access control. - Infrastructure as Code (IaC): Advanced proficiency with Terraform (or similar) to automate the provisioning and scaling of Databricks clusters, cloud resources (AWS preferred), and networking. - Snowflake Proficiency (Nice-to-Have): Experience managing Snowflake environments, specifically focusing on warehouse cost optimization, security integration, and secure data sharing. - Modern Database Internals: In-depth knowledge of distributed systems, including partitioning, liquid clustering/Z-Ordering, sharding, and high-availability strategies for petabyte-scale data. Performance, Reliability & DevOps - Observability & Optimization: Proven track record in performance monitoring and query tuning for distributed workloads to ensure system reliability and cost-efficiency. - Data Engineering Lifecycle: Experience designing and optimizing high-throughput ETL/ELT pipelines and ingestion systems (batch and streaming) using Spark. - Deployment & Orchestration: Experience building robust CI/CD pipelines for data infrastructure and deploying services using containerization (Docker, Kubernetes). Security, Compliance & Domain Knowledge - Sensitive Data Handling: Expertise in building systems that handle protected information, with specific experience in HIPAA and SOX compliance frameworks. - Healthcare Data Expertise: Experience navigating health-tech data complexities, such as Electronic Health Records (EHR), clinical data formats (HL7/FHIR), and claims data. Physical Requirements: - Sitting for prolonged periods of time. Extensive use of computers and keyboard. Occasional walking and lifting may be required.
GHX is seeking a Software Engineer III to work on our Content Tooling solution with a focus on data engineering and analytics. This individual will be responsible for the creation, implementation, and support of data-intensive software solutions including complex SQL development, ETL pipelines, and analytics infrastructure. Reporting to the Manager, Software Engineering, the Software Engineer III will analyze, design, program, debug, and modify software enhancements and/or new products that process, transform, and deliver content data for business intelligence and operational reporting. This role is responsible for interacting with users to define system requirements and/or necessary modifications in an Agile/Scrum environment. Responsibilities - Design and implement complex SQL queries, stored procedures, and database optimization strategies - Develop and maintain ETL (Extract, Transform, Load) pipelines for data ingestion and transformation - Design and develop data architecture, data models, and data manipulation structures for content management systems - Build and optimize data warehousing solutions using cloud-based platforms - Create and maintain analytics reports and dashboards for business stakeholders - Perform data quality validation and implement data governance best practices - Translate ideas into clear, maintainable code with minimal supervision and perform code reviews - Coordinate with cross-functional teams including product, analytics, and offshore development teams in an Agile/Scrum environment Knowledge and Skills - Proficient with multiple technologies including SQL databases, ETL tools, cloud data platforms, and data visualization tools - Expert-level SQL development skills with deep understanding of query optimization and performance tuning - Strong understanding of ETL design patterns, data pipeline architectures, and data orchestration - Identifies, implements, and applies best practices for data engineering; is the “go to” person on the team - Ability to handle multiple projects and possesses a proven track record of high-quality deliverables - Delivers code with high quality and throughput consistently - Effectively communicates technical concepts to both technical staff and cross-functional teams with varying degrees of technical experience - Possesses a broad understanding of Agile/LEAN principles and ability to apply agile methodology effectively Required Experience - Bachelor’s degree in Computer Science, Engineering, Data Science, or related degree - 5+ years of experience writing complex SQL queries including joins, subqueries, window functions, and CTEs with relational databases (PostgreSQL, MySQL, SQL Server) - 3+ years of experience designing and implementing ETL processes and data pipelines - Experience with data modeling (dimensional modeling, normalization/denormalization, star/snowflake schemas) and data transformation techniques - Experience with programming languages used in data engineering (Python, Java), version control systems (Git), CI/CD pipelines, and Agile/Scrum environments Preferred Experience - Hands-on experience with Snowflake data warehouse and Sigma Computing for building analytics reports and dashboards - Knowledge of healthcare supply chain or healthcare IT systems Estimated Salary Range for this position: $91,000 to $121,000 The base salary range represents the anticipated low and high end of the GHX’s salary range for this position. The base salary is one component of GHX’s total compensation package for employees. Other rewards and benefits include: health, vision, and dental insurance, accident and life insurance, 401k matching, paid-time off, and education reimbursement, to name a few. To view more details of our benefits, visit us here: https://www.ghx.com/about/careers/ #LI-SR GHX: It's the way you do business in healthcare Global Healthcare Exchange (GHX) enables better patient care and billions in savings for the healthcare community by maximizing automation, efficiency and accuracy of business processes. GHX is a healthcare business and data automation company, empowering healthcare organizations to enable better patient care and maximize industry savings using our world class cloud-based supply chain technology exchange platform, solutions, analytics and services. We bring together healthcare providers and manufacturers and distributors in North America and Europe - who rely on smart, secure healthcare-focused technology and comprehensive data to automate their business processes and make more informed decisions. It is our passion and vision for a more operationally efficient healthcare supply chain, helping organizations reduce - not shift - the cost of doing business, paving the way to delivering patient care more effectively. Together we take more than a billion dollars out of the cost of delivering healthcare every year. GHX is privately owned, operates in the United States, Canada and Europe, and employs more than 1000 people worldwide. Our corporate headquarters is in Colorado, with additional offices in Europe. Disclaimer Global Healthcare Exchange, LLC and its North American subsidiaries (collectively, “GHX”) provides equal employment opportunities (EEO) to all employees and applicants for employment without regard to race, color, national origin, sex, sexual orientation, gender identity, religion, age, genetic information, disability, veteran status or any other status protected by applicable law. All qualified applicants will receive consideration for employment without regard to any status protected by applicable law. This EEO policy applies to all terms, conditions, and privileges of employment, including hiring, training and development, promotion, transfer, compensation, benefits, educational assistance, termination, layoffs, social and recreational programs, and retirement.GHX believes that employees should be provided with a working environment which enables each employee to be productive and to work to the best of his or her ability. We do not condone or tolerate an atmosphere of intimidation or harassment based on race, color, national origin, sex, sexual orientation, gender identity, religion, age, genetic information, disability, veteran status or any other status protected by applicable law. GHX expects and requires the cooperation of all employees in maintaining a discrimination and harassment-free atmosphere. Improper interference with the ability of GHX’s employees to perform their expected job duties is absolutely not tolerated. Read our GHX Privacy Policy
Staff Data Engineer – Regulatory
RecargaPayNossa missão é democratizar os meios de pagamentos pelo celular por meio de um serviço inovador, econômico e seguro.
• Define the data architecture and modeling strategy for the Regulatory domain, ensuring scalability, reliability, and regulatory compliance. • Design end-to-end domain data solutions, including transformation layers, curated datasets, and data consumption models. • Provide technical guidance and mentorship to Data Engineers and Analytics Engineers. • Ensure solutions align with company-wide data platform standards and collaborate closely with the Data Platform team. • Design and implement scalable data models (conceptual, logical, and physical) using PySpark and advanced SQL. • Build and maintain data transformation pipelines and curated datasets to support regulatory reporting and analytics. • Work with datasets ingested and managed by the Data Platform team, transforming them into reliable regulatory data products. • Develop batch and streaming pipelines using Spark, Airflow, Kafka, and Databricks. • Lead complex data initiatives, driving technical design and delivery across teams. • Produce technical specifications and architecture documentation for data solutions. • Break down complex problems into incremental deliveries, prioritizing MVPs and quick wins with a clear evolution roadmap. • Collaborate with engineering, product, and compliance stakeholders to translate regulatory requirements into technical solutions. • Implement best practices for pipeline modularization, testing, CI/CD, version control, and observability. • Promote standards for data quality, documentation, and maintainable data pipelines.


