Connecting You to Better: MeridianLink is the developer of the industry's first multi-channel loan origination system.
Principal Data Architect – AI
Location
United States
Posted
23 days ago
Salary
$126K - $214.9K / year
Seniority
Lead
Job Description
Principal Data Architect – AI
MeridianLink
• Define the enterprise data architecture: Own the conceptual, logical, and physical data models for MeridianLink's analytical and operational data platform, including source-aligned, integrated, and consumption-ready layers. • Build the meta-model: Design and maintain a meta-model that captures entities, relationships, business definitions, ownership, lineage, sensitivity classifications, and SLAs — and make sure it is wired into our tooling, not stuck in a slide deck. • Drive the lakehouse strategy: Architect our medallion (bronze / silver / gold) Delta Lake patterns on Databricks; define standards for partitioning, clustering, schema evolution, slowly changing dimensions, and historical reproducibility. • Be hands-on: Write PySpark, SQL, and Delta Lake code. Build reference implementations, prototype patterns, review pull requests, and personally model critical domains rather than delegating every detail. • Lead data integration design: Set patterns for ingestion through Informatica Data Management Cloud (IDMC) and direct Databricks pipelines, including CDC, batch, streaming, and API-based sourcing from our SaaS products and third-party systems. • Champion data governance and lineage: Partner with data governance, security, and compliance leaders to operationalize cataloging, lineage, classification, masking, and access controls across the platform (Unity Catalog, IDMC, and adjacent tools). • Standardize data modeling practices: Establish the standards, naming conventions, and review processes used by the Data Engineering team. Coach engineers on dimensional modeling, Data Vault, and other techniques where they best fit the use case. • Partner across the business: Work closely with Product, Engineering, Analytics, ML, Finance, Risk, and Customer-facing teams to translate business needs into durable data designs. • Influence the roadmap: Identify gaps in tooling, capability, and skill; propose investments; and drive multi-quarter initiatives that materially improve how MeridianLink uses its data.
Job Requirements
- 12–15+ years of progressive experience in data engineering, data warehousing, and data architecture roles, with at least the most recent several years at the architect level.
- Demonstrated experience as a Data Architect at a SaaS company in the FinTech or financial services software space (lending, banking, payments, capital markets, insurance, or a closely related domain).
- Deep, hands-on expertise with Databricks and PySpark on Azure, including Delta Lake, Unity Catalog, structured streaming, and performance tuning at scale.
- Production experience with Informatica Data Management Cloud (IDMC) — or comparable enterprise integration platforms — for ingestion, transformation, and metadata-driven pipelines.
- Proven track record of designing and implementing detailed meta-models and end-to-end data models (conceptual, logical, and physical) that have shipped to production and stood up over time.
- Strong command of dimensional modeling (Kimball), Data Vault 2.0, and modern lakehouse patterns, including the ability to choose the right approach for the right use case.
- Expert SQL skills and strong proficiency in Python/PySpark; comfortable writing the code, not just the diagrams.
- Demonstrated experience implementing data governance, lineage, and metadata management programs (e.g., Unity Catalog, IDMC Data Governance, Collibra, Atlan, or similar).
- Working knowledge of FinTech-relevant regulatory and compliance considerations (e.g., GLBA, SOC 2, PCI, NIST, state lending regulations) and how they shape data design.
- Excellent written and verbal communication skills; able to explain complex data concepts to engineers, executives, customers, and auditors.
Benefits
- Insurance coverage (medical, dental, vision, life, and disability)
- Flexible paid time off
- Paid holidays
- 401(k) plan with company match
- Remote work
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Management of Cogito suite of reporting tools • Cogito Security management • Maintenance of Cogito Related System Wide Settings • Coordinates Analytics Processes/build across all Epic Applications • Oversite of Cogito Tools build and provisioning standards • Coordinating, auditing, troubleshooting technical system functionality – including reporting queues • Represent Cogito in Epic governance groups • Coordination & Management of Cogito related projects – including Epic Releases
• Architect and develop large-scale, mission-critical BI and data platform solutions serving millions of users across the globe, leveraging AWS native technologies including Athena, Redshift, Glue, QuickSight, and S3. • Lead the design and implementation of robust data pipelines, data lakes, and data warehouses using modern architectures (Iceberg, Parquet, columnar formats) to support real-time and batch analytics at scale. • Drive technical strategy and architectural decisions for the BI platform, including data modeling, query optimization, performance tuning, and cost optimization across AWS services. • Build and maintain sophisticated back-end services, ETL/ELT workflows, and front-end analytics applications using Python, SQL, React, and modern web technologies. • Design and implement efficient data storage solutions across relational databases (Redshift, PostgreSQL) and non-relational databases (DynamoDB, S3), ensuring optimal performance and cost-efficiency. • Develop and maintain REST APIs and event-driven architectures to enable seamless integration between data services, analytics tools, and customer-facing applications. • Serve as the technical lead and mentor for engineering teams, conducting architecture reviews, code reviews, and providing guidance on complex technical challenges. • Collaborate with cross-functional teams including data engineers, analytics engineers, product managers, and DevOps to deliver innovative BI solutions that drive business value. • Champion engineering excellence by establishing best practices, design patterns, and coding standards for data-intensive applications. • Lead Agile ceremonies, drive sprint planning, and ensure timely delivery of high-quality software solutions while maintaining technical debt at manageable levels. • Evaluate and integrate emerging AWS services and open-source technologies to continuously improve platform capabilities and developer productivity. • Troubleshoot and resolve complex performance issues in distributed data systems, optimizing query performance, data processing workflows, and infrastructure costs. • Participate in strategic planning and roadmap development, translating business requirements into scalable technical solutions. • Contribute to the team on-call rotation, providing expert-level support for production environments and mentoring team members on incident response.
• Serve as the expert for ETL and DB solutions, collaborating with business stakeholders and IT teams to define requirements, gather data, and implement optimized data solutions. • Design, implement, and maintain data systems in Snowflake to ensure data scalability and accessibility. • Implement and manage data lakes and data warehouses, creating pipelines and data models to enable efficient analytics and reporting. • Establish and document strategies for managing data transfer processes, including secure file transfers (SFTP), batch data processing, and real-time streaming. • Build and optimize ETL pipelines for data extraction, transformation, and loading into operational databases or analytical platforms. • Integrate and support data visualization tools such as Power BI, Sisense, Google Looker, Tableau, or similar platforms to enable actionable insights for business stakeholders. • Develop and maintain optimized data models for dashboards and reporting, ensuring compatibility with visualization tools. • Plan, coordinate, and implement database migrations, upgrades, and patches with minimal downtime. • Define and enforce database governance policies, including data integrity, security, and compliance with regulatory requirements. • Analyze and resolve database performance issues by optimizing queries, indexes, and schema designs. • Partner with vendors to evaluate, select, and implement database tools, services, and technologies; stay informed about product roadmaps and industry trends. • Develop disaster recovery and high-availability solutions, including replication, clustering, and failover.
• Design and implement data pipelines (ETL/ELT) using modern tools (e.g., Apache Airflow, DBT, Dataflow); • Integrate data from transactional systems, APIs, and relational and non-relational databases; • Create and maintain optimized data structures in analytical environments (data lakes and data warehouses); • Ensure data governance, data quality, and data cataloging; • Automate routines for data extraction, transformation, and loading; • Support data scientists, analysts, and product squads with reliable, well-modeled data; • Participate in modernization and data migration projects to the cloud; • Monitor and resolve failures in pipelines and other critical data processes.




