OneSeven Tech logo
OneSeven Tech

Founded by James Sullivan, OneSeven Tech is a premier digital product agency serving startups and enterprises. Our clients have collectively raised over $100M in VC, and our enterprise partners include 2,000+ person hospitality groups and NASDAQ-listed companies. We work fully remote, move fast with 1-week sprints, and focus on elegant solutions to complex problems.

Data Engineer Ssr / Sr — Python, AWS, Terraform, Pinecone

Data EngineerData EngineerFull TimeRemoteSeniorTeam 11-50

Location

Argentina

Posted

59 days ago

Salary

$3.5K - $5K / month

Seniority

Senior

Job Description

Data Engineer Ssr / Sr — Python, AWS, Terraform, Pinecone

OneSeven Tech

Location: Remote — Argentina, Colombia, México, Chile, Uruguay Job Type: Full-time contract Date Posted: March, 2026 Project Brief We're building a cloud-native data infrastructure powering AI-driven products — from ingestion pipelines and vector search to MCP-based integrations and multi-tenant architectures. You'll own the design, deployment, and reliability of data pipelines end-to-end, working across AWS, Terraform, and Pinecone in a fast-moving environment where architectural decisions matter and ownership is real. Must-Haves - Strong Python skills — Lambda functions, scripting, API consumption, and data processing pipelines - Hands-on AWS experience across: Lambda, S3, API Gateway, ECS/Fargate, Secrets Manager, Cognito, CloudWatch, and SQS + DLQ - Terraform for infrastructure-as-code — remote state with S3 + DynamoDB, modular resource definitions, and multi-environment management - Docker — containerization, Dockerfile hardening, and ECS task image builds - Pinecone — index creation, namespace design for multi-tenancy, metadata schema, upsert logic, and hybrid search configuration - Experience with embedding models and ETL/data transformation pipelines - MCP protocol experience — FastMCP SDK, Streamable HTTP transport, tool definition - REST API consumption — Cognito-authenticated requests, response parsing, and S3 persistence - Git/GitHub — trunk-based branching, semantic PRs, and branch-per-feature workflows - JSON schema design for pipeline service contracts and API response structures - Retry and error handling patterns — exponential backoff and failure architecture - Clear English communication and ability to work independently with minimal supervision Nice to Have - Incremental update and change detection logic for pipeline efficiency - CloudWatch alarm design and pipeline health dashboards - Content-hash deduplication for embedding cost optimization - Lambda memory and concurrency tuning - VPN site-to-site configuration - LangChain / LangGraph experience - Prompt engineering fundamentals - GitHub Actions — CI/CD pipeline setup and deployment automation - Marketing Business Acumen What You Will Do - Design, build, and maintain cloud-native data pipelines across AWS Lambda, S3, ECS/Fargate, and SQS - Manage and evolve infrastructure-as-code using Terraform across multiple environments - Build and optimize vector search workflows in Pinecone — including multi-tenant namespace design, metadata schemas, and hybrid search - Develop and improve MCP server integrations using FastMCP SDK and Streamable HTTP transport - Implement and harden authentication flows using AWS Cognito and Secrets Manager - Monitor pipeline health via CloudWatch — log groups, alarms, and observability for Lambda and ECS - Apply cost modeling discipline — flag and mitigate high-cost operations (e.g. embedding calls at scale) - Work within an established SA governance model, contributing to architectural decisions with clear documentation and standards - Collaborate with engineering and applied science teams to deliver reliable, scalable data infrastructure Why This Could Be Your Next Big Move - 🧱 Own the data foundation — You're not maintaining legacy systems. You're designing the infrastructure that AI products are built on top of. - 🤖 Work at the AI/data intersection — Vector search, embeddings, MCP integrations, and multi-tenant architectures. This is where data engineering and AI meet. - ☁️ Deep AWS stack — Lambda, ECS, Cognito, SQS, Secrets Manager and more — a real opportunity to broaden and deepen your cloud expertise across a full production stack. - 🔝 High ownership, high impact — Fast-moving environment where your architectural decisions shape the product directly, with visibility across the entire engineering org. Benefits & Compensation - 💵 $[3,500] – $[5,000]/month — paid in USD, bi-weekly via Deel - 🌎 Fully Remote — work from anywhere in Latin America - 📄 Full-time contract with a U.S. company — 6-month initial term, full-time conversion - 🏖️ Paid PTO — competitive package, grows with tenure - 🤝 Referral Program — earn a bonus for referring talent that gets hired To Apply Please send your resume in English. Include the following depending on your role: - 🔗 LinkedIn Profile URL (required) - 💻 GitHub Portfolio or code samples — show us your pipeline and infrastructure work (required) - ✉️ Cover Letter — tell us about a data pipeline or cloud architecture you built and the impact it had (optional but encouraged) About OneSeven Tech Founded by James Sullivan, OneSeven Tech is a premier digital product agency serving startups and enterprises. Our clients have collectively raised over $100M in VC, and our enterprise partners include 2,000+ person hospitality groups and NASDAQ-listed companies. We work fully remote, move fast with 1-week sprints, and focus on elegant solutions to complex problems. Salary: 3500 - 5000 USD Per month

Related Categories

Related Job Pages

More Data Engineer Jobs

Full TimeRemoteTeam 201-500Since 2014H1B No Sponsor

¡Trabaja en DaCodes! Somos una firma de expertos en software y transformación digital de alto impacto. Durante 10 años hemos creado soluciones enfocadas en la tecnología e innovación gracias a nuestro equipo de casi 300 talentosos #DaCoders, incluyendo desarrolladores, arquitectos, diseñadores UX/UI, PMs, QA testers y más. Nuestro equipo colabora en proyectos con clientes en LATAM y Estados Unidos, logrando resultados sobresalientes. En DaCodes, tendrás la oportunidad de impulsar tu desarrollo profesional, trabajar en diversos proyectos dentro de distintas industrias, y contribuir al diseño, implementación y optimización de infraestructuras en la nube. Nuestros DaCoders tienen un gran impacto en el éxito de nuestro negocio y el de nuestros clientes. Serás el experto que participará en nuestros proyectos y tendrás acceso a startups disruptivas y marcas globales. ¿Te interesa? Buscamos un Data Solutions Engineer con perfil técnico híbrido, capaz de desenvolverse tanto en el desarrollo de software como en la construcción de soluciones de datos modernas. Este rol combina mantenimiento de sistemas existentes, desarrollo de nuevos componentes y la implementación de pipelines de datos utilizando tecnologías como Microsoft Fabric y Databricks. Es ideal para alguien con mentalidad práctica, orientado a la ejecución, que pueda moverse con soltura entre entornos legacy y procesos de modernización tecnológica. Tendrás un rol activo en la construcción de soluciones end-to-end, colaborando con equipos técnicos y de negocio para resolver problemas reales mediante tecnología.

Mexico
Virtual Rockstar Careers logo

Data Migration, Processing Specialist

Virtual Rockstar Careers

To build & strengthen families across the globe.

Data Engineer59 days ago
Full TimeRemoteTeam 51-200H1B No Sponsor

• Upload and organize patient records from multiple data sources (CSV, spreadsheets, clinical documents) • Prepare and structure large datasets for system migration • Ensure accurate and efficient data transfer into the platform • Clean and standardize datasets from external systems • Identify missing, duplicate, or inconsistent data • Normalize formats (names, dates, medical data) • Validate data accuracy before and after migration • Utilize AI tools to extract and process patient data • Generate structured patient records • Review and validate AI-generated outputs for accuracy • Perform detailed QA checks to ensure near-zero error rates • Identify and resolve issues proactively before they impact implementation • Maintain high standards of data integrity and consistency • Update implementation trackers and progress reports • Communicate updates, blockers, and issues clearly with the team.

Arizona
$7 / hour
Job Closed
Guidehouse logo

Senior Data Engineer – Multiple Levels

Guidehouse

Guidehouse, a "next-generation consultancy" and a portfolio company of Veritas Capital, provides management, risk consulting, and technology services to help cl

Data Engineer59 days ago

• Assist in developing and maintaining data pipelines and ETL/ELT processes under the guidance of more senior engineers. • Write Python and SQL to extract, transform, validate, and load data from common sources. • Perform data quality checks (validation, reconciliation, basic monitoring) and help troubleshoot data issues. • Develop dashboards and analytic products using data visualization tools (e.g., Power BI, Tableau). • Support cloud-based data workloads (e.g., Azure/AWS/GCP basics) and learn platform-native services and patterns. • Document pipeline steps and technical processes to support maintainability and knowledge transfer. • Participate in team delivery rhythms (standups, sprint ceremonies) and contribute to reviews with a learning mindset. • Design, build, test, and maintain scalable data pipelines (batch and/or streaming as applicable) with increasing independence. • Integrate data from multiple sources, resolve inconsistencies, and deliver curated datasets for analytics and operational use. • Own data quality for assigned domains by implementing validation checks, reconciliation, and monitoring/alerting patterns. • Build, maintain, and deploy data products for analytics and data science teams on cloud platforms (e.g. AWS, Azure, GCP). • Optimize performance of pipelines and queries (tuning, partitioning patterns, efficient compute usage). • Collaborate cross-functionally with analysts, data scientists, and stakeholders to translate requirements into technical designs and delivery plans. • Produce and maintain technical documentation for data flows, data models, and operational procedures. • Contribute to governance and compliance practices (access controls, lineage awareness, controlled data handling) within your scope. • Lead the design and build of scalable data pipeline architectures and tools, including patterns for reliability, security, and maintainability. • Drive ETL/ELT and data quality strategy (frameworks, standards, repeatable testing/monitoring approaches) and raise engineering maturity across the team. • Architect solutions in cloud data platforms (e.g., Azure + Databricks, Snowflake) and guide implementation tradeoffs (cost, performance, scalability, governance). • Design data stores and interactions across storage types (relational, warehouse, lake/lakehouse, and NoSQL where needed) aligned to use cases. • Enable data science / ML readiness by delivering well-modeled, reliable, well-documented datasets and features. • Lead requirements gathering and technical planning; translate ambiguous problem statements into actionable architectures, backlogs, and delivery increments. • Champion data quality and governance standards through the development of sophisticated data quality frameworks, dashboards, and feedback loops to ensure transparency in data completeness, consistency, and quality for partners and researchers. • Own client and stakeholder engagement for your workstream, including organizing/leading meetings, producing clear written outputs, and tracking follow-through. • Mentor and review: provide strong code/design reviews, coach engineers, and help remove technical blockers.

United States
$85K - $141K / year
Job Closed
Digital Media Solutions logo

Manager, Data Engineer

Digital Media Solutions

Digital Media Solutions is a leading provider of technology-enabled digital performance advertising solutions.

Data Engineer59 days ago
Full TimeRemoteTeam 201-500H1B No Sponsor

• Lead, mentor, and grow a team of data engineers and architects • Define and execute the technical roadmap for production database systems (MySQL, PostgreSQL, DynamoDB, Elastic) • Own the architecture and governance of binlog replication, logical replication, and CDC workflows • Drive strategy and reliability for ELT/ETL pipelines and Kafka-based streaming architectures • Set standards for performance optimization, query tuning, indexing, and database scaling across teams • Oversee backup, failover, disaster recovery (PITR), and incident response for all production data systems • Drive cost efficiency, infrastructure optimization, and monitoring across cloud-managed data services (AWS RDS, Aurora, DynamoDB) • Champion data integrity, security, and compliance standards across all data engineering work • Partner cross-functionally with backend, data science, infrastructure, and product teams to align on data platform priorities • Establish engineering guardrails, best practices, and documentation to enable team autonomy and quality at scale • Lead the evaluation and selection of next-generation data warehousing technology (Snowflake, Databricks, AWS Redshift Serverless) — assessing performance, cost, ecosystem fit, and migration complexity to inform a platform decision • Own the design of an upgraded data model for the warehouse in partnership with data engineers and architects, establishing standards for schema design, partitioning, access patterns, and downstream consumption • Oversee the end-to-end migration from the current Redshift warehouse — planning the phased approach, managing cutover risk, and ensuring continuity of downstream reporting and analytics throughout

Alabama + 26 moreAll locations: Alabama | Arizona | California | Colorado | Connecticut | District Of Columbia | Florida | Idaho | Illinois | Nevada | New Hampshire | New Jersey | New York | North Carolina | Ohio | Maryland | Massachusetts | Michigan | Missouri | Pennsylvania | Rhode Island | Tennessee | Texas | Utah | Virginia | Washington | Wisconsin
$180K - $200K / year
Job Closed