Job Closed
This listing is no longer active.
Mactores is a trusted leader among businesses in providing modern data platform solutions.
Senior AWS Data Engineer
Location
India
Posted
55 days ago
Salary
0
Seniority
Senior
Job Description
Senior AWS Data Engineer
Mactores
• Develop and maintain data pipelines using Amazon EMR or Amazon Glue. • Create data models and end-user querying using Amazon Redshift or Snowflake, Amazon Athena, and Presto. • Build and maintain the orchestration of data pipelines using Airflow. • Collaborate with other teams to understand their data needs and help design solutions. • Troubleshoot and optimize data pipelines and data models. • Write and maintain PySpark and SQL scripts to extract, transform, and load data. • Document and communicate technical solutions to both technical and non-technical audiences. • Stay up-to-date with new AWS data technologies and evaluate their impact on our existing systems.
Job Requirements
- Bachelor's degree in Computer Science, Engineering, or a related field.
- 3+ years of experience working with PySpark and SQL.
- 2+ years of experience building and maintaining data pipelines using Amazon EMR or Amazon Glue.
- 2+ years of experience with data modeling and end-user querying using Amazon Redshift or Snowflake, Amazon Athena, and Presto.
- 1+ years of experience building and maintaining the orchestration of data pipelines using Airflow.
- Strong problem-solving and troubleshooting skills.
- Excellent communication and collaboration skills.
- Ability to work independently and within a team environment.
- AWS Data Analytics Specialty Certification preferred.
- Experience with Agile development methodology preferred.
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
• Design, build, and optimize data pipelines, architectures, and data sets. • Work with big data technologies to solve complex data processing challenges. • Implement ETL processes and data warehousing solutions. • Attend meetings, providing pre-sales support and technical insights. • Collaborate with cross-functional teams to integrate data solutions into broader projects. • Engage in cloud-based data solutions using AWS, Azure, or GCP. • Ensure data integrity, efficiency, and scalability in all solutions.
• Design and evolve generative AI solutions for real business use, focusing on agents, prompting, RAG and evaluation. • Design agents (agent workflows), tools (tool calling) and prompts for real-world use cases. • Build/evolve RAG pipelines (ingestion, chunking, embeddings, retrieval, reranking, grounding). • Define guardrails and policies: security, privacy, compliance, and hallucination prevention. • Create evaluation strategy: metrics, datasets, automated tests, and acceptance criteria. • Optimize quality and cost (latency, context, error rates, caching, model routing). • Partner with engineering for production readiness (logging, auditing, monitoring, versioning).
• You will lead a multidisciplinary squad (AI + Engineering) to deliver AI solutions in production, focusing on process automation, reliability, governance, and value generation. • Lead a 5-person squad (AI Process Engineers / AI Scientists + Software/Integration Engineer). • Translate business objectives into clear deliverables (scope, success metrics, risks, roadmap). • Own production outcomes: adoption, quality, stability, cost, and impact. • Prioritize the backlog, manage stakeholders and align expectations (business, technology, compliance). • Ensure disciplined execution (rituals, quality, documentation, governance, and audit). • Drive solution architecture and process design decisions (with support from Tech Lead/Science/Governance).
• Own the design, build, and optimization of end-to-end data pipelines that power our vendor universe. • Establish and enforce best practices in data modeling, orchestration, and system reliability. • Collaborate with product, engineering, and business stakeholders to translate requirements into robust, scalable data solutions. • Work extensively with Databricks and Airflow for large-scale data processing and orchestration. • Troubleshoot and resolve complex pipeline issues to ensure reliability and performance. • Contribute to the team’s technical strategy, helping drive improvements in scalability, performance, and efficiency. • Lead, mentor, and support engineers through challenges, code reviews, and project execution.



