Hadoop Developer

Location

United States

Posted

3 days ago

Salary

$100K - $150K / year

Seniority

Senior

Bachelor’s Degree

Job Description

Bright Vision Technologies

Bright Vision Technologies is a forward-thinking software development company dedicated to building innovative solutions that help businesses automate and optimize their operations. We leverage cutting-edge technologies to create scalable, secure, and user-friendly applications. As we continue to grow, we’re looking for a skilled Hadoop Developer to join our dynamic team and contribute to our mission of transforming business processes through technology. This is a fantastic opportunity to join an established and well-respected organization offering tremendous career growth potential.

Job Title: Hadoop Developer
Location: 100% Remote (Continental United States)
Position Type: In-house Bright Vision Technologies SOW engagement (no third-party client or vendor)
Salary: $100K - $150K
Experience: 5+ years
Sponsorship: No new H1B sponsorship available. H1B transfers welcomed for qualified candidates.
Employment Type: Full-time, direct W2 with Bright Vision Technologies (no C2C, no 1099, no third-party)
Engagement: Long-term, multi-year, aligned to the Bright Vision SOW delivery roadmap
Compensation: Competitive base salary commensurate with experience, plus benefits.

Employment Terms & Visa Policy

This is a 100% remote, full-time, direct W2 position with Bright Vision Technologies. This role is part of Bright Vision Technologies’ in-house Statement of Work (SOW) engagement. The client, end customer, and employer for this position is Bright Vision Technologies; there is no third-party client, vendor, or implementation partner involved. We do not engage in C2C, 1099, or third-party arrangements for this role. Strictly no C2C, 1099, or third-party companies. All our roles are W2; no third-party brokering, please. Candidates must be willing to work directly as a full-time W2 employee of Bright Vision Technologies and contribute to our in-house SOW deliverables.

No new H1B sponsorship is available for this role.
However, candidates who are currently on a valid H1B visa and require a transfer are welcome to apply. We will support H1B transfers for qualified candidates. For every role, a technical coding assessment is mandatory. Please apply only if you are confident in your technical abilities and hands-on experience.

Job Summary

We are seeking an experienced Hadoop Developer to design, build, and operate large-scale data processing pipelines and analytics platforms on Hadoop and related big-data ecosystems. In this role you will be responsible for ingesting, transforming, and analyzing massive volumes of structured and unstructured data to support enterprise analytics, machine learning, and reporting workloads. The ideal candidate will combine deep technical expertise across the Hadoop ecosystem with strong software engineering fundamentals and a clear understanding of how to deliver reliable, performant, and cost-effective data platforms in production environments.

Key Responsibilities

- Design, develop, and operate end-to-end big-data pipelines on Hadoop, ingesting data from a diverse mix of relational, file-based, streaming, and API-driven sources.
- Build robust ETL/ELT workflows using Apache Spark, Hive, Pig, and Sqoop, with strong attention to data quality, idempotency, error handling, and recoverability.
- Develop high-throughput streaming data pipelines using Kafka, Spark Streaming, or Flink, and integrate them with downstream analytical and operational systems.
- Optimize Spark and MapReduce jobs through careful tuning of partitioning, memory, serialization, and skew handling to meet demanding SLAs at minimal cost.
- Design and maintain data models and storage layouts on HDFS, Hive, HBase, and modern lakehouse formats (Parquet, ORC, Delta, Iceberg, Hudi) to balance flexibility and performance.
- Implement data governance, lineage, and quality controls in collaboration with data governance and security teams.
- Build robust monitoring, alerting, and logging strategies for big-data pipelines, including job-level SLAs and proactive failure detection.
- Partner with data scientists and analysts to deliver curated, reliable, and well-documented datasets that accelerate their work.
- Automate pipeline orchestration using Airflow, Oozie, or similar workflow engines, with clean dependency management and clear ownership boundaries.
- Continuously evaluate and adopt new technologies in the big-data and cloud ecosystem (EMR, Databricks, Snowflake, BigQuery) where they offer meaningful improvements.
- Lead performance reviews and architecture audits of existing pipelines, proposing concrete refactoring and optimization initiatives.
- Document data architectures, schemas, pipeline behaviors, and operational runbooks in a way that makes the platform supportable as the team scales.
- Mentor junior engineers and contribute to the team’s engineering standards and best practices.

Required Qualifications

- Bachelor’s degree in Computer Science, Engineering, or a related technical discipline.
- Five or more years of professional experience designing and operating big-data pipelines on Hadoop.
- Strong hands-on expertise with Apache Spark (Scala, Python, or Java) in production environments.
- Solid experience with Hive, HDFS, Sqoop, HBase, and the broader Hadoop ecosystem.
- Hands-on experience with streaming data platforms such as Kafka, Spark Streaming, or Flink.
- Strong SQL skills and experience working with both relational and NoSQL data stores.
- Experience with workflow orchestration tools such as Airflow or Oozie.
- Solid understanding of distributed systems concepts, including partitioning, replication, and fault tolerance.
- Strong scripting skills in Python or Shell.
- Excellent troubleshooting, debugging, and documentation skills.

Preferred Qualifications

- Experience operating Hadoop on cloud platforms such as AWS EMR, Azure HDInsight, or Databricks.
- Familiarity with modern lakehouse formats (Delta, Iceberg, Hudi).
- Exposure to data governance tooling such as Apache Atlas or Collibra.
- Experience with Kubernetes-based data platforms (Spark-on-K8s, Trino).
- Hands-on experience with CI/CD and infrastructure-as-code in data engineering workflows.
