Protect identities, stop threats, and deliver dynamic access to empower and secure a work-from-anywhere world.
Senior Data Engineer
Location
United States
Posted
66 days ago
Salary
0
Seniority
Senior
Job Description
Senior Data Engineer
BeyondTrust
• Take ownership of data modelling to solve complex identity security problems. • Solve complex analytical problems across disparate systems to provide a unified view of security posture in the datalake • Optimize data workloads at a software level by improving processing efficiency. • Identify opportunities for engineering process improvement and collaborate with senior resources to execute a plan of action. • Use monitoring and observability best practices to ensure optimal pipeline performance. • Assist with ML Operations to ensure optimal model efficiency. • Leverage CI/CD best practices to effectively develop and release source code.
Job Requirements
- Strong programming and technology knowledge in cloud data processing.
- Spark experience is needed, existing Databricks knowledge is a plus.
- Interest and aptitude for cybersecurity; interest in identity security is highly preferred.
- Technical understanding of underlying systems and computation minutiae.
- Experience working with distributed systems and data processing on object stores.
- Ability to work autonomously
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
Data Engineer
Software Technology Inc.Creating Exceptional Experiences | Digital Transformations| Staffing & Recruiting
• Develop and deploy data pipelines for streaming and batch data sources • Create data models and implement business logic using required tools and languages • Collaborate with teams for performance tuning and data processing • Utilize Azure and Spark technologies in daily tasks
Data Engineer
DataChefWe develop Data Mesh, Data Lake and Predictive Analytics solutions on AWS to demystify data and simplify information.
• Design and build a green field data mesh platform on AWS. • Create a scalable data foundation for a large distribution company. • Help in building stream processing backend using AWS services, Spark and Kafka. • Ensure that the data pipelines are robust and handle low-quality raw data effectively. • Work collaboratively within a small team to achieve project goals.
• Create and maintain optimal data pipeline architecture. • Assemble large, complex data sets that meet functional / non-functional business requirements. • Identify, design, and implement internal process improvements: automating manual processes, optimizing data delivery, re-designing infrastructure for greater scalability, etc. • Build the infrastructure required for optimal extraction, transformation, and loading of data from a wide variety of data sources using SQL and AWS ‘big data’ technologies. • Build analytics tools that utilize the data pipeline to provide actionable insights for Information Asset as well as Information Asset Customers’ operational efficiency and other key business performance metrics. • Work with stakeholders including the Executive, Product, Data and Design teams to assist with data-related technical issues and support their data infrastructure needs. • Create data tools for analytics to build and optimize offering into an industry leading solution. • Work with data and analytics experts to strive for greater functionality data systems.
Senior Data Engineer – GovTech, Public Sector
Sigma Software GroupWe support enterprises, product houses, and startups with custom software solutions development and IT consulting.
• Design and implement data ingestion pipelines for batch and streaming data • Configure and maintain data orchestration workflows (Airflow, NiFi) and CI/CD automation for data processes • Design and organize data layers within Data Lake architecture (HDFS, Iceberg, S3) • Build and maintain secure and governed data environments using Apache Ranger, Atlas, and SDX • Develop SQL queries and optimize performance for analytical workloads in Hive/Impala • Collaborate on data modeling for analytics and BI, ensuring clean schemas and dimensional models • Support machine learning workflows using Spark MLlib or Cloudera Machine Learning (CML)




