Lineate is a US-based international software development company with over two decades of experience. We deliver AI-driven custom solutions for FinTech, HealthTech, AdTech, and beyond, empowering businesses to grow smarter, faster, and more efficiently. Building Custom AI Solutions: Deploying high-impact, AI-enabled technology utilizing IDP, Agentic RAG. Cloud and Data Infrastructure: Optimizing business operations with our data management and cloud computing solutions. Team Augmentation: Providing specialized experts in FinTech, AdTech, and HealthTech to integrate seamlessly and accelerate project timelines. Lineate is proud to be an Equal Opportunity Employer - We do not hire on the basis of race, colour, religion, creed, gender, national origin, citizenship, age, disability, veteran status, marital status, pregnancy, parental status, sex, gender expression or identity, sexual orientation, or any other basis protected by Georgian legislation. All employment is decided based on qualifications, merit, and business need. Learn more about Lineate at our website: www.lineate.com
Software Engineer Java + Data (PySpark)
Location
EST (UTC-5)
Posted
39 days ago
Salary
0
Seniority
Mid Level
Job Description
Software Engineer Java + Data (PySpark)
Careers at Lineate
Role Description Design, develop, and maintain scalable backend services using Java and Python. - Build and optimize data processing pipelines and APIs for high-performance applications. - Collaborate with cross-functional teams to deliver reliable and efficient solutions. - Improve system performance, scalability, and reliability. - Work with large datasets to support search, recommendation, or ML-driven features. - Contribute to architecture decisions and technical design. - Write clean, maintainable, and well-documented code. Qualifications - 6+ years of commercial software development experience. - Strong hands-on experience with both Java and Python (primarily PySpark code). - Experience in designing, developing, and optimizing scalable data processing pipelines and backend APIs for high-performance applications. - Solid understanding of backend development principles and system design. - Experience working with APIs, microservices, and distributed systems. Requirements - Nice-to-have: - Databricks OR AWS EMR OR Hadoop. - Search technologies experience, such as: - Lexical search (e.g., Solr, Elasticsearch). - Semantic search, vector search, or RAG-based systems. - Search relevance tuning and optimization. - Machine Learning experience, especially in: - Recommendation systems. - User behavior prediction (e.g., click-through rate prediction, relevance estimation). - Practical ML application in production systems. Benefits - B2B contract with our US office. - NY working hours (at least 6 hours overlap).
Related Guides
Related Categories
Related Job Pages
More Data Engineer Jobs
IT / Atlassian Admin / Power BI & Data Engineer
TOWA GmbHYour Teamlead is Matthias Frank - click here to get to know him.
Role Description You love it when tools run smoothly, data is reliable, and reports actually say something meaningful? Then you're in the right place. In this role, you'll be the link between our Atlassian setup, our reporting, and our internal IT – with a clear focus on Power BI and Data Engineering. You'll join our team at our Krakow location and work with us on a B2B basis. Power BI & Data Engineering (High Priority – your main focus) - You build, maintain, and continuously improve our Power BI reports and dashboards. - You take care of data modeling and data quality in our data warehouse. - You develop and maintain our data pipelines (Airflow, Azure). - You support ad-hoc requests from the business. - You work closely with our existing data team – including a structured handover phase. Atlassian Administration (Medium Priority) - You own our entire Atlassian stack (Jira, Confluence, Tempo). - You manage users and permissions and keep things clean with regular audits. - You set up and structure projects in Jira properly. - You administer Tempo (time tracking, reporting). - You continuously improve workflows, automations, and dashboards. - You keep our Confluence structure (spaces, pages, permissions) tidy. - You're the go-to person for internal users. Operational IT Support (Low Priority – nice to have) - You support license management (Google Workspace, M365, SaaS tools). - You evaluate and implement new tools. - You help roll out security measures and IT policies. - You document processes and keep them up to date. Qualifications - 3+ years of experience in a comparable role (Intermediate to Senior level). - Experience with Power BI – you don't just build reports; you think in data models. - Solid understanding of data warehousing and data engineering (Airflow, Azure are a plus). - Experience with Atlassian products (Jira, Confluence, Tempo) – ideally including administration. - Structured, independent way of working and a sense of ownership. - An eye for data quality and clean setups – you leave things better than you found them. - Strong communication skills in English (Polish and/or German are a plus). - You work independently on a B2B basis. - Nice to have: experience with license management, IT policies, or security topics. Requirements - Your CV. - Your salary expectations for the performance you bring. - Your notice period and/or earliest possible start date. Benefits - Pure team spirit: Collaborative teams with genuine support, strong connections, and a healthy dose of humor. - Transparent structures: Flat hierarchies and a lived open-door mentality; communication at eye level is part of our culture. - Work from anywhere: Whether from home or temporarily abroad, you decide where you work most productively. - Flexible working hours: A wide flexitime framework lets you define your own rhythm while ensuring reliability for client and team meetings. - Models that fit your life: Full-time or part-time, permanent or temporary, together we’ll find the setup that works best for you. - Exciting projects: Innovative, diverse, and with a steep learning curve, you’ll help shape real progress with us. - Guaranteed growth: Individual learning and development budget for workshops, certifications, and coaching. - Career paths: Even with flat hierarchies, clear structures support your professional growth and development. - Mobility & events: Up to €467 for public transport or the Deutschlandticket, plus regular team events and visits to trade fairs and conventions. - Loft offices: Centrally located, modern, and creative spaces designed for collaboration, ideas, and, of course, snacks & drinks. - Rewards: A bonus system that recognizes both your personal achievements and contributions to the company’s success.
Senior Data Engineer, Data Lakehouse Infrastructure
TRM LabsTRM Labs specializes in blockchain investigations and risk management, empowering organizations to detect, investigate, and prevent crypto-related fraud and fin
Title: Senior Data Engineer, Data Lakehouse Infrastructure Location: North America Department: R&D Job Description: TRM Labs provides AI-powered intelligence solutions that help public and private sector agencies investigate and disrupt crime. TRM's platforms enable investigators to trace illicit activity, build cases, and construct operating pictures of threat networks. Leading agencies and businesses worldwide rely on TRM to make the world safer and more secure. We’re building the foundational data infrastructure powering next-generation analytics at scale. As part of our mission, we’re architecting a modern data lakehouse to support complex workloads, real-time data pipelines, and secure data governance—at petabyte scale. We are looking for a Senior Data Engineer to help us design, implement, and scale core components of our lakehouse architecture. You will have ownership over data modeling, ingestion, query performance optimization, and metadata management using cutting-edge tools and frameworks like Apache Spark, Trino, Hudi, Iceberg, and Snowflake. We’re looking for engineers with deep expertise in at least one area and a solid understanding of the trade-offs among different technologies. The impact you’ll have here: - Architect and scale a high-performance data lakehouse on GCP, leveraging technologies like StarRocks, Apache Iceberg, GCS, BigQuery, Dataproc, and Kafka. - Design, build, and optimize distributed query engines such as Trino, Spark, or Snowflake to support complex analytical workloads. - Implement metadata management in open table formats like Iceberg and data discovery frameworks for governance and observability using Iceberg compatible catalogs. - Develop and orchestrate robust ETL/ELT pipelines using Apache Airflow, Spark, and GCP-native tools (e.g., Dataflow, Composer). - Collaborate across departments, partnering with data scientists, backend engineers, and product managers to design and implement What we’re looking for: - 5+ years of experience in data or software engineering, with a focus on distributed data systems and cloud-native architectures. - Proven experience building and scaling data platforms on GCP, including storage, compute, orchestration, and monitoring. - Strong command of one or more query engines such as Trino, Presto, Spark, or Snowflake. - Experience with modern table formats like Apache Hudi, Iceberg, or Delta Lake. - Exceptional programming skills in Python, as well as adeptness in SQL or SparkSQL. - Hands-on experience orchestrating workflows with Airflow and building streaming/batch pipelines using GCP-native services. About the Team: - The Data Platform team is the funnel between all of TRM's data world and product world. We care about all layers of stack including petabyte of data stores, pipelines, and processes. - We have quite a big scope as a the team with new and exciting projects every quarter. As a result, we collaborate across the board with most teams at TRM. - We believe in async communication and are also not afraid to jump on a quick huddle if that helps to move things faster. We are both scrappy when the situation demands and also process-oriented when we need to achieve our OKRs. - We are always looking for people who can elevate the quality our tech and our execution. If you enjoy a remote-first and async friendly environment to achieve efficacy and efficiency at petabyte scale, our team could be a great pick for you! - Team members are based in the US across almost all timezones! - We do try to reserve some overlap in the day for meetings. Our north star - no IC spends more than 3-4 hours/week in meetings. Learn about TRM Speed in this position: - Build scalable engines to optimize routine scaling and maintenance tasks like create self-serve automation for creating new pgbouncer, scaling disks, scaling/updating of clusters, etc. - Enable tasks to be faster next time and reducing dependency on a single person. - Identify ways to compress timelines using 80/20 principle. For instance, what does it take to be operational in a new environment? Identify the must have and nice to haves that are need to deploy our stack to be fully operation. Focus on must haves first to get us operational and then use future milestones to harden for customer readiness. We think in terms of weeks and not months. - Identify first version, a.k.a., "skateboards" for projects. For instance, build an observability dashboard within a week. Gather feedback from stakeholders after to identify more needs or bells and whistles to add to the dashboard. About TRM's Engineering Levels: Engineer: Responsible for helping to define project milestones and executing small decisions independently with the appropriate tradeoffs between simplicity, readability, and performance. Provides mentorship to junior engineers, and enhances operational excellence through tech debt reduction and knowledge sharing. Senior Engineer: Successfully designs and documents system improvements and features for an OKR/project from the ground up. Consistently delivers efficient and reusable systems, optimizes team throughput with appropriate tradeoffs, mentors team members, and enhances cross-team collaboration through documentation and knowledge sharing. Staff Engineer: Drives scoping and execution of one or more OKRs/projects that impact multiple teams. Partners with stakeholders to set the team vision and technical roadmaps for one or more products. Is a role model and mentor to the entire engineering organization. Ensures system health and quality with operational reviews, testing strategies, and monitoring rigor. The following represents the expected range of compensation for this role: - Individual pay is determined by skills, qualifications, experience, and location. The compensation details listed in this posting reflect the US base salary only. - The estimated base salary range for this role is $190,000 - $220,000. - Additionally, this role may be eligible to participate in TRM’s equity plan. - Please note – we factor in the different costs for geographies outside the United States. Life at TRM We are building a safer world. That promise shows up in how we work every day. TRM moves quickly. We are a high velocity, high ownership team that expects clarity, follow-through, and impact. People who thrive here are energized by hard problems, experimentation, and continuous feedback. If something takes months elsewhere, it will ship here in days. Our work sits at the intersection of AI, national security, and fighting crime. The problems are complex, the stakes are real, and the environment evolves quickly. The pace and intensity of the work reflect the importance of the mission. As a result, the way we operate requires a high level of ownership, adaptability, collaboration, and creative problem-solving. At TRM, you should expect: - Priorities and targets to change quickly as we experiment and iterate - Work that often requires operating with a high degree of ambiguity - A high level of personal ownership and accountability - Close collaboration across teams and functions - Frequent, high-touch communication - Creative problem solving and out-of-the-box thinking - A pace that rewards urgency, adaptability, and outcomes This environment is energizing for people who enjoy building, solving hard problems, and making progress in situations that are not always fully defined. It also requires comfort navigating ambiguity, adjusting course as new information emerges, and maintaining focus and positivity in a fast-moving and intense environment. We also recognize that this style of operating is not for everyone. If you are primarily optimizing for predictability or a consistently balanced workload, we encourage you to use the interview process to pressure test whether this environment is truly the right fit. We want teammates who thrive here, not just survive here. At the same time, many people find this work deeply rewarding. If you are excited by meaningful problems, motivated by ambitious goals, and energized by working alongside mission-driven colleagues, there is a good chance you will find TRM to be an exceptional place to grow and contribute. Learn more: Interviewing at TRM: How We Hire and What Success Looks Like AI Fluency at TRM AI fluency is a baseline expectation at TRM. We believe AI meaningfully changes how top performers operate. We expect every team member to use AI to accelerate and reimagine their craft, not just automate surface tasks. At TRM, AI fluency means you are among the top 10 percent of operators in your function in how you apply AI to: - Accelerate repeatable workflows - Structure and solve problems - Improve output quality - Increase speed and leverage You will be evaluated on applied AI fluency during the interview process. Leadership Principles We hire and grow against three leadership principles. They’re the standards for how we operate, treat each other, and make decisions. - Impact-Oriented Trailblazer: We put customers first and move with speed, focus, and adaptability. We treat every plan like an experiment – test, ship, measure, and iterate quickly. - Master Craftsperson: We care deeply about our craft. We balance speed with high standards, own outcomes end‑to‑end, and invest in getting better everyday. - Inspiring Colleague: We add clarity and energy, not noise. We bring humility, candor, and a one‑team mindset — giving and receiving feedback to make the team stronger. Join our Mission At TRM we care deeply about our craft. We are looking for individuals who want their work to matter, who experiment with speed and rigor, and who take pride in building a safer world for billions of people. If you’re excited by TRM’s mission but don’t check every box, we encourage you to apply — we hire for slope, judgment, and the will to learn fast. TRM is a Series C company with $220M in total funding, backed by Blockchain Capital, Goldman Sachs, Bessemer, Y Combinator, Thoma Bravo, and others. Headquartered in San Francisco, TRM operates as a distributed-first company with hubs in Los Angeles, San Francisco, New York, Washington D.C., London, and Singapore. Privacy Policy and Additional Information By submitting your application, you are agreeing to allow TRM to process your personal information in accordance with the TRM Privacy Policy. Our typical hiring cycles for specialized roles span 24 to 36 months. Accordingly, we retain your personal information for up to 36 months to evaluate your application and to consider you for current and future employment opportunities, unless you request earlier deletion or a different retention period is required or permitted by law.
Role Description Für unseren Kunden in Zürich, suchen wir eine:n erfahrene:n, motivierte:n und aufgeschlossene:n Software Engineer – Data Migration. Für unseren Kunden suchen wir eine engagierte Persönlichkeit, die unser Software‑Engineering‑Team bei der Weiterentwicklung moderner Data Migration Services unterstützt und aktiv dazu beiträgt, neue Kunden effizient an innovative Steuerlösungen anzubinden. - Du gestaltest, optimierst und skalierst die Migrationsstrecken. - Komplexe Datenstrukturen sind dein Metier. Du analysierst Abhängigkeiten, um intelligente Mapping-Logiken und automatisierte Cleansing-Strategien zu entwerfen und implementieren. - Du entwickelst automatisierte Tests, die Integrität und Qualität der Daten über den gesamten Prozess hinweg sicherstellen. - Du bereitest die Migrations- und Testdaten auf und überwachst den Verarbeitungsprozess. Deine Erkenntnisse aus der Fehleranalyse bringst du ins Team ein, damit gemeinsam Lösungen ausgearbeitet werden können. - Du treibst die kontinuierliche Weiterentwicklung der Entwicklungsprozesse, Tools und Best Practices voran. Qualifications - Mehrjährige praktische Erfahrung in der Erstellung und Optimierung komplexer SQL-Abfragen. - Vertiefte Kenntnisse in Datenmodellierung, Datenmanagement und Datenarchitektur. - Fähigkeit, sich rasch in komplexe Legacy-Datenbanken einzuarbeiten. - Erfahrung in der Analyse komplexer Datenstrukturen sowie im Identifizieren von Anomalien. - Gute Kenntnisse in SQL-Datenbanken, insbesondere PostgreSQL und MS SQL Server. - Kenntnisse in Python bzw. Erfahrung mit Python-Frameworks sind von Vorteil. - Eigeninitiative und analytisches Denken zur Verbesserung von Datenabfragen und Verarbeitungszeiten. - Sehr gute Sprachkenntnisse in Deutsch (C1) und Englisch. Company Description Die Coopers Group AG ist eine agile Schweizer Recruiting Agentur, die Spezialisten und Führungskräfte in den Bereichen IT, Life Sciences, Engineering und Finance vermittelt. Mit flexiblen Ansätzen bringen wir Kandidat:innen und Unternehmen zusammen, die nicht nur fachlich, sondern auch menschlich zusammenpassen.
Senior Python Engineer for Data Retraining
EPAM SystemsEPAM Systems is an information technology (IT) company that has become a leading global digital and product design, digital platform engineering, and product development agency. As
Title: Senior Python Engineer for Data Retraining Location: Boston United States Job Description: Are you prepared to take your Python engineering expertise to the next level and transition into the dynamic world of Big Data? EPAM provides a unique chance to secure a role after passing a single technical interview, while acquiring Big Data skills without changing your title or compensation. This focused 8-week program is tailored for Python engineers making the shift to Big Data Engineering. The curriculum encompasses three distinct phases: theoretical coursework delivered by industry professionals, practical tasks or hands-on projects, and a well-rounded knowledge assessment accompanied by feedback. Key topics include data management, distributed systems, Spark, Kafka, NoSQL databases, and cloud-native services, ensuring a solid grounding in modern data platforms. The training is fully remote, enabling participants to learn from the comfort of their own space. What will you do in this role? Skills - 4+ years of production experience in IT - Proficiency in Python, SQL, and cloud platforms (AWS, GCP, Azure) - Background in tools such as Databricks, Spark, Docker, Kubernetes is a plus - Familiarity with AI, LLM - English proficiency at B2+ or higher What we offer With us you can: - Work on a flexible schedule remotely or from any of our comfortable offices or coworking spaces in Ukraine - Receive the necessary equipment to perform your work tasks - Change projects and technology stacks within EPAM - Gain experience in various business domains (Insurance, E-commerce, Healthcare, Finance, Travelling, Media, Artificial Intelligence, and more) - Relocation opportunities may be available for eligible candidates, depending on the role and openings at other EPAM locations - Participate in volunteer, charity programs and communities (both technical and interest-based) We focus on your professional growth: - You can plan your individual career path together with your manager - Receive regular feedback from colleagues - Improve your English for free with certified teachers (Speaking Clubs, client interview preparation courses, etc.) - Get the opportunity to undergo free training and certification in AWS, GCP, or Azure Clouds - Use the internal E-learn training program (18,200+ specialized training and mentoring programs) - Access corporate accounts on LinkedIn Learning, Get Abstract and other partner resources - Study at EPAM Solution Architecture School with the instructors who are practicing architects - Develop as a leader, join Delivery Management, Resource Management, Leadership Essentials school and more - Participate in internal communities (500+ meetups, technical discussions, brainstorming sessions, online events and conferences annually) What we offer: - Vacation and sick leave (including a sick leave without a medical certificate) - A wide range of Voluntary Medical Insurance programs providing both medical treatment and various preventive options (including sports activities) - Medical insurance for family members at corporate rates - Company support during significant life events (childbirth or adoption, marriage, etc.) - Support for psychological comfort: discounts on services from mental health specialists or coaches, thematic training - E-kids program - a free programming language training program for EPAMers' children Kindly be advised that the set of benefits, including learning, certification, and other opportunities, may vary depending on the role you apply for. Our recruiter will be able to share more details about the specific opportunity during your general interview. ABOUT EPAM EPAM strives to provide its global team of over 62,350 professionals in more than 55 countries with opportunities for professional growth from day one of collaboration. Our colleagues are the source of EPAM's success, so we value cooperation, strive to always understand our clients' business and aim for the highest quality standards. No matter where you are, you will join a dedicated, diverse community that will help you realize your potential to the fullest.

