Job Closed

This listing is no longer active.

Innodata Inc logo
Innodata Inc

Innodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider-of-choice for 4 out of 5 of the world’s biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine. By combining advanced machine learning and artificial intelligence (ML/AI) technologies, a global workforce of subject matter experts, and a high-security infrastructure, we’re helping usher in the promise of AI. Our global workforce includes over 7,000 employees in the United States, Canada, United Kingdom, the Philippines, India, Sri Lanka, Israel and Germany. We’re poised for a period of explosive growth over the next few years.

Language Data Scientist

Data ScientistData ScientistOtherRemoteTeam 5,001-10,000

Location

United States

Posted

97 days ago

Salary

$85K - $95K / year

No structured requirement data.

Job Description

Language Data Scientist

Innodata Inc

Job Title: Language Data Scientist Location: Fully Remote within the U.S. (excluding California, Washington, Alaska, Colorado, Montana, New York, Puerto Rico, Nevada, Nebraska) Employment Type: Full-Time (40 hours per week) Fixed-Term Who we are: Innodata (NASDAQ: INOD) is a leading data engineering company. With more than 2,000 customers and operations in 13 cities around the world, we are an AI technology solutions provider-of-choice for 4 out of 5 of the world’s biggest technology companies, as well as leading companies across financial services, insurance, technology, law, and medicine. By combining advanced machine learning and artificial intelligence (ML/AI) technologies, a global workforce of subject matter experts, and a high-security infrastructure, we’re helping usher in the promise of AI. Innodata offers a powerful combination of both digital data solutions and easy-to-use, high-quality platforms. Our global workforce includes over 7,000 employees in the United States, Canada, United Kingdom, the Philippines, India, Sri Lanka, Israel and Germany. We’re poised for a period of explosive growth over the next few years. About the Role: Innodata is building a team of Language Data Scientists and Gen AI experts to help our customers advance GenAI applications. You will work hands-on with multi-modal and multi-lingual datasets and collaborate with cross-functional partners. You will use your experience with human and synthetic data workflows to drive innovation and continuous improvement. The ideal candidate must have the right mix of skills in (computational) linguistics and human evaluation tasks, data science, and data engineering. Key Responsibilities: - Design/improve workflows to create data for AI/ML training and evaluation. Includes human annotation and data collection workflows, as well as synthetic ones. - Dive deep into existing workflows and processes to gather data and insights, make recommendations, and drive improvement through innovation and cross-functional collaboration with customers - Critically assess annotation tooling and workflows - Quantitatively analyze large datasets, perform statistical analysis, calculate metrics, and make recommendations to improve accuracy and performance - Work closely with client stakeholders on understanding goals, gathering requirements, proposing solutions and executing them. Qualifications: - Knowledge of how components of GenAI products or services combine to work - Collaborating with cross-functional teams to define AI project requirements and objectives, ensuring alignment with overall business goals - MA in (computational) linguistics, data science, computer science (AI / ML / NLU), quantitative social sciences or a related scientific / quantitative field, PhD strongly preferred - Language and language data expertise: Extensive experience working with human language data and designing human evaluation tasks, including multi-phase and complex workflows. - Deep understanding of language and its relationship with culture - Ability to identify ambiguity and subjectivity in language - Ability to work with multi-lingual and multi-modal projects - Quantitative Analysis Skills: Advanced knowledge of statistics, metrics (e.g. f1 score, inter-rater reliability metrics), and data analysis methods such as sampling. - Technical skills: - Experience with Natural Language Processing (NLP) techniques and tools, such as SpaCy, NLTK, or Hugging Face. - Proficiency in Python to - handle / transform large datasets (e.g. pre- and postprocessing data, pandas) - perform quantitative analyses - visualize data (for example matplotlib, seaborn) - Data processing: - Deep understanding of data pipelines to support ML and NLP workflows, - Knowledge of efficient data collection, transformation, and storage - Knowledge of data structures, algorithms, and data engineering principles - Excellent interpersonal skills for effective cross-functional stakeholder engagement - Excellent problem-solving skills, with the ability to think critically and creatively to develop innovative AI solutions - Ability to work independently and collaborate as part of a team - Adaptable to changing technologies and methodologies - Ability to translate experience, research and development information to understand client products and services. Preferred Qualifications: - Conducting research to stay up-to-date with the latest advancements in generative AI, machine learning, and deep learning techniques - Knowledge of optimizing existing generative AI models for improved performance, scalability, and efficiency - Experience of developing and maintaining ML/AI pipelines, including data preprocessing, feature extraction, model training, and evaluation - Model Fine-Tuning: Knowledge of Fine-tuning pre-trained models to adapt them to specific tasks and datasets, improving their performance and relevance - Developing clear and concise documentation, including technical specifications, user guides, and presentations, to communicate complex AI concepts to both technical and nontechnical stakeholders - Contributing to establishing best practices and standards for generative AI development with customers and within the organization - Providing technical mentorship and guidance to junior team members - Understanding of techniques such as GPT, VAE, and GANs Salary Range: Up to $95k USD Rates at Innodata vary depending on a wide array of factors, which may include but are not limited to the role, skill set, educational background and geographic location.

Related Categories

Related Job Pages

More Data Scientist Jobs

Attain Partners is an innovative consulting firm dedicated to disrupting the status quo to change the world and improve the lives of those we touch. From strategy to technology and everywhere in between, our experts use their unique skills to advance the important missions of education, nonprofit, healthcare, and state and local government clients. People are at the center of all we do, and that’s why we empower career growth, provide industry-leading benefits packages, encourage a flexible work environment, and foster a culture of inclusion to support the needs of our team. We share a collective passion for our mission and our people. Guided by our seven core values, The Attain Way, our vision is the foundation of our culture—to be and attain the best. Job Description Attain Partners is seeking an Integration & Data Leader responsible for designing, leading, and executing enterprise data integration and migration initiatives across client environments. This role focuses on SQL and SSIS based data movement, enterprise system integrations, and ensuring data accuracy, completeness, and alignment with business requirements during complex system transformations. The Integration & Data Leader partners closely with business stakeholders, architects, and technical teams to define integration requirements, map data across systems, and ensure high-quality data movement between platforms. This role also plays a critical part in helping organizations modernize their integration capabilities by evolving from traditional ETL-based architectures toward modern iPaaS platforms such as the Boomi Platform and other cloud integration tools. Job Responsibilities - Data Migration & Integration Leadership - Lead the migration and integration of enterprise data across systems using SQL, SSIS, and related data movement technologies. - Design and oversee data integration processes that support enterprise system interoperability. - Ensure data migrations are delivered with high levels of accuracy, completeness, and validation controls. - Coordinate the end-to-end lifecycle of data integrations including discovery, source-to-target mapping, build, testing, and deployment. - Requirements & Stakeholder Collaboration - Partner with business and technical stakeholders to gather integration requirements and understand system dependencies. - Define data attributes, transformation rules, and mapping logic necessary for successful integration. - Identify data quality risks and remediation strategies prior to integration or migration activities. - Facilitate integration discovery workshops and architecture discussions with client teams. - Integration Architecture & Delivery - Design scalable integration solutions leveraging SQL, SSIS, and enterprise integration patterns. - Oversee the development and execution of data pipelines, transformation logic, and integration workflows. - Ensure integrations align with enterprise architecture standards and system interoperability requirements. - Establish and maintain documentation including source-to-target mappings, integration specifications, and technical runbooks. - Data Quality & Governance Alignment - Identify and address data quality considerations during migration and integration processes. - Work with data governance teams to ensure integrations follow data standards and stewardship practices. - Implement validation and reconciliation processes to ensure data integrity across systems. - Modern Integration Enablement - Continuously expand technical capabilities by learning and adopting modern integration technologies and iPaaS platforms. - Support the organization’s transition from traditional ETL approaches toward cloud-based integration architectures. - Contribute to the development of modern integration frameworks and reusable patterns. Required Skills - 7+ years of experience in data integration, ETL development, or enterprise data migration. - Strong experience with SQL and SSIS for building and managing data pipelines. - Proven ability to lead complex data migration initiatives across enterprise systems. - Experience defining source-to-target mappings and data transformation rules. - Strong understanding of data quality practices, validation, and reconciliation techniques. - Ability to collaborate effectively with both business and technical stakeholders. - Strong documentation and communication skills for technical and functional audiences. Desired Skills - Experience with modern integration platforms (iPaaS) such as the Boomi Platform, MuleSoft, or similar tools. - Experience integrating with enterprise systems such as ERP, CRM, HRIS, and data warehouse platforms. - Consulting or professional services experience leading data integration workstreams. - Exposure to cloud data platforms, APIs, and modern integration architectures. - Experience in Higher Education Software Additional Information Attain Partners values your mental, emotional, and physical health and wellbeing. Our comprehensive benefits package starts on your first day of employment and includes benefits such as: - Competitive health, dental, and vision coverage, HSA and FSA accounts, life and disability insurance, fertility and family planning benefits, and employee assistance and discount programs - 11 paid federal holidays and flexible unlimited time off (UTO) - Generous 401(k) matching with immediate vesting - Flexible career paths – our career tracks provide advancement, mobility, and flexibility as you continue to grow with us - A healthy environment where we value unique experiences, and care about everything that makes you, you. Attain Partners is committed to fair and equitable compensation practices. The individual base salary for this position is unique to each candidate and will be commensurate with experience, education, and skills. In addition to base salary, this role is eligible for an annual discretionary bonus. Interested in this position but the compensation isn’t quite right? Let us know your expectations, and we’ll see if we can make it happen based on your qualifications. Salary Range $160,000—$170,000 USD Attain Partners is an Equal Opportunity/Affirmative Action employer. All qualified applicants will receive consideration for employment without regard to sex, race, color, religion, national origin, disability, protected Veteran status, age, or any other characteristic protected by law. Applicants have rights under Federal Employment Laws. For more Information visit EEO, EEO Poster Supplement, Family and Medical Leave Act (FMLA), and Employee Polygraph Protection Act (EPPA). If you are unable to complete this application due to a disability, contact this employer to ask for an accommodation or an alternative application process.

United States
$160K - $170K / year
Job Closed
OtherRemoteTeam 10,001+H1B No Sponsor

• Define and maintain the data model and architecture for HR foundational data in Workday and connected systems • Facilitate and oversee accurate, timely execution of foundational data requests (new orgs, restructures, mergers/divestitures, entity changes) • Develop, implement, and enforce policies, procedures, and best practices for foundational HR data • Establish data quality metrics, dashboards, and automated controls to monitor integrity across systems • Serve as the primary contact for data usage, requests, and governance inquiries • Partner with HR analytics to ensure foundational data supports accurate reporting

United States
$136K - $185K / year
Job Closed
The EMMES Corporation logo

Associate Biostatistician

The EMMES Corporation

Emmes Group is transforming the future of clinical research, bringing the promise of new medical discovery closer within reach for patients. Emmes Group was founded as Emmes more than 47 years ago, becoming one of the primary clinical research providers to the US government before expanding into public-private partnerships and commercial biopharma. Emmes has built industry-leading capabilities in cell and gene therapy, vaccines and infectious diseases, ophthalmology, rare diseases, and neuroscience. We believe the work we do will have a direct impact on patients’ lives and act accordingly. We strive to build a collaborative culture at the intersection of being a performance and people-driven company. We’re looking for talented professionals eager to help advance clinical research as we work to embed innovation into the fabric of our company.

Data Scientist97 days ago
OtherRemoteTeam 1,001-5,000

This description is a summary of our understanding of the job description. Click on 'Apply' button to find out more. Role Description Collaborates with clinical investigators to determine study design, contributes to protocol development, writes statistical analysis plans, makes statistical inference, and writes and presents reports summarizing findings including publications in peer-reviewed journals. - Contributes to manuscripts and/or scientific presentations. - Collaborates with clinical investigators to determine study design. - Writes sections of protocols that require statistical input. - Reviews protocols and case report forms to ensure protocol objectives are met and standards are maintained. - Generates treatment allocations in randomized clinical research studies and ensures proper implementation. - Supports development of statistical analysis plans and programs to perform analyses and display study data. - Performs statistical analyses and writes and validates application programs. - Implements data and safety monitoring reports to ensure participants’ safety. - Generates quality control and operational reports to support clinical operations. - Generates study reports to be distributed to internal and external monitoring committees and regulatory bodies including, but not limited to, Data and Safety Monitoring Board reports, IND annual reports, and Clinical Study reports. - Participates in professional development activities both within and outside the company. - Other duties as assigned. Qualifications - MS in biostatistics, statistics, epidemiology or related field. - Demonstrated proficiency with statistical methods and applications in clinical research. - Strong programming skills in SAS and/or R. - Expertise in state-of-the-art data manipulation and statistical methodology. - Ability to manage multiple tasks. - Ability to work independently as well as in a team environment. - Ability to effectively communicate complex statistical concepts, both written and oral. Benefits - Flexible Approved Time Off - Tuition Reimbursement - 401k Retirement Plan - Work From Home Anywhere in the US - Maternal/Paternal Leave - Casual Dress Code & Work Environment

United States + 1 moreAll locations: United States | United Arab Emirates
Job Closed
LivePerson logo

Data Scientist II

LivePerson

LivePerson is an online engagement solutions company, which means that it works with clients to provide their customers with real, live assistance and advice. T

Data Scientist97 days ago

Location: USA Remote Status: Fully Remote (#LI-Remote) LivePerson (NASDAQ: LPSN) is a leader in trusted enterprise conversational AI and digital transformation. The world's leading brands use our award-winning Conversational Cloud platform to connect with millions of consumers. We power nearly a billion conversational interactions every month, providing uniquely rich data analytics and safety tools to unlock the power of conversational AI for better business outcomes. Fast Company named LivePerson the #1 Most Innovative AI Company in the world. Position Overview At LivePerson, the Data Scientist II helps advance Conversational AI by applying research and experimentation with Large Language Models (LLMs) to real-world customer interactions. This role designs and executes applied research initiatives, develops effective prompting and orchestration strategies using deep knowledge of transformer architectures, and optimizes modern LLM inference and retrieval pipelines. The scientist analyzes large-scale conversational datasets to improve Retrieval-Augmented Generation (RAG) systems and builds robust experimentation frameworks for evaluating non-deterministic model outputs. Working closely with other scientists and cross-functional teams, this role translates insights on model behavior into recommendations that inform product and business strategy while staying at the forefront of Generative AI advancements. You Will: Key Responsibilities & Impact - Propose, plan, and execute applied research initiatives that will make a real-world impact on Conversational AI using Large Language Models (LLMs) within a commercial setting. - Use expert knowledge of Transformer architecture mechanics (Attention, Tokenization, Embeddings) to consistently find the most effective, efficient, stable, explainable, and elegant prompting and orchestration approaches. - Utilize modern LLM inference and retrieval pipelines. - Analyze large scale conversational datasets for RAG (Retrieval-Augmented Generation) optimization. - Bring research-inspired prompt strategies to life within production systems. - Create durable, flexible scientific evaluation and experimentation capabilities for non-deterministic model outputs. - Present findings on model behavior and context efficacy in ways that inform and guide business and product strategy. - Work closely with other scientists. - Stay current with the rapidly evolving state of Generative AI. You Have: Required Skills & Qualifications Requires a Master's degree in Computer Science, Artificial Intelligence, Computational Linguistics, Operations Research, Industrial Engineering, or a directly related field plus two (2) years of experience as a data scientist. Must have two (2) years of experience in the following (experience may be gained concurrently): - Programming experience in Python - Developing and designing Large Language Model (LLM) interaction flows, evaluation frameworks, or advanced prompting strategies - Partnering with engineering teams to deploy GenAI/LLM solutions into production Must have one (1) year of experience in the following (experience may be gained concurrently): - Experience with Generative AI, including Large Language Models, Prompt Engineering, Retrieval Augmented - Generation (RAG), and Semantic Search/Vector Databases - Researching and implementing advanced prompting techniques, such as Chain-of-Thought or Few-Shot prompting - Experience with dialogue systems and context management - Hands on experience with LLM Orchestration Frameworks, such as LangChain / LangGraph - Experience with Transformer architectures, specifically understanding Attention mechanisms, Context Windows, and Tokenization constraints - Experience with SQL The salary range for this role will be between $120,000 to $145,000 USD. Final compensation will be determined by a variety of factors, including, but not limited to your location, skills, experience, education, and/or professional certifications. Our Benefits & Perks We are committed to supporting the complete well-being, health, financial security, family, and professional growth of our permanent employees. 🏥 Health & Wellbeing - Medical, Dental, and Vision Insurance: Comprehensive plans to support your health needs. Wellness Resources: Access to wellbeing resources and programs including our EAP plan.Health & Mental Support: Access a confidential and free Employee Assistance Program (EAP), providing professional counseling. 💰 Financial Security & Growth - 401(k) Retirement Plan: To help you plan for your financial future by offering both the plan and a 4% employer match (100% match on the first 3% contribution and 50% match on the next 2% contribution) HSA & FSA Plans: To help you plan for health related expenses on a pre-tax basis - Employee Stock Purchase Program (ESPP): Participate and receive a discount on company shares, allowing you to directly share in LivePerson's success and growth. - Additional Insurances: Basic and supplemental life insurance, Accidental Death & Dismemberment (AD&D) insurance, long-term and short-term disability insurance coverage, legal plan, identity theft protection plan, and critical illness supplemental insurance. - Development: Access to internal professional development resources. 👨‍👩‍👧‍👦 Time Away & Family Support - Flexible Paid Time Off (PTO): Discretionary PTO package for flexible days off with manager approval. - Paid Public Holidays. - Generous Parental Leave Policy: Including maternity/paternity support and fertility services. 💻 Workplace Flexibility - Remote-First Model: This role is fully remote (#LI-Remote), offering excellent work-life balance and flexibility. We also maintain dedicated WeWork space for those who wish to meet colleagues or collaborate in person. Why You’ll Love Working Here As leaders in enterprise customer conversations, we celebrate diversity, empowering our team to forge impactful conversations globally. LivePerson is a place where uniqueness is embraced, growth is constant, and everyone is empowered to create their own success. We're very proud to have earned recognition from Fast Company, Newsweek, and BuiltIn for being a top innovative, beloved, and remote-friendly workplace, and recognized by Gartner as a leader in the Conversational AI space. Belonging at LivePerson: Equal Opportunity Employer We are committed to fostering an inclusive workplace and are proud to be an Equal Opportunity Employer (EOE). We believe that diverse perspectives drive innovation. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, veteran status, or other characteristics protected by US Federal, State, or Local law. Accessibility Commitment LivePerson is dedicated to the accessibility needs of our applicants and employees. We provide reasonable accommodations to job applicants with disabilities. Applicants who require a reasonable accommodation for any part of the application or hiring process should inform their recruiting contact upon initial connection. Important Candidate Notice The talent acquisition team at LivePerson has recently been notified of a phishing scam targeting candidates applying for our open roles. Scammers have been posing as hiring managers and recruiters in an effort to access candidates' personal and financial information. The phishing scam is not isolated to only LivePerson and has been documented in news articles and media outlets. Please note that any communication from our hiring teams at LivePerson regarding a job opportunity will only be made by a LivePerson employee with an @liveperson.com email address. LivePerson does not ask for personal or financial information as part of our interview process, including but not limited to your social security number, online account passwords, credit card numbers, passport information, and other related banking information. If you have any questions and or concerns, please feel free to contact recruiting-lp@liveperson.com

United States
$120K - $145K / year
Job Closed