Innodata logo
Innodata

Innodata, with over 35 years of expertise, is a trusted leader in data solutions and AI innovation. The company specializes in training and deploying generative

Data Annotation

Location

Minnesota

Posted

7 days ago

Salary

$0 / hour

Seniority

Senior

Job Description

Data Annotation

Innodata

Title: Data Annotation Location: Remote - Minnesota Job Description: Innodata (Nasdaq: INOD) is a global data engineering company. We believe that data and Artificial Intelligence (AI) are inextricably linked. Our mission is to enable the responsible advancement of artificial intelligence by providing the data, evaluation frameworks, and human expertise required to build AI systems that can be trusted at scale. We provide a range of transferable solutions, platforms, and services for Generative AI / AI builders and adopters. In every relationship, we honor our 36+ year legacy delivering the highest quality data and outstanding outcomes for our customers. Scope of the Role: At Innodata, we’re partnering with the world’s leading technology companies to build the future of generative AI and large language models (LLMs). We’re on the lookout for smart, savvy, and curious Data Annotators to join our global contributor community as part of our Subject Matter Expert (SME) on Demand program. This is not a traditional full-time role. It’s a part-time, remote, flexible, project-specific opportunity designed for those who want to make a real impact—on their schedule. Whether you're a writer, linguist, educator, researcher, or just deeply passionate about language and logic, this role lets you contribute to cutting-edge AI development while maintaining control over your time. You’ll be helping LLMs learn the intricacies of language and reasoning—not just how to write, but how to think. If you’ve ever dreamed of shaping the intelligence behind tomorrow’s technology, this is your chance. This is more than just a gig—it’s a rare chance to help shape the future of AI from anywhere in the world, on your own terms. What You’ll Own: - Rating/assessing the performance of AI models or algorithms based on their output or behavior through a set of evaluative questions. - Labeling elements of a piece of content rather than the content as a whole. - Assigning predefined categories or labels to items. - Evaluating the perceived quality and/or appropriateness of content - Generating labels to advance understanding of a concept, trend etc. - Creation of additional training data for machine learning models by applying transformations to the original data, such as modifying images (rotation, flipping, cropping), generating new text (paraphrasing, summarization), or altering audio/video signals (speed modification, pitch shifting) to reduce overfitting and increase dataset diversity. - Reviewing data and identifying whether or not a product feature works as intended based on the project's guidelines. - Labeling model outputs to identify if a piece of content is or isn't something. Examples: identify clickbait; identifying gaming videos; identifying branded content. - Ordering or ranking items based on a set of preferences or criteria. - Creating prompts or questions that will be used to generate responses from a language model or other AI system. - Projects that evaluate the relevance of content based on a relevancy scale (1-3, 1-5, etc.). - Generating responses to prompts or questions using a language model or other AI system. - Rewriting existing text while preserving the original meaning, often to improve clarity or style and adherence to guidelines. - Producing concise summaries of longer pieces of text or data. - Converting spoken language or audio content into written text. - Converting text or spoken language from one language to another. - Gathering and compiling various forms of data to be used for training, evaluating, or fine-tuning the AI models. This may include text, images, videos, audio files, or other types of digital content. You’ll Thrive in This Role If You Have: - A High School Diploma or higher is required. - Professional or Expert level proficiency (C1/C2) in English The expected hourly salary range for this position is $13 p/hour, based on experience, skills, and qualifications.

Related Categories

Related Job Pages

More Data Engineer Jobs

Zensar logo

Senior Azure Data Engineer

Zensar

At Zensar, we’re “experience-led everything”. We are committed to conceptualizing, designing, engineering, marketing, and managing digital solutions and experiences for over 130 leading enterprises. We are a company driven by a bold purpose: Together, we shape experiences for better futures. Whether for our clients, our people, or the world around us, this belief powers everything we do. At the heart of our culture is ONE with Client - a set of four core values that reflect who we are and how we work: One Zensar, Nurturing, Empowering, and Client Focus. Part of the $4.8 billion RPG Group, we’re a community of 10,000+ innovators across 30+ global locations, including Milpitas, Seattle, Princeton, Cape Town, London, Zurich, Singapore, and Mexico City. We believe the best work happens when individuality is celebrated, growth is encouraged, and well-being is prioritized. We are an equal employment opportunity (EEO) and affirmative action employer, committed to creating an inclusive workplace. All qualified applicants will be considered without regard to race, creed, color, ancestry, religion, sex, national origin, citizenship, age, sexual orientation, gender identity, disability, marital status, family medical leave status, or protected veteran status.

Data Engineer7 days ago
Full TimeRemoteTeam 10,001

Role Description - Migrates SSIS packages to ADF, builds iBuy ingestion pipelines (SFTP / ODBC), and automates Scorecard load processes — replacing all MS Access macros and Windows Scheduler dependencies with ADF orchestration. - Implements the DQ framework (PK checks, null validation, deduplication, reject handling) and establishes the audit layer (row counts, hash totals, source-to-target traceability) with automated stakeholder alerting. Qualifications - Hands-on experience migrating SSIS packages to ADF. - Experience building production-grade pipelines with dependency management, scheduling, and error handling. - Strong T-SQL and ADF debugging skills. Requirements - Must deliver working, tested pipelines with minimal handholding. Benefits - Inclusive workplace with a commitment to equal employment opportunity (EEO) and affirmative action. - Encouragement of individuality, growth, and well-being.

India
Full TimeRemoteTeam 10,001+H1B No Sponsor

• contribute to the development and deployment of AI and machine learning infrastructure • Design, develop, and maintain ELT/ETL pipelines • Build and optimize data models in cloud-based data warehouses • Architect and provision cloud data infrastructure on GCP • Ensure all data systems comply with HIPAA and HITECH regulations • Implement data quality frameworks and validation rules • Partner with data scientists, analysts, and ML engineers

New York
$135K - $165K / year
Job Closed
Full TimeRemoteTeam 1,001-5,000Since 1985H1B No Sponsor

• Define enterprise architecture strategy across enterprise applications, integrations, APIs, Azure cloud platforms, enterprise data, analytics, automation, and AI enablement. • Establish scalable architecture patterns enabling digital customer and employee experiences across CRM, web, mobile, communications, and operational platforms. • Define enterprise engineering and platform standards across APIs, integrations, cloud services, observability, reusable services, technical design patterns, and platform reliability. • Promote reusable enterprise capabilities and scalable platform approaches that accelerate delivery and reduce technical fragmentation. • Serve as a trusted advisor to business and technology leadership on enterprise technology direction, modernization priorities, and technical investment decisions. • Own and evolve enterprise integration and platform strategy including APIs, middleware, orchestration, interoperability, and event-driven architecture. • Guide architecture and engineering direction across Microsoft Dynamics 365 (CE/F&O preferred), ERP, CRM, operational systems, digital platforms, and third-party ecosystems. • Improve observability, resiliency, monitoring, supportability, and enterprise reliability. • Guide modernization of legacy systems and synchronization approaches into scalable, maintainable cloud-native patterns. • Partner with Digital Solution Delivery, ERP, and Infrastructure leaders to enable scalable digital experiences through reusable enterprise services and shared data models. • Establish pragmatic enterprise data governance practices that improve trust, quality, consistency, accessibility, and usability of enterprise data without creating unnecessary bureaucracy. • Enable trusted enterprise data, analytics, automation, and AI-ready architectures leveraging Microsoft Fabric, Snowflake, Databricks, and Azure data services.

United States
$175K - $220K / year
Onyx Government Services logo

Senior Data Architect

Onyx Government Services

SDVOSB, Systems Integrator to Federal Civilian Agencies, the Intelligence Community, and Department. of Defense.

Data Engineer7 days ago
Full TimeRemoteTeam 11-50Since 2006H1B No Sponsor

• Develop and maintain enterprise data and analytics strategy documents, roadmaps, and improvement plans (including POA&Ms) for DLA J6TF • Design and support an enterprise data governance framework — governance structure documentation, data ownership assignments, data stewardship roles, and operating procedures • Manage metadata per DoD standards, including DoD Metadata Registry (DoD 8320 series) and Net-centric Enterprise Services requirements • Provide architecture recommendations for DLA analytical platforms — data ingestion, storage, processing, and delivery layers • Support enhancement of DLA's Enterprise Data Warehouse (EDW) — logical and physical schema design, data source integration, and analytical layer development • Develop data dictionaries, data inventories, and data lineage documentation across DLA's multi-system enterprise environment • Identify and register authoritative data sources across DLA data domains; document authoritative source designations per governance policy • Conduct data quality assessments — profile source data, document quality issues, and produce remediation recommendations • Support data governance working groups and data stewardship council activities — facilitation, briefings, and deliverable preparation • Develop and maintain artifacts including logical data models, conceptual data models, metadata management plans, and enterprise architecture alignment documents • Prepare monthly progress reports, IPR briefing charts, and deliverables for COR review

Pennsylvania + 1 moreAll locations: Pennsylvania | Virginia
Job Closed