Vosyn logo
Vosyn

Vosyn: Uniting Voices, Visions, and Values in Every Tongue.

AI Integration and Data Engineer – Master-Level Internship

Location

Canada

Posted

3 days ago

Salary

$32 / hour

Seniority

Entry Level

Postgraduate DegreeEnglishJavaScriptNode.jsPythonSQL

Job Description

AI Integration and Data Engineer – Master-Level Internship

Vosyn

• Connect to client systems — databases, APIs, CRMs, and other SaaS tools — and get data reliably out of and into them. • Extract, clean, transform, and move data so it is structured and usable by AI features, including retrieval and agentic workflows. • Build and maintain data pipelines that feed the Applied AI Engineer’s RAG and agent systems with current, correct data. • Use AI coding tools to accelerate integration work, while applying real engineering judgment to the bespoke, messy parts those tools handle poorly. • Handle the practical realities of client data: undocumented schemas, inconsistent formats, partial records, and access constraints. • Apply data-handling, security, and privacy best practices throughout, especially with client data. • Collaborate with the Builder and the Applied AI Engineer to ensure data flows cleanly end to end, and document integrations and pipelines in Notion so they are repeatable and maintainable.

Job Requirements

  • Currently enrolled or recently graduated from a Master’s program in Computer Science, Software Engineering, Data Engineering, Information Systems, or a related field.
  • Solid back-end and data fundamentals: Python and/or Node.js, plus comfort with SQL and relational databases.
  • Demonstrated experience using AI coding tools (Cursor, Copilot, Claude Code, Windsurf, or similar) as a core part of how you build.
  • Hands-on experience moving data between systems — working with APIs, building simple pipelines, or cleaning and transforming real datasets.
  • Ability to read, evaluate, and improve AI-generated code rather than accepting it blindly, particularly for integration logic.
  • Understanding of databases, APIs, and asynchronous request handling.
  • Comfort working with incomplete, messy, or undocumented data and making sensible assumptions.
  • Awareness of data security and privacy best practices.
  • Excellent verbal and written communication skills within a cross-functional team environment.

Related Categories

Related Job Pages

More Data Engineer Jobs

Celigo logo

Senior Data Integrations Engineer - AI

Celigo

Celigo is proud to be a 2025 Gartner Customers’ Choice for iPaaS and a Visionary in the Gartner Magic Quadrant for iPaaS for the second consecutive year. We are ranked #1 iPaaS on G2 for multiple quarters and named a Leader in both B2B/EDI and API Management. Remote-first culture, built on trust, collaboration, and transparency A high-growth, inclusive work environment where innovation thrives Lightspeed learning opportunities to keep you at the leading edge of your field Exceptional coworkers who challenge and inspire you daily

Data Engineer3 days ago
Full TimeRemoteTeam 501-1,000

Role Description We are seeking a Senior Data Integrations Engineer - AI with expertise in Enterprise System Integrations, Business Process Automation, and AI technologies. In this role, you will design, build, and deploy scalable integration solutions and automations, as well as develop AI Agents using various technologies across multiple domains. The Global Technology Operations team is dedicated to digitally transforming and scaling Celigo's business, enhancing productivity, and improving operational efficiency. As an AI Integrations Engineer, you will support the business by seamlessly connecting systems and data. Additionally, you will build automations using integrator.io and AI concepts to eliminate redundant tasks, enabling teams to focus on higher-value activities. - Champion the Celigo product through transformational integrations that enable scalable business growth. - Build AI agents/integrations using Celigo’s integrator.io and frameworks like LangChain and LangGraph, and using best practices in AI SDLC. - Be an advocate for Celigo and show the art of the possible with AI for different stakeholders within Celigo and to Celigo customers/partners/builders. - Design, build, and deliver scalable data pipelines leveraging modern cloud architectures. - Collaborate with business process experts to develop solutions for complex systems and data platforms that drive revenue growth. - Ensure your integrations adhere to engineering best practices, emphasizing data quality, accuracy, operability, and security. - Build and maintain trusted relationships with technical and business teams. - Manage multiple projects concurrently, including operational tasks. - Serve as the owner and final escalation point for your integrations. - Produce clear, accurate technical documentation, develop support processes and procedures, and facilitate smooth hand-offs to peers and other organizational units. Qualifications - Proven ability to interface with diverse stakeholders across business units and levels, gather requirements, and manage follow-ups. - Demonstrated track record of delivering projects end-to-end with strong accountability. - Thinks like an owner—takes full charge of projects from inception to delivery while keeping Celigo’s best interests in mind. - Willingness to work flexible hours to collaborate with India stakeholders effectively. - Excellent written communication skills; experience with data visualization (graphs and plots) is a significant advantage. - Able to quickly learn business processes, understand requirements, and communicate designs that address both technical and business challenges. - Basic knowledge of SQL, Snowflake stored procedures, AWS S3, along with a clear understanding of integration design patterns. - Basic knowledge of Agent SDKs and Model context protocol. - Familiar with various authentication mechanisms such as OAuth 2.0, SAML2, and JWT Bearer Tokens. Requirements - A BS/BA in Information Systems, Computer Science, or equivalent practical experience is preferred. - Strong expertise in developing integrations using REST and SOAP APIs, with at least 5+ years of experience in JSON and XML schema design. (must have) - 1+ years of experience in building AI solutions using RAG, LangGraph, Web scraping, or any other AI technology. - Proficient in JavaScript/Python for data processing within integration projects. - Good to have experience in developing AI agents or bots that autonomously interact with various tools and systems to derive and act upon business insights. - Good to have experience in integrating Large Language Models (LLMs) such as OpenAI's GPT series, Google's Gemini, or Anthropic's Claude with external APIs, vector databases, and enterprise platforms to automate workflows and enhance decision-making processes. - Solid understanding of prompt engineering, context management, and the orchestration of multi-step tasks using AI-driven solutions. - Knowledge of system and software engineering architecture principles and best practices is a plus. - Experience supporting business teams and a strong grasp of processes like Quote-to-Cash, Procure-to-Pay, Lead-to-Opportunity, and Hire-to-Retire. - Hands-on experience with cloud applications like Salesforce, HubSpot, NetSuite, Zendesk, Gong, Gainsight, and similar platforms. - Experience in designing and building integration solutions, ideally within Sales or Customer Success domains. Benefits - Three weeks of vacation (starting year one) - Wellness days and holidays to recharge - Parental leave and a generous benefits package - Monthly tech stipend - Recognition and career development opportunities

United States
$145K - $160K / year
BlueCloud logo

Senior Data Engineer

BlueCloud

Global leader in Data, Analytics and AI with exceptional focus on Innovation, Customer Service and Employee engagement

Data Engineer3 days ago
Full TimeRemoteTeam 501-1,000Since 2004H1B Sponsor

• This is a senior-level role focused on the design, development, and ownership of data solutions built primarily on Snowflake Data Cloud. • You will lead the architecture and implementation of scalable data pipelines, establish robust data models, and enforce data governance and security standards across the platform. • Design, build, and own scalable data pipelines and ingestion processes using SQL, Python, dbt, and cloud-native tools. • Architect and implement ELT/ETL patterns across batch, incremental, and CDC pipelines. • Lead the development of data models using Dimensional Modeling, Data Vault, or Lakehouse approaches. • Own the design and optimization of Snowflake environments, including warehouse sizing, cost governance, storage standards, schema management, and performance tuning. • Work with cloud platforms (AWS, Azure, or GCP) to integrate with Snowflake and deliver high-performance, cost-efficient solutions. • Lead pipeline orchestration using Airflow, dbt Cloud, or similar tools. • Establish and enforce data governance frameworks, access controls, and security protocols with particular focus on Snowflake-native capabilities such as row-level security, dynamic data masking, and data sharing. • Define and maintain data quality standards, lineage documentation, and compliance requirements.

Colombia
Emergent BioSolutions logo

Enterprise Data Architect – Technical Lead

Emergent BioSolutions

Protecting against emerging global health threats.

Data Engineer3 days ago
Full TimeRemoteTeam 1,001-5,000Since 1998H1B No Sponsor

• Own the architecture, implementation, and delivery of the Enterprise Data Warehouse (EDW) • Knowledgeable on middleware & ETL technologies and ability to partner with vendors to implement capabilities • Deep understanding of MDM technologies and how to integrate into an organizational operational processes • Translate business requirements into scalable data architecture and actionable technical plans • Define and enforce data standards, lineage, and attribute mapping across systems • Design logical and dimensional data models to support enterprise reporting and analytics • Build and maintain a comprehensive data dictionary and metadata repository • Conduct in-depth data analysis using SQL and other tools to validate and audit data quality • Ensure adherence to best practices in data governance, security, and performance optimization • Work closely with functional partners and IT business analysts to understand requirements and translate them into technical solutions • Communicate technical concepts clearly to both technical and non-technical audiences • Partner with other IT teams to ensure seamless integration across systems and platforms • Work independently and proactively in a fast-paced, deadline-driven environment

Maryland
$155.5K - $188.2K / year
Ylopo logo

Data Feed Coordinator

Ylopo

Ylopo is a next-generation Complete Digital Marketing Solution designed to help find more clients & build your brand.

Data Engineer3 days ago
Full TimeRemoteTeam 11-50H1B No Sponsor

Role Description The [DATA FEED COORDINATOR] is responsible for coordinating the various components of Ylopo’s IDX data feed integrations, including compliance, billing, and reporting. This team member will correspond with various members of MLS boards and may interact with clients and/or brokers on occasion. This role will be email and phone based. This role reports to Erin Druskbasky [MLS Operations Manager]. - Manage IDX paperwork and reporting for Ylopo clients to ensure compliance with MLS boards - Maintain existing documentation and build new documentation (Google Sheets/Docs/Excel Spreadsheets) based on internal organization of MLS boards and their related IDX feeds - Review and report IDX feed technical issues to MLS boards and vendor contacts - Respond (via phone/email/text) to all client and/or MLS board requests in a timely manner (2 hour TAT) to provide a high level of customer support - Speak confidently and professionally with MLS board representatives, clients, and brokers - Complete special projects and outbound calls as needed - Serve as subject matter expert for Ylopo MLS/IDX data feed process - Learn the ins and outs of Ylopo product Qualifications - Previous experience in a support, administrative, or customer service role - Experience working with an enterprise CRM (Salesforce, Zoho, HubSpot) - Intermediate level knowledge of Excel or Google Sheets - Professional manner including a positive demeanor, trustworthy character - Consistent work habits and strong work ethic - Strong organizational skills and attention to detail - Ability to multitask, and work independently toward deadlines - Strong written and verbal communication skills, ability to work well in a small group setting - Ability to take initiative and see projects and tasks through to completion - Ability to understand and convey detailed information - Understanding of real estate and the real estate profession a plus, but not necessary Requirements - The processor should be 2.0ghz and above, Intel core 5/7 is highly required for both main and back-up hardware - RAM should be at least 16 GB with 100 GB Free disk space - A headset with the noise-canceling feature - 20 Mbps & up wired connection for the main internet service - Strictly no USB Sticks allowed for backup internet connection Benefits - A commitment to personal development - Guidance and support at a high level through interfacing with our Executive Team to prioritize goals as a company - Excellent leadership and mentoring for our entry-level to senior staff, and recognition of outstanding efforts - Team building events, team lunches/happy hours, and other company-wide events - A supportive, caring environment dedicated to continuous learning and growth

PST (UTC-8)
₱40K / month