AI Engineer (m/f/n)
Location
Poland
Posted
45 days ago
Salary
0
Seniority
Mid Level
No structured requirement data.
Job Description
AI Engineer (m/f/n)
InPost
Company Description InPost Group is an innovative European out of home deliveries company, revolutionizing the way parcels are delivered to customers. With operations across several countries, our network of intelligent lockers provides customers with a fast, convenient, and secure delivery option. InPost Group is a publicly traded company, with a market capitalization of about $5 billion as of March 2023. With over 10,000 employees worldwide, InPost Group is one of the largest out of home delivery providers in Europe, committed to providing sustainable and efficient delivery solutions to meet the evolving needs of customers in today's rapidly changing landscape. Job Description We are seeking a skilled and innovative AI Engineer expertise who has experience working with Generative AI (GenAI) models, such as Large Language Models (LLMs), and implementing these solutions into web applications. This role combines the responsibilities of a full-stack developer with advanced knowledge of machine learning models, APIs, and cloud infrastructure, focusing on integrating state-of-the-art AI capabilities into user-friendly and efficient web-based applications. Key Responsibilities: Model Integration & API Development: - Integrate LLMs and other GenAI models into web applications through efficient API design and implementation. - Build and optimize API endpoints to allow seamless, real-time interaction between front-end applications and back-end AI models. - Design and develop secure, scalable, and high-performing microservices for AI model deployment. Back-End Development & AI Pipelines: - Develop robust back-end systems in Python to support the deployment, scalability, and maintenance of GenAI models. - Build and maintain data pipelines, including preprocessing data and post-processing AI model outputs for consumption by applications. - Implement best practices for handling sensitive data and maintaining model performance. Infrastructure & Deployment: - Use Kubernetes and Docker for containerization and orchestration to ensure scalable deployment and management of AI applications. - Implement continuous integration and continuous deployment (CI/CD) pipelines for automated testing and deployment of code changes. - Maintain a scalable and secure cloud infrastructure, leveraging platforms such as Google Cloud Platform, or Azure for model training, storage, and deployment. LLM and GenAI Ecosystem Expertise: - Utilize vector databases (e.g., Pinecone, Weaviate, or Faiss) to manage and retrieve embeddings for efficient similarity search and recommendation systems. - Work with frameworks for model development and deployment, including Hugging Face Transformers, LangChain, and OpenAI. - Optimize and fine-tune LLMs to improve performance based on application needs. Qualifications - Min Bachelor's degree in Computer Science, Engineering, or a related field. - 7+ years of experience as a full-stack developer, ideally with a focus on AI model integration. - Proficiency in Python - Strong knowledge of GenAI models and LLMs, including experience with model selection, tuning, and embedding strategies. - Experience with API development and integration to facilitate communication between front-end applications and AI models. - Working knowledge of containerization technologies like Docker and container orchestration with Kubernetes. Familiarity with cloud platforms (AWS, GCP, or Azure) for AI model deployment and scalability. - Proficient in vector databases and their integration with LLM models for enhanced application functionality. - Familiarity with database management (SQL, NoSQL) and caching solutions (e.g., Redis). - Experience in CI/CD pipelines, code versioning (Git), and DevOps practices. - Excellent knowledge of English AND Polish. Preferred Qualifications: - Knowledge of streaming architectures for real-time data processing and ingestion (e.g., Apache Kafka). - Familiarity with serverless architectures (e.g., AWS Lambda) for scalable deployment of AI features. - Prior experience in ML frameworks like TensorFlow, PyTorch, or ONNX. - Understanding of data privacy and security best practices in AI applications. Soft Skills: - Strong problem-solving skills and ability to work independently and as part of a team. - Excellent communication skills to translate technical requirements into actionable development tasks. - Proactive approach to staying updated with emerging AI technologies and frameworks. Additional Information Why Join InPost? - This is a unique opportunity to work on cutting-edge AI applications that leverage the latest advancements in Generative AI and Large Language Models, impacting the future of intelligent software development. - The option to work from the office or 100% remotely - Opportunity to work in a diverse, international and cross-functional environment, along with leading experts. - Fulfilling careers with a range of benefits for employees and invests in providing training opportunities for their development. - Involvement in technology monitoring and choices - Your impact will be visible instantly and you will be making a difference in our users lives - Direction: Software Developers - InPost Tech - Organisation: InPost Group - Technology
Related Guides
Related Job Pages
More AI Engineer Jobs
Freelance Agent Evaluation Engineer - AI Projects on Mindrift
MindriftApply → Pass qualification(s) → Join a project → Complete tasks → Get paid. Project time expectations: Tasks are estimated to require around 10–20 hours per week during active phases, based on project requirements; This is an estimate, not a guaranteed workload, and applies only while the project is active. Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project.
Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What this opportunity involves We're building a dataset to evaluate AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments: - Build virtual companies following a high-level plan - codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history - Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is solvable and the evaluation is fair - Design tasks set in isolated environments - emulations of a developer's workstation: a Linux machine with development tools (terminal, CLI), MCP servers (repository, task tracker, messenger, documentation, etc.), and a real web application codebase - Write tests that accept all correct solutions and reject incorrect ones - neither too strict (breaking on valid approaches) nor too lenient (passing bad ones) - Iterate with an AI agent on tests - verifying they catch real problems, don't miss bad solutions, and don't break on good ones - Review code written by agents, analyze why an agent failed or succeeded, and design edge cases and adversarial scenarios - Iterate based on feedback from expert QA reviewers who score your work on quality criteria What this is NOT - Not data labeling - Not prompt engineering - Not writing code from scratch - the agent writes most of the code; you guide and evaluate A significant part of the work is done together with AI - it's very hard to create tasks that challenge frontier models without using frontier models. What we look for This opportunity is a good fit for experienced developers, software engineers, and/or test automation specialists open to part-time, non-permanent projects. Ideally, contributors will have: - Degree in Computer Science, Software Engineering, or related fields - 5+ years in software development, primarily Python (FastAPI, pytest, async/await, subprocess, file operations) - Background in full-stack development, with experience building React-based interfaces (JavaScript/TypeScript) and robust back-end systems - Experience writing tests (functional, integration — not just running them) - Docker containers, and familiarity with infrastructure tools (Postgres, Kafka, Redis) - CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results) - English proficiency - B2 You don't need to be an expert in every item, but you should be comfortable reading and reasoning about code across the stack. Why this is hard - Frontier models are already good at coding. Creating a task that genuinely challenges the best models is non-trivial. You need to deeply understand where models fail and what scenarios reveal the difference between a good and a bad solution. - Tasks have many valid solutions. Writing tests that accept all correct solutions and reject incorrect ones is harder than it sounds. How it works Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid Effort estimate Tasks for this project are estimated to take 20 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted. Compensation On this project, contributors can earn up to $50 per hour equivalent, depending on their level and pace of contribution. Compensation varies across projects depending on scope, complexity, and required expertise. Please note that other projects on the platform may offer different earning levels based on their requirements.
Freelance Agent Evaluation Engineer - AI Projects on Mindrift
MindriftApply → Pass qualification(s) → Join a project → Complete tasks → Get paid. Project time expectations: Tasks are estimated to require around 10–20 hours per week during active phases, based on project requirements; This is an estimate, not a guaranteed workload, and applies only while the project is active. Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project.
Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What this opportunity involves We're building a dataset to evaluate AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments: - Build virtual companies following a high-level plan - codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history - Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is solvable and the evaluation is fair - Design tasks set in isolated environments - emulations of a developer's workstation: a Linux machine with development tools (terminal, CLI), MCP servers (repository, task tracker, messenger, documentation, etc.), and a real web application codebase - Write tests that accept all correct solutions and reject incorrect ones - neither too strict (breaking on valid approaches) nor too lenient (passing bad ones) - Iterate with an AI agent on tests - verifying they catch real problems, don't miss bad solutions, and don't break on good ones - Review code written by agents, analyze why an agent failed or succeeded, and design edge cases and adversarial scenarios - Iterate based on feedback from expert QA reviewers who score your work on quality criteria What this is NOT - Not data labeling - Not prompt engineering - Not writing code from scratch - the agent writes most of the code; you guide and evaluate A significant part of the work is done together with AI - it's very hard to create tasks that challenge frontier models without using frontier models. What we look for This opportunity is a good fit for experienced developers, software engineers, and/or test automation specialists open to part-time, non-permanent projects. Ideally, contributors will have: - Degree in Computer Science, Software Engineering, or related fields - 5+ years in software development, primarily Python (FastAPI, pytest, async/await, subprocess, file operations) - Background in full-stack development, with experience building React-based interfaces (JavaScript/TypeScript) and robust back-end systems - Experience writing tests (functional, integration — not just running them) - Docker containers, and familiarity with infrastructure tools (Postgres, Kafka, Redis) - CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results) - English proficiency - B2 You don't need to be an expert in every item, but you should be comfortable reading and reasoning about code across the stack. Why this is hard - Frontier models are already good at coding. Creating a task that genuinely challenges the best models is non-trivial. You need to deeply understand where models fail and what scenarios reveal the difference between a good and a bad solution. - Tasks have many valid solutions. Writing tests that accept all correct solutions and reject incorrect ones is harder than it sounds. How it works Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid Effort estimate Tasks for this project are estimated to take 20 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted. Compensation On this project, contributors can earn up to $50 per hour equivalent, depending on their level and pace of contribution. Compensation varies across projects depending on scope, complexity, and required expertise. Please note that other projects on the platform may offer different earning levels based on their requirements.
Freelance Agent Evaluation Engineer - AI Projects on Mindrift
MindriftApply → Pass qualification(s) → Join a project → Complete tasks → Get paid. Project time expectations: Tasks are estimated to require around 10–20 hours per week during active phases, based on project requirements; This is an estimate, not a guaranteed workload, and applies only while the project is active. Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project.
Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What this opportunity involves We're building a dataset to evaluate AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments: - Build virtual companies following a high-level plan - codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history - Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is solvable and the evaluation is fair - Design tasks set in isolated environments - emulations of a developer's workstation: a Linux machine with development tools (terminal, CLI), MCP servers (repository, task tracker, messenger, documentation, etc.), and a real web application codebase - Write tests that accept all correct solutions and reject incorrect ones - neither too strict (breaking on valid approaches) nor too lenient (passing bad ones) - Iterate with an AI agent on tests - verifying they catch real problems, don't miss bad solutions, and don't break on good ones - Review code written by agents, analyze why an agent failed or succeeded, and design edge cases and adversarial scenarios - Iterate based on feedback from expert QA reviewers who score your work on quality criteria What this is NOT - Not data labeling - Not prompt engineering - Not writing code from scratch - the agent writes most of the code; you guide and evaluate A significant part of the work is done together with AI - it's very hard to create tasks that challenge frontier models without using frontier models. What we look for This opportunity is a good fit for experienced developers, software engineers, and/or test automation specialists open to part-time, non-permanent projects. Ideally, contributors will have: - Degree in Computer Science, Software Engineering, or related fields - 5+ years in software development, primarily Python (FastAPI, pytest, async/await, subprocess, file operations) - Background in full-stack development, with experience building React-based interfaces (JavaScript/TypeScript) and robust back-end systems - Experience writing tests (functional, integration — not just running them) - Docker containers, and familiarity with infrastructure tools (Postgres, Kafka, Redis) - CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results) - English proficiency - B2 You don't need to be an expert in every item, but you should be comfortable reading and reasoning about code across the stack. Why this is hard - Frontier models are already good at coding. Creating a task that genuinely challenges the best models is non-trivial. You need to deeply understand where models fail and what scenarios reveal the difference between a good and a bad solution. - Tasks have many valid solutions. Writing tests that accept all correct solutions and reject incorrect ones is harder than it sounds. How it works Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid Effort estimate Tasks for this project are estimated to take 20 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted. Compensation On this project, contributors can earn up to $50 per hour equivalent, depending on their level and pace of contribution. Compensation varies across projects depending on scope, complexity, and required expertise. Please note that other projects on the platform may offer different earning levels based on their requirements.
Freelance Agent Evaluation Engineer - AI Projects on Mindrift
MindriftApply → Pass qualification(s) → Join a project → Complete tasks → Get paid. Project time expectations: Tasks are estimated to require around 10–20 hours per week during active phases, based on project requirements; This is an estimate, not a guaranteed workload, and applies only while the project is active. Note: Rates vary based on expertise, skills assessment, location, project needs, and other factors. Higher rates may be offered to highly specialized experts. Lower rates may apply during onboarding or non-core project phases. Payment details are shared per project.
Please submit your CV in English and indicate your level of English proficiency. Mindrift connects specialists with project-based AI opportunities for leading tech companies, focused on testing, evaluating, and improving AI systems. Participation is project-based, not permanent employment. What this opportunity involves We're building a dataset to evaluate AI coding agents — how well a model handles real-world developer tasks. You'll create challenging tasks and evaluation criteria within realistic simulated environments: - Build virtual companies following a high-level plan - codebase, infrastructure, and context (conversations, documentation, tickets) that form a realistic environment with development history - Assemble and calibrate tasks from intermediate states of the virtual company: craft the prompt, define evaluation criteria, and ensure the task is solvable and the evaluation is fair - Design tasks set in isolated environments - emulations of a developer's workstation: a Linux machine with development tools (terminal, CLI), MCP servers (repository, task tracker, messenger, documentation, etc.), and a real web application codebase - Write tests that accept all correct solutions and reject incorrect ones - neither too strict (breaking on valid approaches) nor too lenient (passing bad ones) - Iterate with an AI agent on tests - verifying they catch real problems, don't miss bad solutions, and don't break on good ones - Review code written by agents, analyze why an agent failed or succeeded, and design edge cases and adversarial scenarios - Iterate based on feedback from expert QA reviewers who score your work on quality criteria What this is NOT - Not data labeling - Not prompt engineering - Not writing code from scratch - the agent writes most of the code; you guide and evaluate A significant part of the work is done together with AI - it's very hard to create tasks that challenge frontier models without using frontier models. What we look for This opportunity is a good fit for experienced developers, software engineers, and/or test automation specialists open to part-time, non-permanent projects. Ideally, contributors will have: - Degree in Computer Science, Software Engineering, or related fields - 5+ years in software development, primarily Python (FastAPI, pytest, async/await, subprocess, file operations) - Background in full-stack development, with experience building React-based interfaces (JavaScript/TypeScript) and robust back-end systems - Experience writing tests (functional, integration — not just running them) - Docker containers, and familiarity with infrastructure tools (Postgres, Kafka, Redis) - CI/CD understanding (GitHub Actions as a user: triggers, labels, reading results) - English proficiency - B2 You don't need to be an expert in every item, but you should be comfortable reading and reasoning about code across the stack. Why this is hard - Frontier models are already good at coding. Creating a task that genuinely challenges the best models is non-trivial. You need to deeply understand where models fail and what scenarios reveal the difference between a good and a bad solution. - Tasks have many valid solutions. Writing tests that accept all correct solutions and reject incorrect ones is harder than it sounds. How it works Apply → Pass qualification(s) → Join a project → Complete tasks → Get paid Effort estimate Tasks for this project are estimated to take 20 hours to complete, depending on complexity. This is an estimate and not a schedule requirement; you choose when and how to work. Tasks must be submitted by the deadline and meet the listed acceptance criteria to be accepted. Compensation On this project, contributors can earn up to $50 per hour equivalent, depending on their level and pace of contribution. Compensation varies across projects depending on scope, complexity, and required expertise. Please note that other projects on the platform may offer different earning levels based on their requirements.