Project details

Recommended projects

AI Agent Evaluation Analyst (m/f/d)

We are looking for a Freelance Agent Evaluation Analyst to take ownership of quality, structure, and insight across the project. This role goes far beyond task-checking: it’s about critical thinking, systems-level analysis, and ensuring clarity, reliability, and consistency at scale. You’ll work as both a hands-on evaluator and an analyst, collaborating with domain experts, delivery managers, and engineers. Beyond reviewing outputs, you’ll be expected to understand the “why” behind the work, identify logical gaps or inconsistencies, and propose meaningful improvements. This is a flexible, impact-driven role where you’ll have space to grow, contribute ideas, and help shape how evaluation and quality are scaled across the project.
This role is especially well-suited for:
  • Analysts, researchers, or consultants with strong structuring and reasoning skills
  • Junior product managers or strategists curious about AI and evaluation work
  • Smart problem-solvers (students or early-career professionals) who enjoy digging into logic, systems, and edge cases
You do not need a coding background. What matters most is curiosity, intellectual rigor, and the ability to evaluate complex setups with precision.
What you’ll be doing:
  • Fully own the QA pipeline for agent evaluation tasks
  • Review and validate tasks and golden paths created by scenario writers and experts
  • Spot logical inconsistencies, vague requirements, hidden risks, and unrealistic assumptions
  • Provide structured feedback and ensure quality alignment across contributors
  • Train, onboard, and mentor new QA team members
  • Collaborate with domain experts, delivery managers, and engineers to improve test clarity and coverage
  • Maintain and improve QA checklists, SOPs, and review guidelines
  • Contribute to test planning, prioritization, and quality benchmarks
  • Take initiative to suggest new approaches, tools, and processes that help scale validation and analysis
AI Studio
Amsterdam, Netherlands
100% remote

Freelance AI Consultant (German) (m/f/d)

For our client, we are looking for a German-speaking AI consultant. As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum a few hours/week) and those interested in full-time opportunities.
Responsibilities:
  • Carefully review provided data (text, images, or videos).
  • Review tasks submitted by the developer team and ensure quality assurance/quality control.
  • Label or classify content based on project guidelines.
  • Identify and flag factually incorrect, sensitive, inappropriate, or unclear material.
AI Studio
100% remote

Freelance AI Consultant (Japanese) (m/f/d)

We’re looking for a Japanese-speaking AI consultant for our client. As a consultant, you may be asked to join online projects to train models in your area of expertise. This flexible role is open to experts seeking part-time work (minimum a few hours per week) as well as those interested in full-time opportunities.
Responsibilities:
  • Carefully review the provided data (text, images, or videos).
  • Evaluate tasks submitted by the development team and handle quality assurance/quality control.
  • Label or categorize content following project guidelines.
  • Identify and flag factually incorrect, sensitive, inappropriate, or unclear material.
AI Studio
100% remote

Freelance Rust Developer (m/f/d)

For an AI lab, we are looking for a Rust Developer to train an AI model (Large Language Model - LLM). You will help the AI make sense of the world. As a consultant, you may be invited to join online projects to train the model in your area of expertise. This flexible role works for both experts seeking part-time work (minimum a few hours/week) and those interested in full-time roles.
  • Code generation and code review
  • Prompt evaluation and complex data annotation
  • Training and evaluating large language models
  • Benchmarking and agent-based code execution in sandboxed environments
  • Working across multiple programming languages (Python, JavaScript/TypeScript, Rust, SQL, etc.)
  • Adapting guidelines for new domains and use cases
  • Following project-specific rubrics and requirements
  • Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
AI Lab
100% remote

Freelance Ruby Developer (m/f/d)

For an AI lab, we are looking for a Ruby Developer to train an AI model (Large Language Model - LLM). You will help the AI make sense of the world. As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum a few hours/week) and those interested in full-time opportunities.
  • Code generation and code review
  • Prompt evaluation and complex data annotation
  • Training and evaluation of large language models
  • Benchmarking and agent-based code execution in sandboxed environments
  • Working across multiple programming languages (Python, JavaScript/TypeScript, Rust, SQL, etc.)
  • Adapting guidelines for new domains and use cases
  • Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
AI Lab
100% remote

Developer for Consent Management Implementation (m/f/d)

To replace the consent layers currently displayed on the web by third-party CMPs for our international brands, these layers need to be newly implemented so they can be maintained and delivered in-house. This requires solid knowledge of TypeScript, Vue.js, and classic web display techniques (HTML and CSS). The goal is to deliver executable code that implements all requirements and includes automated tests that prove correct functionality.
Scope of work: The main focus is on developing the elements needed to decide on the approach and on implementing measures along the project course that results from it. This specifically includes the following service packages:
  • Implementation of code
  • Implementation of executable tests that must pass for delivery, test coverage >= 80%
  • Creation of documentation for the code
  • Creation of brand-specific cmp-config files
  • Creation of a project (including treasury requirements) as a copy of the consent management platform
  • Removal of netID references
  • Creation of brand-specific settings and files for custom purposes/vendors
  • Addition of new brand-specific CSS themes (variable values, logos, etc.)
  • Inclusion of the required official IAB GVL translations (ES, FR) in the weekly sync with the GVL
  • Implementation of I18n and preparation of brand-specific data sources
  • Implementation of PMC2.0 backend usage modules
  • Implementation of the playout logic
  • Implementation of the layer initialization process (mode=default and mode=resurface)
  • CDN upload and release process
  • Project documentation
Project implementation:
  • The desired result should be written in TypeScript and Vue.js, built with Vite, and tested with Vitest.
Telecommunications
Karlsruhe, Germany
100% remote

Freelance AI Consultant (Korean) (m/f/d)

For our client, we are looking for a Korean-speaking AI consultant. As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum a few hours/week) and those interested in full-time opportunities.
Responsibilities:
  • Carefully review provided data (text, images, or videos).
  • Review tasks submitted by the developer team and ensure quality assurance/quality control.
  • Label or classify content based on project guidelines.
  • Identify and flag factually incorrect, sensitive, inappropriate, or unclear material.
AI Studio
100% remote

AI Evaluation Consultant (all genders)

We are seeking an analytical and technically-minded professional to:
  • Evaluate AI outputs and processes
  • Ensure quality, accuracy, and reliability
  • Identify logical errors, risks, and structural inconsistencies
  • Provide actionable insights and recommendations to the team
Ideal candidates:
  • Consultants, auditors, analysts, data researchers, or business/technical analysts with strong reasoning skills
  • Professionals curious about AI, process improvement, and quality evaluation
  • Problem-solvers who enjoy analyzing complex systems, logic, and scenarios
Key Responsibilities:
  • Lead evaluation of AI outputs and related processes
  • Review tasks against expected/ideal scenarios; identify gaps and risks
  • Provide structured, actionable recommendations to engineers, domain experts, and managers
  • Maintain and improve evaluation guidelines, checklists, SOPs
  • Suggest new approaches, tools, and processes to enhance AI evaluation
AI Labs
100% remote

Freelance Java Developer (m/f/d)

For an AI lab, we are looking for a Java Developer to train an AI model (Large Language Model - LLM). You will help the AI make sense of the world. As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum a few hours/week) and those interested in full-time opportunities.
  • Code generation and code review
  • Prompt evaluation and complex data annotation
  • Training and evaluation of large language models
  • Benchmarking and agent-based code execution in sandboxed environments
  • Working across multiple programming languages
  • Adapting guidelines for new domains and use cases
  • Following project-specific rubrics and requirements
  • Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
AI Lab
100% remote

Freelance Consultant - AI Training (Portuguese-Speaking)

For an AI lab, we are looking for Portuguese-speaking freelance consultants to train an AI model (Large Language Model - LLM) in various domains. You will help the AI make sense of the world. As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum a few hours/week) and those interested in full-time opportunities.
Responsibilities:
  • Carefully review and analyze data provided by the AI in your domain of expertise.
  • Improve the model in your domain of expertise.
  • Review AI results and ensure quality assurance/quality control.
  • Label or classify content based on project guidelines.
AI Lab
100% remote

Freelance Automotive Engineer (with Python) - Quality Assurance / AI Trainer

Generative AI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. Although every project is unique, you might typically:
  • Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Physics, while also developing references and examples of tasks.
  • Experts Acquisition: Assess the qualification tests of experts, ensuring their competency.
  • Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines.
  • Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote
New

AI Trainer for Vibe Coding (m/f/d)

An AI Lab is looking for an AI Trainer for Vibe Coding. This role involves producing accurate, well-reasoned outputs across diverse domains, leveraging automation and AI tools. The position requires expertise in coding and optimizing Python scripts, handling large datasets, improving AI-generated content, and formatting and troubleshooting technical workflows. This is a remote part-time role that can be flexibly tailored to your availability, from just a few hours per week to full-time.
Key responsibilities:
  • Conduct advanced web research and data mining using multiple tools to locate and extract information from official sources. Use LLMs and advanced prompts to refine search strategies and validate data accuracy by cross-referencing authoritative sources.
  • Perform web scraping and data extraction by navigating complex website structures and multi-level pages (regions → companies → detailed pages). Handle dynamic content, archived pages, and various HTML formats, and organize extracted data into clean, well-formatted CSV files.
  • Write and optimize Python scripts for data processing and analysis using libraries such as pandas, BeautifulSoup, Selenium, and matplotlib. Transform raw data into structured formats (CSV, JSON, tables) and create visualizations when required.
  • Carry out data processing and quality assurance by cleaning, validating, and structuring datasets. Ensure data integrity across multiple sources, apply formatting specifications, and run verification steps to maintain high output quality.
  • Apply strong problem-solving and task execution skills to break down complex workflows, troubleshoot technical issues independently, and adapt quickly between different domains and task types with minimal supervision.
  • Produce clear documentation and high-quality outputs that follow exact requirements for file formats, naming conventions, and data structure. Maintain reproducible workflows and well-organized code.
AI Lab
100% remote

Fullstack Engineer (m/f/d)

  • Product and web development in the data-driven domain
  • Co-design of the software architecture for new data products
  • Collaboration in interdisciplinary teams (e.g. with data scientists and business developers)
Media company
Munich, Germany
100% remote
New

AI Trainer - Machine Learning (m/f/d)

For an AI lab, we are looking for machine learning experts to train an AI model (Large Language Model - LLM). GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically:
  • Design original computational STEM problems that simulate real scientific workflows
  • Create problems that require Python programming to solve
  • Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks)
  • Develop problems requiring non-trivial reasoning chains and creative problem-solving approaches
  • Verify solutions using Python with standard libraries (numpy, pandas, scipy, sklearn)
  • Document problem statements clearly and provide verified correct answers
AI Lab
100% remote

Freelance Cybersecurity Consultant for AI Red Teaming

For an AI lab, we are looking for cybersecurity consultants to train an AI model (Large Language Model - LLM). You will help the AI make sense of the world. As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum a few hours/week) and those interested in full-time opportunities.
  • Evaluate and red-team AI models, agents, and machine learning systems for vulnerabilities and safety risks.
  • Create offline-reproducible and auto-evaluable test cases to test the safety and capability of AI agents.
  • Develop and implement automation scripts, custom tools, environments, and test harnesses.
  • Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model.
  • Advise on cybersecurity best practices and policy implications.
AI Lab
100% remote

Mathematician with Python Experience (m/f/d)

For an AI lab, we are looking for mathematicians with Python experience to train an AI model (Large Language Model - LLM). As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum a few hours/week) and those interested in full-time opportunities. Although every project is unique, you might typically:
  • Design original computational mathematics problems that simulate real mathematical research workflows.
  • Create problems requiring Python programming to solve (using numpy, scipy, sympy).
  • Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks).
  • Develop problems requiring non-trivial reasoning chains in areas like number theory, combinatorics, graph theory, and numerical analysis.
  • Base problems on real research challenges or practical applications from mathematical practice.
  • Verify solutions using Python with standard mathematical libraries.
  • Document problem statements clearly and provide verified correct answers.
Support in:
  • Number Theory: Prime factorization, Diophantine equations, modular arithmetic, cryptographic computations.
  • Combinatorics: Enumerations, partitions, generating functions, combinatorial optimization.
  • Graph Theory: Network analysis, path finding, graph coloring, spanning trees.
  • Numerical Analysis: Root finding, numerical integration, differential equations, matrix computations.
  • Discrete Mathematics: Recurrence relations, algorithmic complexity, discrete optimization.
  • Algebra: Polynomial computations, group theory calculations, matrix decompositions.
AI Lab
100% remote

Physicist with Python Experience (m/f/d)

For an AI lab, we are looking for physicists with Python experience to train an AI model (Large Language Model - LLM). GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as a physicist, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically:
  • Design original computational physics problems that simulate real research workflows.
  • Create problems requiring Python programming to solve (using numpy, scipy, sympy).
  • Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks).
  • Develop problems requiring non-trivial reasoning chains.
  • Base problems on real research challenges or practical applications from physical practice.
  • Verify solutions using Python with standard libraries.
  • Document problem statements clearly and provide verified correct answers.
AI Lab
100% remote

ERP Transformation Manager (m/f/d)

An established company is looking for an experienced ERP Transformation Manager to take full responsibility for planning and steering a comprehensive ERP transformation program. The project's goal is harmonizing processes, implementing a new ERP system, and meeting IFRS requirements. The ERP Transformation Manager will analyze, redesign, and standardize the commercial core processes in civil and rail construction. This includes translating IFRS requirements into system structures and posting logic, closely coordinating with Finance, Controlling, Project Management, and IT departments. The role includes managing the ERP rollout, including fit-gap analysis, process design, test management, and migration. In addition, a unified reporting and KPI framework for group financial statements and project management will be established. The manager will act as the central interface between operational units, Finance, management, and the group, and will set up a sustainable change and training concept for users.
  • Planning and steering the ERP transformation program (IFRS transition, process harmonization, ERP rollout)
  • Analyzing, redesigning, and standardizing commercial core processes
  • Translating IFRS requirements into system structures and posting logic
  • Managing the ERP rollout, including fit-gap analysis, process design, test management, and migration
  • Building a unified reporting and KPI framework
  • Stakeholder management and ensuring smooth communication
  • Leading interdisciplinary project teams and managing external consultants and implementation partners
  • Establishing a sustainable change and training concept
  • Ensuring measurable process improvements after the ERP system goes live
Infrastructure construction
Eisenach, Germany
70% remote

Freelance Data Annotator (Spanish) (m/f/d)

For an AI studio, we are looking for a Spanish-speaking data annotation specialist. Annotation is what helps AI make sense of the world. As a QA Annotator, you may be invited to take part in online projects such as rating AI-generated content, evaluating factual accuracy, or comparing responses, when projects are available. This flexible role accommodates both experts seeking part-time engagement (minimum a few hours/week) and those interested in full-time opportunities.
Responsibilities:
  • Carefully review provided data (text, images, or videos).
  • Review tasks submitted by the annotator team and ensure quality assurance/quality control.
  • Label or classify content based on project guidelines.
  • Identify and flag factually incorrect, sensitive, inappropriate, or unclear material.
AI Lab
100% remote

Freelance AI Consultant (Chinese) (m/f/d)

For our client, we are looking for a Chinese-speaking AI consultant. As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum a few hours/week) and those interested in full-time opportunities.
Responsibilities:
  • Carefully review provided data (text, images, or videos).
  • Review tasks submitted by the developer team and ensure quality assurance/quality control.
  • Label or classify content based on project guidelines.
  • Identify and flag factually incorrect, sensitive, inappropriate, or unclear material.
AI Studio
100% remote

Frontend Developer with Angular Experience for an HR Platform

Reach out to us if you are interested in working with us on the project.
FRATCH
Munich
90% remote

Time's up! We are no longer accepting applications.

QA Specialist – Digital Implementation

Industry
Manufacturing
Area
Quality Assurance (QA)

Project info

  • Period
    07.07.2025 - 21.10.2025
  • Capacity
    from 90%
  • Daily rate
    750 - 850€
  • Location
    Germany
  • Language
    • English
      (Advanced)
  • Remote
    from 95%

Description

Within a digitization initiative, software quality is to be ensured and improved through modern testing, analysis and review methods.

The goal is to increase development quality and efficiency through strategic measures and coaching.

  • Definition, monitoring and improvement of test coverage strategies for critical software components.
  • Organization and execution of code reviews in compliance with coding standards, architectural and security requirements.
  • Setup, execution and evaluation of SonarQube analyses, as well as coordination of technical debt and security fix remediation.
  • Training and support of developers in Clean Code, TDD and CI/CD practices.
  • Assessment and continuous improvement of QA processes, tools and metrics.

Requirements

  • Degree in (business) computer science or a related field.
  • Solid experience in software development and quality assurance.
  • Familiarity with GitLab CI/CD and static code analysis (ideally SonarQube).
  • Knowledge of Clean Code, TDD, CI/CD, review concepts and testing strategies.
  • Experience with tools such as GitLab, SonarQube, optionally Jira/Confluence.
  • English at working level.
  • Experience coaching or sparring with development teams.
  • Certifications or relevant qualifications in QA.
  • German skills for internal communication.
  • Strong communication skills; team player.
  • Proactive and structured way of working.