Project details

Recommended projects

AI Evaluation Consultant (m/w/d)

We are seeking an analytical and technically-minded professional to: - Evaluate AI outputs and processes - Ensure quality, accuracy, and reliability - Identify logical errors, risks, and structural inconsistencies - Provide actionable insights and recommendations to the team Ideal candidates: - Consultants, auditors, analysts, data researchers, or business/technical analysts with strong reasoning skills - Professionals curious about AI, process improvement, and quality evaluation - Problem-solvers who enjoy analyzing complex systems, logic, and scenarios Key Responsibilities: - Lead evaluation of AI outputs and related processes - Review tasks against expected/ideal scenarios; identify gaps and risks - Provide structured, actionable recommendations to engineers, domain experts, and managers - Maintain and improve evaluation guidelines, checklists, SOPs - Suggest new approaches, tools, and processes to enhance AI evaluation
AI Labs
100% remote

Evaluation Scenario Writer (m/w/d)

We’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate human-performed tasks and define gold-standard behavior to compare agent actions against. You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions. Although every project is unique, you might typically: - Designing structured test scenarios based on real-world tasks. - Defining the golden path and acceptable agent behavior. - Annotating task steps, expected outputs, and edge cases. - Working with devs to test your scenarios and improve clarity. - Reviewing agent outputs and adapting tests accordingly
100% remote

Freelance Chemistry Expert for AI Model Training (m/f/d)

An AI lab is looking for a freelance chemistry experts to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in chemistry contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Evaluate AI models for chemistry applications. - Analyze model outputs and provide feedback for improvement. - Collaborate with the development team to ensure alignment with industry standards. - Document findings and recommendations for model optimization. - Conduct tests to validate model performance and reliability.
AI Lab
100% remote

Freelance Biology Expert for AI Model Training (m/f/d)

An AI lab is looking for a freelance biology experts to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in biology (all areas) contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Evaluate AI models for biology applications. - Analyze model outputs and provide feedback for improvement. - Collaborate with the development team to ensure alignment with industry standards. - Document findings and recommendations for model optimization. - Conduct tests to validate model performance and reliability.
AI Lab
100% remote

Freelance Automotive Engineer (with Python) - Quality Assurance / AI Trainer

Generative AI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. Although every project is unique, you might typically: - Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Physics, while also developing references and examples of tasks. - Experts Acquisition: Assess the qualification tests of experts, ensuring their competency. - Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines. - Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote

Freelance Ruby Developer (m/f/d)

For an AI lab we are looking for Ruby Developer to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities. - Code generation and code review - Prompt evaluation and complex data annotation - Training and evaluation of large language models - Benchmarking and agent-based code execution in sandboxed environments - Working across multiple programming languages (Python, JavaScript/TypeScript, Rust, SQL, etc.) - Adapting guidelines for new domains and use cases - Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
AI Lab
100% remote

Quality Compliance Auditor (GCP/GCLP/GVP) (M/W/D)

An organization is looking for an experienced Quality Compliance Auditor responsible for ensuring compliance with GCP, GCLP, and GVP standards. The project's goal is to conduct internal and external audits, prepare and support regulatory inspections, and identify compliance gaps and derive corrective actions. The role includes planning and executing audits, supporting regulatory inspections, and ensuring compliance with ICH guidelines as well as EMA/FDA regulations. - Conducting internal and external audits (GCP, GCLP, GVP) - Preparing and supporting regulatory inspections (e.g. MHRA, FDA, EMA) - Identifying compliance gaps and deriving corrective actions
Pharma
Germany
100% remote

Salesforce Service Cloud / Field Service Consultant (m/f/d)

For our client, we are looking for a Salesforce Service Cloud / Field Service Consultant (m/f/d) starting immediately. The role includes analyzing business requirements, developing solutions within the Salesforce platform and collaborating with various stakeholders to ensure seamless integration and use of the tools. - Analysis of business requirements and translating them into technical solutions within Salesforce Service Cloud and Field Service. - Implementation and configuration of Salesforce solutions. - Advising and training end users and stakeholders. - Collaborating with internal and external teams to ensure successful project delivery. - Supporting the integration of Salesforce with other systems.
100% remote

Senior Project Manager Customer Interaction

An organization is looking for support for a project to evaluate, implement, and further develop quality surveys in digital channels. The goal of the project is to increase customer satisfaction in digital channels by evaluating, implementing, and enhancing survey methods to enable consistent measurement of customer satisfaction across all channels. At the same time, areas for improvement should be identified and implemented. The role includes consulting, developing, and implementing measures to collect and improve customer satisfaction in digital channels. Main tasks: - Advising on survey methods to capture customer experience and quality in digital channels, including market standards, benchmarks, and future orientation. - Developing a future model for quality in digital channels, relevant KPIs, survey methods, and standard processes. - Implementing decided measures, including interface management and coordination with technology partners and social partners. - Testing implemented measures to collect data and ensure required standards are met. - Consolidating and listing existing and missing customer survey methods/quality KPIs across all responsible digital channels. - Advising on the creation of decision templates and implementing the necessary actions. - Identifying areas for improvement and developing a standard process for transparency and execution.
Telecommunication
Munich, Germany
100% remote

Freelance Physics Expert (with Python) - Quality Assurance / AI Trainer

Generative AI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. Although every project is unique, you might typically: - Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Physics, while also developing references and examples of tasks. - Experts Acquisition: Assess the qualification tests of experts, ensuring their competency. - Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines. - Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote

Business Analyst – SAP S/4HANA Output Management (m/f/d)

- A company is looking for an experienced Business Analyst to support the transformation from SAP ECC to S/4HANA Utilities. - The project goal is to analyze, document, and optimize output and archiving processes, as well as to create functional designs and specifications. - The analyst will work closely with product owners, IT, and business units to align feasibility, effort, and prioritization of requirements.
Energy
Munich, Germany
100% remote

Freelance Electrical Engineer with Python Experience (m/w/d)

Generative AI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. Although every project is unique, you might typically: - Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Physics, while also developing references and examples of tasks. - Experts Acquisition: Assess the qualification tests of experts, ensuring their competency. - Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines. - Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote

Senior Regulatory Compliance Expert (FDA Inspection Preparation) (m/f/d)

A company is looking for a Senior Regulatory Compliance Expert to support its team in getting ready for FDA inspections. The role includes conducting mock inspections, providing strategic advice on inspection readiness, and assisting with pre-approval and routine inspections. The ideal candidate has extensive expertise in compliance with legal requirements, especially FDA standards, and plays a key role in ensuring the company meets global compliance demands. - Conduct mock inspections according to FDA standards - Provide strategic advice on inspection readiness - Support pre-approval and routine inspections
Pharma
Munich, Germany
100% remote

AI Consultant - Machine Learning (m/w/d)

For an AI lab we are looking for Machine learning experts to train an AI model (Large Language Model - LLM). GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: - Design original computational STEM problems that simulate real scientific workflows - Create problems that require Python programming to solve - Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks) - Develop problems requiring non-trivial reasoning chains and creative problem-solving approaches - Verify solutions using Python with standard libraries (numpy, pandas, scipy, sklearn) - Document problem statements clearly and provide verified correct answers
AI Lab
100% remote

Cyber Risk Consulting (Senior Level)

- Identification and analysis of cyber risks arising from changes in the digital landscape and the growing capabilities of attackers. - Development and assignment of appropriate countermeasures, as well as creation of roadmaps to effectively address digital threats. - Translation of security incidents and threats into concrete, business-relevant risks with suitable countermeasures. - Continuous improvement of processes for managing the cyber risk lifecycle and increasing the maturity of the Cyber Risk Desk. - Preparation of project reports on the status, impact, and necessary actions related to identified risks. - Preparation of risk analyses and management processes that comply with applicable regulatory standards (SOX, PCI, data protection). - Conducting an initial risk assessment (likelihood, impact, risk level), including a precise description of the risks, impacts, and probability of occurrence. - Evaluation and detailed description of the residual risk after potential implementation of identified risk mitigation measures.
Telecommunications
Munich, Germany
100% remote

ERP-Transformation Manager (m/w/d)

An established company is looking for an experienced ERP Transformation Manager to take full responsibility for planning and steering a comprehensive ERP transformation program. The project's goal is harmonizing processes, implementing a new ERP system, and meeting IFRS requirements. The ERP Transformation Manager will analyze, redesign, and standardize the commercial core processes in civil and rail construction. This includes translating IFRS requirements into system structures and posting logic, closely coordinating with Finance, Controlling, Project Management, and IT departments. The role includes managing the ERP rollout, including fit-gap analysis, process design, test management, and migration. In addition, a unified reporting and KPI framework for group financial statements and project management will be established. The manager will act as the central interface between operational units, Finance, management, and the group, and will set up a sustainable change and training concept for users. - Planning and steering the ERP transformation program (IFRS transition, process harmonization, ERP rollout) - Analyzing, redesigning, and standardizing commercial core processes - Translating IFRS requirements into system structures and posting logic - Managing the ERP rollout, including fit-gap analysis, process design, test management, and migration - Building a unified reporting and KPI framework - Stakeholder management and ensuring smooth communication - Leading interdisciplinary project teams and managing external consultants and implementation partners - Establishing a sustainable change and training concept - Ensuring measurable process improvements after the ERP system goes live
Infrastrukturbau
Eisenach, Germany
70% remote

Project Manager Magazines / Magazine Production (m/f/d)

- Responsibility for coordinating and managing the entire production process of magazine publications - Planning and overseeing issue structure, schedules, advertisements, and workflows - Close collaboration with editorial team, publishing management, sales, technical, marketing, distribution, printers, and service providers - Quality assurance for layouts, copy, and print approvals - Estimation and organization of add-ons (e.g. inserts, posters, supplements) - Active role in strategic projects, conferences, and the rollout of new formats
Media Company
Munich, Germany
50% remote

AI Consultants - Data Science (m/w/d)

We are seeking experienced data scientists to create computationally intensive data science problems for an advanced AI evaluation project. This is a remote, project-based opportunity for experts who can design challenging problems that require computational methods to solve and mirror the full data science lifecycle - from data acquisition and processing to statistical analysis and actionable business insights. What You'll Do - Design original computational data science problems that simulate real-world analytical workflows across industries (telecom, finance, government, e-commerce, healthcare) - Create problems requiring Python programming to solve (using pandas, numpy, scipy, sklearn, statsmodels, matplotlib, seaborn) - Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks) - Develop problems requiring non-trivial reasoning chains in data processing, statistical analysis, feature engineering, predictive modeling, and insight extraction - Create deterministic problems with reproducible answers - avoid stochastic elements or require fixed random seeds for exact reproducibility - Base problems on real business challenges: customer analytics, risk assessment, fraud detection, forecasting, optimization, and operational efficiency - Design end-to-end problems spanning the complete data science pipeline (data ingestion → cleaning → EDA → modeling → validation → deployment considerations) - Incorporate big data processing scenarios requiring scalable computational approaches - Verify solutions using Python with standard data science libraries and statistical methods - Document problem statements clearly with realistic business contexts and provide verified correct answers
AI Lab
Munich, Germany
100% remote

Freelance Cybersecurity Consultant for AI Red Teaming

For an AI lab we are looking for cybersecurity consultants to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities - Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. - Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. - Develop and implement automation scripts, custom tools, environments and test harnesses. - Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. - Advise on cybersecurity best practices and policy implications.
AI Lab
100% remote

Commissioning & Qualification (C&Q) Engineer (m/f/d)

A company is looking for an experienced Commissioning & Qualification (C&Q) Engineer to qualify and commission production equipment according to GMP standards. The goal of the project is to ensure the technical and organizational requirements for the GMP-compliant qualification of the production equipment. - Independently conduct commissioning and qualification activities, especially in IOQ - Operate PCS7 systems - Work with single-use equipment - Perform commissioning and qualification activities for production equipment - Ensure all technical and organizational prerequisites for C&Q - GMP-compliant qualification of the associated production equipment
Pharma
Munich, Germany
100% remote
New

(Senior) Engineer Module Lead, Series Support for E/E Components (m/f/d)

- You take on the technical leadership of our SE team leads for low-voltage batteries, low-voltage converters, access systems, and central control units - You represent all components of the module to our product lines - You set goals for the module and derive corresponding targets for the SE teams - You are responsible for goal management in the module for features, costs, weight, quality, and deadlines - You are responsible for target management in change, problem, and quality management, as well as the module’s budget - You report to our project management and our product lines
100% remote

Project Manager Brand Guardianship (m/f/d)

The service is requested as part of the Brand Image Pool Photoshoot project. The project includes: - Managing sub-tasks throughout the entire Image Pool motif shooting project from January to June - Taking on brand guardianship tasks during the pool shooting project period - Specific service description without personal reference: - Independently defining, managing and executing the project. This ranges from project management to creating roadmaps and project presentations - Developing ideas and concepts for initiatives - Actively managing project risks - Actively handling project issues, including providing expert advice on escalations - Preparing and following up on stakeholder and steering board meetings - Defining project scope and overall project phases - Providing transparent and appropriate updates to the client on scope, quality, schedule, budget and status
Telecommunication
Munich, Germany
100% remote

EHS Specialist – Body in White (M/W/D)

A company is looking for an experienced EHS Specialist to support their Body in White (BIW) operations. Body in White refers to the stage in car manufacturing where the vehicle's sheet metal components are welded together to form the body shell, prior to painting and the installation of the engine, chassis, or interior trim. The goal of the project is to ensure compliance with environmental, health, and safety regulations during this critical manufacturing phase while optimizing processes and maintaining high safety standards. The role involves collaborating with production and engineering teams to identify risks, implement safety measures, and foster a culture of safety within the organization. Key responsibilities: - Conduct risk assessments and ensure compliance with EHS regulations specific to BIW operations. - Develop and implement safety protocols and procedures tailored to BIW processes. - Monitor and report on EHS performance metrics within the BIW stage. - Provide training and guidance to employees on EHS best practices in automotive manufacturing. - Investigate incidents and implement corrective actions to prevent recurrence. - Collaborate with cross-functional teams to improve safety standards and processes in BIW.
Automotive & Robotics
Brandenburg, Germany
80% remote

EHS Specialist – Cell Manufacturing

A company in the automotive and robotics industry is seeking an experienced EHS Specialist to support cell manufacturing processes. The goal of the project is to ensure compliance with environmental, health, and safety regulations and to promote a safety culture within the manufacturing environment. The role requires close collaboration with cross-functional teams to implement and maintain EHS standards, conduct risk assessments, and drive continuous improvement initiatives. Key responsibilities: - Developing, implementing and maintaining EHS policies and procedures tailored specifically to cell manufacturing. - Conducting regular risk assessments and audits to ensure regulatory compliance. - Training and guiding employees on EHS best practices. - Investigating incidents and implementing corrective actions to prevent recurrence. - Collaborating with internal teams to promote a safety and sustainability culture. - Monitoring and reporting on EHS performance metrics.
Automotive & Robotics
Brandenburg, Germany
80% remote

Chemist with Python Experience (m/w/d)

GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Chemistry, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: - Generate prompts that challenge AI. - Define comprehensive scoring criteria to evaluate the accuracy of the AI’s answers. - Correct the model’s responses based on your domain-specific knowledge.
AI Lab
100% remote

IT Project Manager ServiceNow (Senior)

- A company in the energy and energy services sector is looking for an experienced IT project manager for a ServiceNow project. - The goal of the project is to lead and successfully implement an enterprise ServiceNow solution with a focus on ITSM and Customer Service Management (CSM). - The role includes planning, controlling, and ensuring a stable project flow in close collaboration with internal and external stakeholders. - Operational & strategic service management of the ServiceNow platform - Process ownership for ITSM and CSM (B2B & B2C) - Process design, governance & continuous optimization - Management of external providers and vendors - Monitoring, KPI analysis & deriving improvements - Ensuring stable platform operation
Energy
Germany
100% remote

Freelance Mechanical Engineer with Python Experience (m/w/d)

For an AI lab we are looking for Mechanical Engineer with python experience to train an AI model (Large Language Model - LLM). GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: - Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Mechanical Engineering, while also developing references and examples of tasks. - Experts Acquisition: Assess the qualification tests of experts, ensuring their competency. - Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines. - Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote

AI Consultant for Vibe Coding (m/w/d)

An AI Lab is looking for a AI Trainer for Vibe Coding. This role involves producing accurate, well-reasoned outputs across diverse domains, leveraging automation and AI tools. The position requires expertise in coding and optimizing Python scripts, handling large datasets, improving AI-generated content, and formatting and troubleshooting technical workflows. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Conduct advanced web research and data mining using multiple tools to locate and extract information from official sources. Use LLMs and advanced prompts to refine search strategies and validate data accuracy by cross-referencing authoritative sources. - Perform web scraping and data extraction by navigating complex website structures and multi-level pages (regions → companies → detailed pages). Handle dynamic content, archived pages, and various HTML formats, and organize extracted data into clean, well-formatted CSV files. - Write and optimize Python scripts for data processing and analysis using libraries such as pandas, BeautifulSoup, Selenium, and matplotlib. Transform raw data into structured formats (CSV, JSON, tables) and create visualizations when required. - Carry out data processing and quality assurance by cleaning, validating, and structuring datasets. - - Ensure data integrity across multiple sources, apply formatting specifications, and run verification steps to maintain high output quality. - Apply strong problem-solving and task execution skills to break down complex workflows, troubleshoot technical issues independently, and adapt quickly between different domains and task types with minimal supervision. - Produce clear documentation and high-quality outputs that follow exact requirements for file formats, naming conventions, and data structure. Maintain reproducible workflows and well-organized code.
AI Lab
100% remote
New

IT Project Manager ISO 27.001 - Gap Closure (m/f/d)

A company in the automotive supplier industry is looking for support in the field of cyber security. The goal of the project is to close gaps as part of the ISO 27001 certification. The IT Project Manager will play a central role in steering and monitoring the gap closure measures. - Steering and monitoring gap closure measures. - Consistently tracking tasks, deadlines, and responsibilities. - Coordinating between IT, specialist departments, information security, and, if necessary, external service providers. - Ensuring that measures are implemented in an ISO-27001-compliant, auditable, and documented way. - Transparent status reports to program management and stakeholders. - Support in audit preparation (evidence, measure status, maturity level).
Munich, Germany
20% remote

Mathematician with Python Experience (m/w/d)

For an AI lab we are looking for mathematicians with python experience to train an AI model (Large Language Model - LLM). As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities. Although every project is unique, you might typically: - Design original computational mathematics problems that simulate real mathematical research workflows. - Create problems requiring Python programming to solve (using numpy, scipy, sympy). - Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks). - Develop problems requiring non-trivial reasoning chains in areas like number theory, combinatorics, graph theory, and numerical analysis. - Base problems on real research challenges or practical applications from mathematical practice. - Verify solutions using Python with standard mathematical libraries. - Document problem statements clearly and provide verified correct answers. Support in: - Number Theory: Prime factorization, Diophantine equations, modular arithmetic, cryptographic computations. - Combinatorics: Enumerations, partitions, generating functions, combinatorial optimization. - Graph Theory: Network analysis, path finding, graph coloring, spanning trees. - Numerical Analysis: Root finding, numerical integration, differential equations, matrix computations. - Discrete Mathematics: Recurrence relations, algorithmic complexity, discrete optimization. - Algebra: Polynomial computations, group theory calculations, matrix decompositions.
AI Lab
100% remote

Physicist with Python Experience (m/f/d)

For an AI lab we are looking for phycists with python experience to train an AI model (Large Language Model - LLM). GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as a phycist, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: - Design original computational physic problems that simulate real research workflows. - Create problems requiring Python programming to solve (using numpy, scipy, sympy). - Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks). - Develop problems requiring non-trivial reasoning chains. - Base problems on real research challenges or practical applications from physical practice. - Verify solutions using Python with standard libraries. - Document problem statements clearly and provide verified correct answers.
AI Lab
100% remote

Frontend developer to HR platform with Angular experience

Reach out to us if you are interested in working with us on the project.
FRATCH
Munich
90% remote
Sign up to get access to more exciting projects that match your skills and preferences!

AI Evaluation Consultant (m/w/d)

Sign up to view the number of applicants
Industry
Information Technology (IT)
Areas
Audit
Quality Assurance (QA)

Project info

  • Period
    02.02.2026 - 01.04.2026
  • Capacity
    from 95%
  • Daily rate
    440 - 480€
  • Language
    • English
      (Advanced)
  • Remote
    from 95%

Description

We are seeking an analytical and technically-minded professional to:

  • Evaluate AI outputs and processes
  • Ensure quality, accuracy, and reliability
  • Identify logical errors, risks, and structural inconsistencies
  • Provide actionable insights and recommendations to the team

Ideal candidates:

  • Consultants, auditors, analysts, data researchers, or business/technical analysts with strong reasoning skills
  • Professionals curious about AI, process improvement, and quality evaluation
  • Problem-solvers who enjoy analyzing complex systems, logic, and scenarios

Key Responsibilities:

  • Lead evaluation of AI outputs and related processes
  • Review tasks against expected/ideal scenarios; identify gaps and risks
  • Provide structured, actionable recommendations to engineers, domain experts, and managers
  • Maintain and improve evaluation guidelines, checklists, SOPs
  • Suggest new approaches, tools, and processes to enhance AI evaluation

Requirements

  • Scenario validation, data analysis, auditing, or consulting experience
  • Analytical work in research, technical/business analysis, or risk evaluation

Knowledge & Skills:

  • Strong analytical and critical thinking
  • Attention to detail, reliability, and an ownership mindset
  • Technical understanding: JSON/YAML, basic Git/GitHub
  • Independent, proactive mindset

Nice to Have:

  • Scenario-based testing, annotation workflows, AI/LLM evaluation
  • Experience in cross-functional teams