Project details

Recommended projects

AI Agent Evaluation Analyst

For an AI lab, we are looking for an AI Agent Evaluation Analyst to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum of a few hours/week) and those interested in full-time opportunities.

  • Reviewing evaluation tasks and scenarios for logic, completeness, and realism.
  • Identifying inconsistencies, missing assumptions, or unclear decision points.
  • Helping define clear expected behaviors (gold standards) for AI agents.
  • Annotating cause-effect relationships, reasoning paths, and plausible alternatives.
  • Thinking through complex systems and policies as a human would to ensure agents are tested properly.
  • Working closely with QA, writers, or developers to suggest refinements or edge case coverage.
AI Lab
100% remote

Freelance Mathematics Expert for AI Model Training (m/f/d)

An AI lab is looking for a freelance mathematics expert to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in mathematics contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability, from just a few hours per week to full-time.

Key responsibilities:

  • Evaluate AI models for mathematics applications.
  • Analyze model outputs and provide feedback for improvement.
  • Collaborate with the development team to ensure alignment with industry standards.
  • Document findings and recommendations for model optimization.
  • Conduct tests to validate model performance and reliability.
AI Lab
100% remote

Freelance Chemistry Expert for AI Model Training (m/f/d)

An AI lab is looking for a freelance chemistry expert to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in chemistry contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability, from just a few hours per week to full-time.

Key responsibilities:

  • Evaluate AI models for chemistry applications.
  • Analyze model outputs and provide feedback for improvement.
  • Collaborate with the development team to ensure alignment with industry standards.
  • Document findings and recommendations for model optimization.
  • Conduct tests to validate model performance and reliability.
AI Lab
100% remote

Freelance Physics Expert for AI Model Training (m/f/d)

An AI lab is looking for a freelance physics expert to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in physics contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability, from just a few hours per week to full-time.

Key responsibilities:

  • Evaluate AI models for physics applications.
  • Analyze model outputs and provide feedback for improvement.
  • Collaborate with the development team to ensure alignment with industry standards.
  • Document findings and recommendations for model optimization.
  • Conduct tests to validate model performance and reliability.
AI Lab
100% remote
New

AI Evaluation Consultant (all genders)

We are seeking an analytical and technically-minded professional to:

  • Evaluate AI outputs and processes
  • Ensure quality, accuracy, and reliability
  • Identify logical errors, risks, and structural inconsistencies
  • Provide actionable insights and recommendations to the team

Ideal candidates:

  • Consultants, auditors, analysts, data researchers, or business/technical analysts with strong reasoning skills
  • Professionals curious about AI, process improvement, and quality evaluation
  • Problem-solvers who enjoy analyzing complex systems, logic, and scenarios

Key Responsibilities:

  • Lead evaluation of AI outputs and related processes
  • Review tasks against expected/ideal scenarios; identify gaps and risks
  • Provide structured, actionable recommendations to engineers, domain experts, and managers
  • Maintain and improve evaluation guidelines, checklists, SOPs
  • Suggest new approaches, tools, and processes to enhance AI evaluation
AI Labs
100% remote

Freelance Biology Expert for AI Model Training (m/f/d)

An AI lab is looking for a freelance biology expert to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in contexts across all areas of biology. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability, from just a few hours per week to full-time.

Key responsibilities:

  • Evaluate AI models for biology applications.
  • Analyze model outputs and provide feedback for improvement.
  • Collaborate with the development team to ensure alignment with industry standards.
  • Document findings and recommendations for model optimization.
  • Conduct tests to validate model performance and reliability.
AI Lab
100% remote

Freelance Automotive Engineer (with Python) - Quality Assurance / AI Trainer

Generative AI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. Although every project is unique, you might typically:

  • Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Physics, while also developing references and examples of tasks.
  • Experts Acquisition: Assess the qualification tests of experts, ensuring their competency.
  • Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines.
  • Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote
New

Cyber Risk Consulting (Senior Level)

  • Identification and analysis of cyber risks arising from changes in the digital landscape and the growing capabilities of attackers.
  • Development and allocation of appropriate countermeasures and the creation of roadmaps for effectively addressing digital threats.
  • Translating security incidents and threats into concrete, business-relevant risks with appropriate countermeasures.
  • Continuously improving processes for managing the cyber risk lifecycle and increasing the maturity of the Cyber Risk Desk.
  • Preparing project reports on the status, impacts, and required actions related to identified risks.
  • Conducting risk analyses and management processes that comply with applicable regulatory standards (SOX, PCI, data protection).
  • Performing an initial risk assessment (likelihood, impact, risk level) including a precise description of risks, impacts, and the probability of occurrence.
  • Evaluating and detailing the residual risk remaining after potential implementation of the identified risk mitigation measures.
Telecommunications
Munich, Germany
100% remote

Freelance Cybersecurity Consultant for AI Red Teaming

For an AI lab, we are looking for cybersecurity consultants to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum of a few hours/week) and those interested in full-time opportunities.

  • Evaluate and red team AI models, agents, and machine learning systems for vulnerabilities and safety risks.
  • Create offline, reproducible, and auto-evaluable test cases to test the safety and capability of AI agents.
  • Develop and implement automation scripts, custom tools, environments, and test harnesses.
  • Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model.
  • Advise on cybersecurity best practices and policy implications.
AI Lab
100% remote

AI Consultants - Data Science (m/f/d)

We are seeking experienced data scientists to create computationally intensive data science problems for an advanced AI evaluation project. This is a remote, project-based opportunity for experts who can design challenging problems that require computational methods to solve and mirror the full data science lifecycle - from data acquisition and processing to statistical analysis and actionable business insights.

What You'll Do

  • Design original computational data science problems that simulate real-world analytical workflows across industries (telecom, finance, government, e-commerce, healthcare)
  • Create problems requiring Python programming to solve (using pandas, numpy, scipy, sklearn, statsmodels, matplotlib, seaborn)
  • Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks)
  • Develop problems requiring non-trivial reasoning chains in data processing, statistical analysis, feature engineering, predictive modeling, and insight extraction
  • Create deterministic problems with reproducible answers - avoid stochastic elements or require fixed random seeds for exact reproducibility
  • Base problems on real business challenges: customer analytics, risk assessment, fraud detection, forecasting, optimization, and operational efficiency
  • Design end-to-end problems spanning the complete data science pipeline (data ingestion → cleaning → EDA → modeling → validation → deployment considerations)
  • Incorporate big data processing scenarios requiring scalable computational approaches
  • Verify solutions using Python with standard data science libraries and statistical methods
  • Document problem statements clearly with realistic business contexts and provide verified correct answers
AI Lab
Munich, Germany
100% remote

Freelance Civil Engineer with Python Experience (m/f/d)

A company is looking for a freelance civil engineering expert to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in civil engineering contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights.

Key responsibilities:

  • Evaluate AI models for civil engineering applications.
  • Analyze model outputs and provide feedback for improvement.
  • Collaborate with the development team to ensure alignment with industry standards.
  • Document findings and recommendations for model optimization.
  • Conduct tests to validate model performance and reliability.
AI Lab
100% remote

Ethical AI Expert (m/f/d)

A company is looking for an Ethical AI expert to deliver an online workshop in German for works council employees. The goal of the workshop is to inform participants about the ethical aspects of AI, including topics such as bias, transparency and the EU AI Act. The expert will take on the role of designing and presenting the workshop content to provide participants with a solid understanding of the ethical challenges and regulatory requirements in the field of AI.

  • Design and deliver an online workshop in German
  • Teach about ethical aspects of AI (bias, transparency, EU AI Act)
  • Adapt the content to the needs of works council employees
  • Answer questions and discuss practical examples
IT
Germany
100% remote

Freelance Kotlin Developer (m/f/d)

For an AI lab, we are looking for a Kotlin Developer to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum of a few hours/week) and those interested in full-time opportunities.

  • Code generation and code review
  • Prompt evaluation and complex data annotation
  • Training and evaluation of large language models
  • Benchmarking and agent-based code execution in sandboxed environments
  • Working across multiple programming languages
  • Adapting guidelines for new domains and use cases
  • Following project-specific rubrics and requirements
  • Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
AI Lab
100% remote

CRM Manager (f/m/d)

To strengthen data-driven cross & upsell and retention campaigns, Customer Interaction runs a platform where campaign processes, including a profiler, are developed, orchestrated, and monitored. We are looking for support in the following areas:

Analysis & Consulting

  • Functional and technical analysis of existing campaign processes
  • Advice on data flows, selections, and profiling strategies

PL/SQL Development

  • Implementation and optimization of data selection, transformation, and aggregation streams in Oracle PL/SQL
  • Mapping defined business logic for customer segmentation

Testing & Quality Assurance

  • Planning and executing unit, integration, and regression tests
  • Documentation of test cases and results

Operations & Monitoring

  • Monitoring running jobs and workflows (performance, error management)
  • Tuning SQL queries and batch processes

Communication Outputs

  • Connecting and supplying channels such as email, SMS, outbound call, and mail
  • Implementing data-driven personalization and targeting
Telecommunications
Munich, Germany
100% remote

AI Trainer for Vibe Coding (m/f/d)

An AI lab is looking for an AI Trainer for Vibe Coding. This role involves producing accurate, well-reasoned outputs across diverse domains, leveraging automation and AI tools. The position requires expertise in coding and optimizing Python scripts, handling large datasets, improving AI-generated content, and formatting and troubleshooting technical workflows. This is a remote part-time role that can be flexibly tailored to your availability, from just a few hours per week to full-time.

Key responsibilities:

  • Develop and optimize Python scripts for automation and AI tasks.
  • Handle and analyze large datasets efficiently.
  • Improve and refine AI-generated content for accuracy and quality.
  • Format and troubleshoot technical workflows to ensure smooth operations.
  • Collaborate with cross-functional teams to enhance AI tools and processes.
AI Lab
100% remote

Freelance Rust Developer (m/f/d)

For an AI lab, we are looking for a Rust Developer to train an AI model (Large Language Model - LLM). You will help the AI make sense of the world. As a consultant, you may be invited to join online projects to train the model in your area of expertise. This flexible role works for both experts seeking part-time work (minimum a few hours/week) and those interested in full-time roles.

  • Code generation and code review
  • Prompt evaluation and complex data annotation
  • Training and evaluating large language models
  • Benchmarking and agent-based code execution in sandboxed environments
  • Working across multiple programming languages (Python, JavaScript/TypeScript, Rust, SQL, etc.)
  • Adapting guidelines for new domains and use cases
  • Following project-specific rubrics and requirements
  • Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
AI Lab
100% remote

AI Agent Evaluation Analyst (m/f/d)

We are looking for a Freelance Agent Evaluation Analyst to take ownership of quality, structure, and insight across the project. This role goes far beyond task-checking - it’s about critical thinking, systems-level analysis, and ensuring clarity, reliability, and consistency at scale.

You’ll work as both a hands-on evaluator and an analyst, collaborating with domain experts, delivery managers, and engineers. Beyond reviewing outputs, you’ll be expected to understand the “why” behind the work, identify logical gaps or inconsistencies, and propose meaningful improvements. This is a flexible, impact-driven role where you’ll have space to grow, contribute ideas, and help shape how evaluation and quality are scaled across the project.

This role is especially well-suited for:

  • Analysts, researchers, or consultants with strong structuring and reasoning skills
  • Junior product managers or strategists curious about AI and evaluation work
  • Smart problem-solvers (students or early-career professionals) who enjoy digging into logic, systems, and edge cases

You do not need a coding background. What matters most is curiosity, intellectual rigor, and the ability to evaluate complex setups with precision.

What you’ll be doing:

  • Fully own the QA pipeline for agent evaluation tasks;
  • Review and validate tasks and golden paths created by scenario writers and experts;
  • Spot logical inconsistencies, vague requirements, hidden risks, and unrealistic assumptions;
  • Provide structured feedback and ensure quality alignment across contributors;
  • Train, onboard, and mentor new QA team members;
  • Collaborate with domain experts, delivery managers, and engineers to improve test clarity and coverage;
  • Maintain and improve QA checklists, SOPs, and review guidelines;
  • Contribute to test planning, prioritization, and quality benchmarks;
  • Take initiative to suggest new approaches, tools, and processes that help scale validation and analysis.
AI Studio
Amsterdam, Netherlands
100% remote

Freelance Electrical Engineer with Python Experience (m/f/d)

Generative AI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. Although every project is unique, you might typically:

  • Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Physics, while also developing references and examples of tasks.
  • Experts Acquisition: Assess the qualification tests of experts, ensuring their competency.
  • Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines.
  • Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote

Electronics Technician (m/f/d)

  • Independent troubleshooting and sustainable error resolution
  • Performing preventive maintenance tasks
  • Service and upkeep of machines and automated production systems with the latest technology
  • Documentation of work performed
  • Continuous optimization of our equipment and processes
  • Openness and appreciation for your suggestions and ideas
Media Company
Nuremberg, Germany

Chemist with Python Experience (m/f/d)

GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Chemistry, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically:

  • Generate prompts that challenge AI.
  • Define comprehensive scoring criteria to evaluate the accuracy of the AI’s answers.
  • Correct the model’s responses based on your domain-specific knowledge.
AI Lab
100% remote

Frontend Developer with Angular Experience for an HR Platform

Reach out to us if you are interested in working with us on the project.
FRATCH
Munich
90% remote

Time's up! We are no longer accepting applications.

Logistics Automation & Robotics Expert (m/f/d) Tech Due Diligence

Industry
Transportation and Logistics
Areas
Operations
Supply Chain Management

Project info

  • Period
    11.08.2025 - 15.08.2025
  • Capacity
    from 95%
  • Location
    Karlsruhe, Germany
  • Languages
    German (Advanced), English (Advanced)
  • Remote
    from 95%

Description

We are supporting a VC fund in the search for a technical expert in the areas outlined below who is available to assist for 1-2 days with a tech due diligence starting next week (from July 28). The expert will support reference and product calls with the target company’s tech team, focusing on evaluating the core technologies, their maturity, and their integration into logistics environments.

Tasks:

  • Participate in technical reference and product calls with the target company's engineering team
  • Prepare relevant technical questions in advance of the discussions
  • Assess the use of automation, robotics, and sensor technologies (e.g., LiDAR, radar, cameras)
  • Evaluate the scalability and efficiency of operational workflows in intralogistics environments
  • Analyze remote operation technologies, including control interfaces and low-latency communication systems
  • Review the integration of automation systems into existing WMS and IoT infrastructures
  • Support post-call analysis and provide a structured evaluation of the technical setup to inform investment decisions

Requirements

Must Have:

  • Automation & Robotics: Understanding of autonomous vehicle standards, robotics integration, and sensor technologies (LiDAR, radar, cameras)
  • Logistics Operations: Experience evaluating operational workflows, scalability, and efficiency within intralogistics and warehouse environments
  • Remote Operation Technologies: Experience in remote-control interfaces, low-latency communication networks, and real-time remote monitoring
  • Software & Systems Integration: Proficiency in integrating automation technologies into existing Warehouse Management Systems (WMS) and IoT platforms

Nice to Have:

  • Cybersecurity: Knowledge of security standards related to remotely operated heavy machinery to identify vulnerabilities and ensure operational integrity
  • Data Analytics & AI: Ability to assess data-driven process optimization, predictive analytics, and AI-driven safety and efficiency improvements
  • Regulatory Compliance: Familiarity with EU/German regulatory frameworks concerning autonomous and remotely operated industrial vehicles