You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise.

This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities

Reviewing evaluation tasks and scenarios for logic, completeness, and realism.
Identifying inconsistencies, missing assumptions, or unclear decision points.
Helping define clear expected behaviors (gold standards) for AI agents.
Annotating cause-effect relationships, reasoning paths, and plausible alternatives.
Thinking through complex systems and policies as a human would to ensure agents are tested properly.
Working closely with QA, writers, or developers to suggest refinements or edge case coverage.

Excellent analytical thinking: Can reason about complex systems, scenarios, and logical implications.
Strong attention to detail: Can spot contradictions, ambiguities, and vague requirements.
Familiarity with structured data formats: Can read, not necessarily write JSON/YAML.
Can assess scenarios holistically: What's missing, what’s unrealistic, what might break?
Experience with policy evaluation, logic puzzles, case studies, or structured scenario design.
Background in consulting, academia, olympiads (e.g. logic/math/informatics), or research.
Exposure to LLMs, prompt engineering, or AI-generated content.
Familiarity with QA or test-case thinking (edge cases, failure modes, “what could go wrong”).

Application Process:

If you are being selected you will by invited to an interview by Mindrift.

Project details

Recommended projects

AI Agent Evaluation Analyst

AI Evaluation Consultant (m/w/d)

Freelance Automotive Engineer (with Python) - Quality Assurance / AI Trainer

Freelance Chemistry Expert for AI Model Training (m/f/d)

AI Consultant - Machine Learning (m/w/d)

Freelance Mechanical Engineer with Python Experience (m/w/d)

Freelance Ruby Developer (m/f/d)

AI Agent Evaluation Analyst (m/f/d)

Freelance Biology Expert for AI Model Training (m/f/d)

Freelance Statistics Expert with Python Experience (m/f/d)

Business Analyst – SAP S/4HANA Output Management (m/f/d)

Freelance Electrical Engineer with Python Experience (m/w/d)

Mathematician with Python Experience (m/w/d)

Freelance AI Trainer - Writers (English) (m/f/d)

Freelance Civil Engineer with Python Experience (m/f/d)

Physicist with Python Experience (m/w/d)

Senior Project Manager Customer Interaction

Freelance Java Developer (m/f/d)

Dentist for Training AI Models (m/f/d)

Freelance Physics Expert (with Python) - Quality Assurance / AI Trainer

AI Consultants - Data Science (m/w/d)

Freelance Cybersecurity Consultant for AI Red Teaming

ERP-Transformation Manager (m/w/d)

Product Manager POS / Cash Register Systems (m/f/d)

Expert for Setting Up a Call Center

Biologist with Python Experience (m/w/d)

Expert in Ethical AI (m/f/d)

AI Consultant for Vibe Coding (m/w/d)

Developer for Consent Management Implementation (m/f/d)

Project Manager Magazines / Magazine Production (m/f/d)

Cyber Risk Consulting (Senior Level)

Frontend developer to HR platform with Angular experience

AI Agent Evaluation Analyst

Project info

Description

Requirements