Project details

Recommended projects

AI Agent Evaluation Analyst

For an AI lab we are looking for AI Agent Evaluation Analyst to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities - Reviewing evaluation tasks and scenarios for logic, completeness, and realism. - Identifying inconsistencies, missing assumptions, or unclear decision points. - Helping define clear expected behaviors (gold standards) for AI agents. - Annotating cause-effect relationships, reasoning paths, and plausible alternatives. - Thinking through complex systems and policies as a human would to ensure agents are tested properly. - Working closely with QA, writers, or developers to suggest refinements or edge case coverage.
AI Lab
100% remote

Freelance AI Trainer - Writers (English) (m/w/d)

An AI Lab is seeking professionals experienced in working with tests to join their innovative team as English AI Trainers. The role involves crafting and editing texts, as well as evaluating AI-generated replies to ensure quality and accuracy. This position is ideal for individuals with expertise in writing, editing, and analyzing content, particularly in the context of AI and language models. As part of a cutting-edge AI lab, you will contribute to the development and refinement of advanced AI systems. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Craft and edit high-quality texts tailored to specific requirements. - Evaluate and analyze AI-generated replies for accuracy, relevance, and quality. - Collaborate with teams to improve AI language models and content generation processes. - Provide feedback and suggestions to enhance AI performance. - Conduct research to ensure content aligns with industry standards and user expectations.
AI Lab
100% remote

Freelance Automotive Engineer (with Python) - Quality Assurance / AI Trainer

Generative AI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. Although every project is unique, you might typically: - Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Physics, while also developing references and examples of tasks. - Experts Acquisition: Assess the qualification tests of experts, ensuring their competency. - Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines. - Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote

Freelance Biology Expert for AI Model Training (m/f/d)

An AI lab is looking for a freelance biology experts to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in biology (all areas) contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Evaluate AI models for biology applications. - Analyze model outputs and provide feedback for improvement. - Collaborate with the development team to ensure alignment with industry standards. - Document findings and recommendations for model optimization. - Conduct tests to validate model performance and reliability.
AI Lab
100% remote

Freelance Chemistry Expert for AI Model Training (m/f/d)

An AI lab is looking for a freelance chemistry experts to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in chemistry contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Evaluate AI models for chemistry applications. - Analyze model outputs and provide feedback for improvement. - Collaborate with the development team to ensure alignment with industry standards. - Document findings and recommendations for model optimization. - Conduct tests to validate model performance and reliability.
AI Lab
100% remote

AI Evaluation Consultant (m/w/d)

We are seeking an analytical and technically-minded professional to: - Evaluate AI outputs and processes - Ensure quality, accuracy, and reliability - Identify logical errors, risks, and structural inconsistencies - Provide actionable insights and recommendations to the team Ideal candidates: - Consultants, auditors, analysts, data researchers, or business/technical analysts with strong reasoning skills - Professionals curious about AI, process improvement, and quality evaluation - Problem-solvers who enjoy analyzing complex systems, logic, and scenarios Key Responsibilities: - Lead evaluation of AI outputs and related processes - Review tasks against expected/ideal scenarios; identify gaps and risks - Provide structured, actionable recommendations to engineers, domain experts, and managers - Maintain and improve evaluation guidelines, checklists, SOPs - Suggest new approaches, tools, and processes to enhance AI evaluation
AI Labs
100% remote

Freelance Mechanical Engineer with Python Experience (m/w/d)

For an AI lab we are looking for Mechanical Engineer with python experience to train an AI model (Large Language Model - LLM). GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: - Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Mechanical Engineering, while also developing references and examples of tasks. - Experts Acquisition: Assess the qualification tests of experts, ensuring their competency. - Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines. - Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote

Freelance Civil Engineer with Python Experience (m/f/d)

A company is looking for a freelance Civil engineering experts to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in civil engineering contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. Key responsibilities: - Evaluate AI models for civil engineering applications. - Analyze model outputs and provide feedback for improvement. - Collaborate with the development team to ensure alignment with industry standards. - Document findings and recommendations for model optimization. - Conduct tests to validate model performance and reliability.
AI Lab
100% remote

Freelance Physics Expert (with Python) - Quality Assurance / AI Trainer

Generative AI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. Although every project is unique, you might typically: - Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Physics, while also developing references and examples of tasks. - Experts Acquisition: Assess the qualification tests of experts, ensuring their competency. - Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines. - Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote
New

AI Consultant - Machine Learning (m/w/d)

For an AI lab we are looking for Machine learning experts to train an AI model (Large Language Model - LLM). GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: - Design original computational STEM problems that simulate real scientific workflows - Create problems that require Python programming to solve - Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks) - Develop problems requiring non-trivial reasoning chains and creative problem-solving approaches - Verify solutions using Python with standard libraries (numpy, pandas, scipy, sklearn) - Document problem statements clearly and provide verified correct answers
AI Lab
100% remote

Freelance Ruby Developer (m/f/d)

For an AI lab we are looking for Ruby Developer to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities. - Code generation and code review - Prompt evaluation and complex data annotation - Training and evaluation of large language models - Benchmarking and agent-based code execution in sandboxed environments - Working across multiple programming languages (Python, JavaScript/TypeScript, Rust, SQL, etc.) - Adapting guidelines for new domains and use cases - Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
AI Lab
100% remote

Freelance Statistics Expert with Python Experience (m/f/d)

For an AI lab we are looking for Statistics Expert with python experience to train an AI model (Large Language Model - LLM). GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: - Generate prompts that challenge AI. - Define comprehensive scoring criteria to evaluate the accuracy of the AI’s answers. - Correct the model’s responses based on your domain-specific knowledge.
AI Lab
100% remote

Freelance Electrical Engineer with Python Experience (m/w/d)

Generative AI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. Although every project is unique, you might typically: - Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Physics, while also developing references and examples of tasks. - Experts Acquisition: Assess the qualification tests of experts, ensuring their competency. - Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines. - Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote

Freelance Editor (m/f/d)

- You create topic briefs, research and write expert-level (guide) texts in a sophisticated style, and edit the contributions of our freelance authors - The topics target hobby gardeners in the garden and plant sector, as well as home living and furnishing, design and decor, DIY, and also cooking and nutrition - In close exchange with colleagues, readers and experts, you develop exciting topics and present them tailored to the target audience - You also maintain and expand media contacts, and order photo materials for the garden, home and decor areas - Optionally, you organize and carry out photo shoots, and attend press appointments and trade fairs
Media Company
Munich, Germany
50% remote

Mathematician with Python Experience (m/w/d)

For an AI lab we are looking for mathematicians with python experience to train an AI model (Large Language Model - LLM). As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities. Although every project is unique, you might typically: - Design original computational mathematics problems that simulate real mathematical research workflows. - Create problems requiring Python programming to solve (using numpy, scipy, sympy). - Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks). - Develop problems requiring non-trivial reasoning chains in areas like number theory, combinatorics, graph theory, and numerical analysis. - Base problems on real research challenges or practical applications from mathematical practice. - Verify solutions using Python with standard mathematical libraries. - Document problem statements clearly and provide verified correct answers. Support in: - Number Theory: Prime factorization, Diophantine equations, modular arithmetic, cryptographic computations. - Combinatorics: Enumerations, partitions, generating functions, combinatorial optimization. - Graph Theory: Network analysis, path finding, graph coloring, spanning trees. - Numerical Analysis: Root finding, numerical integration, differential equations, matrix computations. - Discrete Mathematics: Recurrence relations, algorithmic complexity, discrete optimization. - Algebra: Polynomial computations, group theory calculations, matrix decompositions.
AI Lab
100% remote

Physicist with Python Experience (m/w/d)

For an AI lab we are looking for phycists with python experience to train an AI model (Large Language Model - LLM). GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as a phycist, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: - Design original computational physic problems that simulate real research workflows. - Create problems requiring Python programming to solve (using numpy, scipy, sympy). - Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks). - Develop problems requiring non-trivial reasoning chains. - Base problems on real research challenges or practical applications from physical practice. - Verify solutions using Python with standard libraries. - Document problem statements clearly and provide verified correct answers.
AI Lab
100% remote

AI Agent Evaluation Analyst (m/f/d)

We are looking for a Freelance Agent Evaluation Analyst to take ownership of quality, structure, and insight across the project. This role goes far beyond task-checking - it’s about critical thinking, systems-level analysis, and ensuring clarity, reliability, and consistency at scale. You’ll work as both a hands-on evaluator and an analyst, collaborating with domain experts, delivery managers, and engineers. Beyond reviewing outputs, you’ll be expected to understand the “why” behind the work, identify logical gaps or inconsistencies, and propose meaningful improvements. This is a flexible, impact-driven role where you’ll have space to grow, contribute ideas, and help shape how evaluation and quality are scaled across the project. This role is especially well-suited for: Analysts, researchers, or consultants with strong structuring and reasoning skills Junior product managers or strategists curious about AI and evaluation work Smart problem-solvers (students or early-career professionals) who enjoy digging into logic, systems, and edge cases You do not need a coding background. What matters most is curiosity, intellectual rigor, and the ability to evaluate complex setups with precision. What you’ll be doing - Fully own the QA pipeline for agent evaluation tasks; - Review and validate tasks and golden paths created by scenario writers and experts; - Spot logical inconsistencies, vague requirements, hidden risks, and unrealistic assumptions; - Provide structured feedback and ensure quality alignment across contributors; Train, onboard, and mentor new QA team members; - Collaborate with domain experts, delivery managers, and engineers to improve test clarity and coverage; - Maintain and improve QA checklists, SOPs, and review guidelines; - Contribute to test planning, prioritization, and quality benchmarks; - Take initiative to suggest new approaches, tools, and processes that help scale validation and analysis.
AI Studio
Amsterdam, Netherlands
100% remote

Freelance Java Developer (m/f/d)

For an AI lab we are looking for Java Developer to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities. - Code generation and code review - Prompt evaluation and complex data annotation - Training and evaluation of large language models - Benchmarking and agent-based code execution in sandboxed environments - Working across multiple programming languages - Adapting guidelines for new domains and use cases - Following project-specific rubrics and requirements - Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
AI Lab
100% remote

AI Consultants - Data Science (m/w/d)

We are seeking experienced data scientists to create computationally intensive data science problems for an advanced AI evaluation project. This is a remote, project-based opportunity for experts who can design challenging problems that require computational methods to solve and mirror the full data science lifecycle - from data acquisition and processing to statistical analysis and actionable business insights. What You'll Do - Design original computational data science problems that simulate real-world analytical workflows across industries (telecom, finance, government, e-commerce, healthcare) Create problems requiring Python programming to solve (using pandas, numpy, scipy, sklearn, statsmodels, matplotlib, seaborn) - Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks) - Develop problems requiring non-trivial reasoning chains in data processing, statistical analysis, feature engineering, predictive modeling, and insight extraction - Create deterministic problems with reproducible answers - avoid stochastic elements or require fixed random seeds for exact reproducibility - Base problems on real business challenges: customer analytics, risk assessment, fraud detection, forecasting, optimization, and operational efficiency - Design end-to-end problems spanning the complete data science pipeline (data ingestion → cleaning → EDA → modeling → validation → deployment considerations) - Incorporate big data processing scenarios requiring scalable computational approaches - Verify solutions using Python with standard data science libraries and statistical methods - Document problem statements clearly with realistic business contexts and provide verified correct answers
AI Lab
Munich, Germany
100% remote

Dentist for Training AI Models (m/w/d)

For an AI lab we are looking for German-speaking dentists to train an AI model (Large Language Model - LLM). As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities. Although every project is unique, you might typically: - Collaborate with the AI lab to provide domain-specific knowledge in dentistry. - Participate in online training sessions to enhance the AI model's understanding. - Review and validate AI-generated content for accuracy and relevance. - Offer insights and feedback to improve the model's performance. - Engage in flexible project-based work, adapting to unique project requirements.
AI Lab
100% remote

Frontend developer to HR platform with Angular experience

Reach out to us if you are interested in working with us on the project.
FRATCH
Munich
90% remote
Sign up to get access to more exciting projects that match your skills and preferences!

Time's up! We are no longer accepting applications.

Freelance Economics Expert - AI Trainer

Industry
Information Technology (IT)
Areas
Accounting
Product Development
Research and Development (R&D)

Project info

  • Daily rate
    320 - 440€
  • Language
    • English
      (Advanced)
  • Remote
    100%

Description

For an AI lab we are looking for an Economics expert to train an AI model

GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills.

If you join you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically:

You will create complex, realistic tasks that push frontier AI agents to their limits. Think scattered data, conditional procedures, and genuine domain expertise required. You'll build a detailed version with objective scoring, then write an ambiguous version intended to train the agent to succeed with less hand-holding. Real expert complexity only. You're improving the AI tools you'll eventually use yourself.

If you have the relevant experience and are ready to take on this challenging and engaging project, join us!

Requirements

  • You hold a Bachelor’s, Master’s or PhD Degree in Economics or relevant fields with a strong GPA (3.5-4).
  • You have a professional industry experience in accounting with a minimum of 3 years in relevant economics fields (Economics Experts, Analysts, researchers, or consultants).
  • Your level of English is advanced (C1) or above.
  • You are able to write clearly and professionally, including explaining complex tasks in simple, structured language as well as analyze and synthesize information from multiple sources and turn it into accurate, coherent outputs.
  • You have excellent analytical thinking and strong attention to detail skills.
  • You bring creativity in designing realistic and engaging examples, cases, or workflows based on your domain knowledge.
  • You have exposure to LLMs, prompt engineering, or AI-generated content with some understanding of how scoring or evaluation works in agent testing (precision, coverage, etc.).
  • You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.
  • Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.
  • presence of valid accounting related certifications - Examples of professional certifications are Ch.E., CBE, CEcD, PRM, CPPA, CAIA, CMT, CFP, CEA, CFM, CMA, CGMA, CQF, CBV, CFE, CSM, CIPM, CSCA, ChFC, FMVA, ACCA, CPA, ICAEW, CIMA, FRM, CIA, CFA.
  • Technical skills as a bonus: R, Stata, Python, MATLAB, especially for applied or research-intensive work.

Application process

Apply on the FRATCH platform. If selected, you will receive a short test from our client.