Project details

Recommended projects

Freelance Cybersecurity Consultant for AI Red Teaming

For an AI lab we are looking for cybersecurity consultants to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities - Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. - Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. - Develop and implement automation scripts, custom tools, environments and test harnesses. - Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. - Advise on cybersecurity best practices and policy implications.
AI Lab
100% remote

Freelance Automotive Engineer (with Python) - Quality Assurance / AI Trainer

Generative AI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. Although every project is unique, you might typically: - Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Physics, while also developing references and examples of tasks. - Experts Acquisition: Assess the qualification tests of experts, ensuring their competency. - Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines. - Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote

Freelance Rust Developer (m/w/d)

For an AI lab, we are looking for a Rust Developer to train an AI model (Large Language Model - LLM). You will help the AI make sense of the world. As a consultant, you may be invited to join online projects to train the model in your area of expertise. This flexible role works for both experts seeking part-time work (minimum a few hours/week) and those interested in full-time roles. - Code generation and code review - Prompt evaluation and complex data annotation - Training and evaluating large language models - Benchmarking and agent-based code execution in sandboxed environments - Working across multiple programming languages (Python, JavaScript/TypeScript, Rust, SQL, etc.) - Adapting guidelines for new domains and use cases - Following project-specific rubrics and requirements - Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
AI Lab
100% remote

Freelance Kotlin Developer (m/w/d)

For an AI lab we are looking for Kotlin Developer to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities. - Code generation and code review - Prompt evaluation and complex data annotation - Training and evaluation of large language models - Benchmarking and agent-based code execution in sandboxed environments - Working across multiple programming languages - Adapting guidelines for new domains and use cases - Following project-specific rubrics and requirements - Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
AI Lab
100% remote

Test Manager (m/f/d)

The development and quality assurance of the data layer includes its complete provisioning through the respective web application. The data layer forms the central data foundation for analyzing user behavior and for personalized content during the website visit. To increase reliability and stability, automated tests should be used to significantly reduce manual regression tests. For this task, a Test Automation Engineer with a focus on Playwright (Elastic) is needed. - Development and implementation of automated end-to-end tests with the npm package @elastic/synthetics (Playwright) for data layer tests. - Analysis of existing test processes, identification and prioritization of automation potentials. - Creation, maintenance, and optimization of test scripts considering current best practices. - Integration of automated tests into existing CI/CD pipelines (e.g., Jenkins, GitHub Actions) to enable continuous test automation. - Documentation of test cases, test results, and test coverage in tools like Jira and Confluence. - Advising stakeholders on the selection and introduction of appropriate test strategies, test tools, and frameworks. - Conducting code reviews for test automation scripts to improve quality and maintainability. - Preparing decision templates and recommendations for action to further develop test automation. - Providing advice on error analysis and resolution within test automation. - Consulting on setting up reports and alerting with Elastic Observability. - Promoting traceability and reproducibility of test results.
Telecommunications
Munich, Germany
100% remote
New

Freelance Mechanical Engineer with Python Experience (m/w/d)

For an AI lab we are looking for Mechanical Engineer with python experience to train an AI model (Large Language Model - LLM). GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: - Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Mechanical Engineering, while also developing references and examples of tasks. - Experts Acquisition: Assess the qualification tests of experts, ensuring their competency. - Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines. - Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote

Freelance Biology Expert for AI Model Training (m/f/d)

An AI lab is looking for a freelance biology experts to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in biology (all areas) contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Evaluate AI models for biology applications. - Analyze model outputs and provide feedback for improvement. - Collaborate with the development team to ensure alignment with industry standards. - Document findings and recommendations for model optimization. - Conduct tests to validate model performance and reliability.
AI Lab
100% remote

Freelance Electrical Engineer with Python Experience (m/w/d)

Generative AI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. Although every project is unique, you might typically: - Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Physics, while also developing references and examples of tasks. - Experts Acquisition: Assess the qualification tests of experts, ensuring their competency. - Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines. - Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote

Freelance Mathematics Expert for AI Model Training (m/f/d)

An AI lab is looking for a freelance mathematics experts to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in mathematics contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Evaluate AI models for mathematics applications. - Analyze model outputs and provide feedback for improvement. - Collaborate with the development team to ensure alignment with industry standards. - Document findings and recommendations for model optimization. - Conduct tests to validate model performance and reliability.
AI Lab
100% remote

Freelance Chemistry Expert for AI Model Training (m/f/d)

An AI lab is looking for a freelance chemistry expert to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in chemistry contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Evaluate AI models for chemistry applications. - Analyze model outputs and provide feedback for improvement. - Collaborate with the development team to ensure alignment with industry standards. - Document findings and recommendations for model optimization. - Conduct tests to validate model performance and reliability.
AI Lab
100% remote

Freelance Physics Expert for AI Model Training (m/f/d)

An AI lab is looking for a freelance physics experts to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in physics contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Evaluate AI models for physics applications. - Analyze model outputs and provide feedback for improvement. - Collaborate with the development team to ensure alignment with industry standards. - Document findings and recommendations for model optimization. - Conduct tests to validate model performance and reliability.
AI Lab
100% remote

Freelance Ruby Developer (m/f/d)

For an AI lab we are looking for Ruby Developer to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities. - Code generation and code review - Prompt evaluation and complex data annotation - Training and evaluation of large language models - Benchmarking and agent-based code execution in sandboxed environments - Working across multiple programming languages (Python, JavaScript/TypeScript, Rust, SQL, etc.) - Adapting guidelines for new domains and use cases - Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
AI Lab
100% remote

Freelance Civil Engineer with Python Experience (m/f/d)

A company is looking for a freelance Civil engineering experts to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in civil engineering contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. Key responsibilities: - Evaluate AI models for civil engineering applications. - Analyze model outputs and provide feedback for improvement. - Collaborate with the development team to ensure alignment with industry standards. - Document findings and recommendations for model optimization. - Conduct tests to validate model performance and reliability.
AI Lab
100% remote

Mathematician with Python Experience (m/w/d)

For an AI lab we are looking for mathematicians with python experience to train an AI model (Large Language Model - LLM). As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities. Although every project is unique, you might typically: - Design original computational mathematics problems that simulate real mathematical research workflows. - Create problems requiring Python programming to solve (using numpy, scipy, sympy). - Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks). - Develop problems requiring non-trivial reasoning chains in areas like number theory, combinatorics, graph theory, and numerical analysis. - Base problems on real research challenges or practical applications from mathematical practice. - Verify solutions using Python with standard mathematical libraries. - Document problem statements clearly and provide verified correct answers. Support in: - Number Theory: Prime factorization, Diophantine equations, modular arithmetic, cryptographic computations. - Combinatorics: Enumerations, partitions, generating functions, combinatorial optimization. - Graph Theory: Network analysis, path finding, graph coloring, spanning trees. - Numerical Analysis: Root finding, numerical integration, differential equations, matrix computations. - Discrete Mathematics: Recurrence relations, algorithmic complexity, discrete optimization. - Algebra: Polynomial computations, group theory calculations, matrix decompositions.
AI Lab
100% remote

AI Trainer for Vibe Coding (m/w/d)

An AI Lab is looking for a AI Trainer for Vibe Coding. This role involves producing accurate, well-reasoned outputs across diverse domains, leveraging automation and AI tools. The position requires expertise in coding and optimizing Python scripts, handling large datasets, improving AI-generated content, and formatting and troubleshooting technical workflows. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Develop and optimize Python scripts for automation and AI tasks. - Handle and analyze large datasets efficiently. - Improve and refine AI-generated content for accuracy and quality. - Format and troubleshoot technical workflows to ensure smooth operations. - Collaborate with cross-functional teams to enhance AI tools and processes.
AI Lab
100% remote

Physicist with Python Experience (m/w/d)

For an AI lab we are looking for phycists with python experience to train an AI model (Large Language Model - LLM). GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as a phycist, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: - Design original computational physic problems that simulate real research workflows. - Create problems requiring Python programming to solve (using numpy, scipy, sympy). - Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks). - Develop problems requiring non-trivial reasoning chains. - Base problems on real research challenges or practical applications from physical practice. - Verify solutions using Python with standard libraries. - Document problem statements clearly and provide verified correct answers.
AI Lab
100% remote

Freelance Physics Expert (with Python) - Quality Assurance / AI Trainer

Generative AI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. Although every project is unique, you might typically: - Content Creation & Refinement: Create and refine content to ensure accuracy and relevance across a variety of topics in Physics, while also developing references and examples of tasks. - Experts Acquisition: Assess the qualification tests of experts, ensuring their competency. - Chat Moderation: Provide support by addressing project-related questions from other experts in Discord chats, especially those related to project guidelines. - Auditing Work: Review and evaluate tasks completed by other experts, ensuring they align with project guidelines. Provide constructive feedback, verify expertise-related information, and edit content as necessary to improve quality.
AI Studio
100% remote

Developer for Consent Management Implementation (m/f/d)

To replace the consent layers currently displayed on the web by third-party CMPs for our international brands, these layers need to be newly implemented so they can be maintained and delivered in-house. This requires solid knowledge of TypeScript, Vue.js and classic web display techniques (HTML and CSS). The goal is to deliver executable code that implements all requirements and includes automated tests that prove correct functionality. What exactly is the scope of work: The main focus is on developing elements to decide the approach and on implementing measures along the project course defined by this. This specifically includes the following service packages: - Implementation of code - Implementation of executable tests that must pass for delivery, test coverage >= 80% - Creation of documentation for the code - Creation of brand-specific cmp-config files. - Creation of a project (including treasury requirements) as a copy of the consent management platform. - Removal of netID references. - Creation of brand-specific settings and files for custom purposes/vendors. - Addition of new brand-specific CSS themes (variable values, logos, etc.). - Inclusion of the required official IAB GVL translations (ES, FR) in the weekly sync with the GVL - Implementation of I18n and preparation of brand-specific data sources - Implementation of PMC2.0 backend usage modules - Implementation of the playout logic - Implementation of the layer initialization process (mode=default and mode=resurface) - CDN upload and release process - Project documentation Project implementation: - The desired result should be written in TypeScript and Vue.js, build with Vite, tests with Vitest.
Telecommunications
Karlsruhe, Germany
100% remote

Freelance Java Developer (m/w/d)

For an AI lab we are looking for Java Developer to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities. - Code generation and code review - Prompt evaluation and complex data annotation - Training and evaluation of large language models - Benchmarking and agent-based code execution in sandboxed environments - Working across multiple programming languages - Adapting guidelines for new domains and use cases - Following project-specific rubrics and requirements - Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
AI Lab
100% remote

AI Agent Evaluation Analyst

For an AI lab we are looking for AI Agent Evaluation Analyst to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities - Reviewing evaluation tasks and scenarios for logic, completeness, and realism. - Identifying inconsistencies, missing assumptions, or unclear decision points. - Helping define clear expected behaviors (gold standards) for AI agents. - Annotating cause-effect relationships, reasoning paths, and plausible alternatives. - Thinking through complex systems and policies as a human would to ensure agents are tested properly. - Working closely with QA, writers, or developers to suggest refinements or edge case coverage.
AI Lab
100% remote

Frontend developer to HR platform with Angular experience

Reach out to us if you are interested in working with us on the project.
FRATCH
Munich
90% remote
Sign up to get access to more exciting projects that match your skills and preferences!

Time's up! We are no longer accepting applications.

Freelance AI Red Team Engineer

Industry
Information Technology (IT)
Areas
Information Technology (IT)
Quality Assurance (QA)
Research and Development (R&D)

Project info

  • Period
    24.11.2025 - 28.03.2026
  • Daily rate
    240 - 320€
  • Location
    Munich, Germany
  • Language
    • English
      (Advanced)
  • Remote
    from 95%

Description

GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as an AI Tutor in Coding, you’ll have the opportunity to collaborate on these projects.

Although every project is unique, you might typically:

  • Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks.
  • Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents.
  • Develop and implement automation scripts, custom tools, environments and test harnesses.
  • Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model.
  • Advise on cybersecurity best practices and policy implications.

Requirements

  • You hold a Bachelor's or Master’s Degree in Computer Science, Software Engineering, Cybersecurity, Digital Forensics or other related fields.
  • Proficient in scripting and automation using Python, Bash, or PowerShell.
  • Experienced with containerization and CI/CD security tools, especially Docker.
  • Hands-on experience with penetration testing across web, API, network, and infrastructure environments.
  • Knowledge of vulnerabilities in current AI models, including prompt injections, with knowledge of OWASP Top 10 for Large Language Models (LLMs).
  • Familiar with AI red-teaming frameworks such as garak or PyRIT.
  • Experience in AI/ML security, evaluation, and red teaming, particularly with LLMs, AI agents, and RAG pipelines.
  • Proficient in offensive exploitation and exploit development.
  • Skilled in reverse engineering using tools like Ghidra or equivalents.
  • Expertise in network and application security, including web application security.
  • Knowledge of operating system security concepts such as Linux privilege escalation and Windows internals.
  • Familiar with secure coding practices for full-stack development.
  • You are ready to learn new methods, able to switch between tasks and topics quickly and sometimes work with challenging, complex guidelines.
  • Our freelance role is fully remote so, you just need a laptop, internet connection, time available and enthusiasm to take on a challenge.