Log in
Sign up
Project details
Recommended projects
AI Agent Evaluation Analyst (m/f/d)
We are looking for an Freelance Agent Evaluation Analyst to take ownership of quality, structure, and insight across the project. This role goes far beyond task-checking - it’s about critical thinking, systems-level analysis, and ensuring clarity, reliability, and consistency at scale. You’ll work as both a hands-on evaluator and an analyst, collaborating with domain experts, delivery managers, and engineers. Beyond reviewing outputs, you’ll be expected to understand the “why” behind the work, identify logical gaps or inconsistencies, and propose meaningful improvements. This is a flexible, impact-driven role where you’ll have space to grow, contribute ideas, and help shape how evaluation and quality are scaled across the project. This role is especially well-suited for: Analysts, researchers, or consultants with strong structuring and reasoning skills Junior product managers or strategists curious about AI and evaluation work Smart problem-solvers (students or early-career professionals) who enjoy digging into logic, systems, and edge cases You do not need a coding background. What matters most is curiosity, intellectual rigor, and the ability to evaluate complex setups with precision. What you’ll be doing - Fully own the QA pipeline for agent evaluation tasks; - Review and validate tasks and golden paths created by scenario writers and experts; - Spot logical inconsistencies, vague requirements, hidden risks, and unrealistic assumptions; - Provide structured feedback and ensure quality alignment across contributors; Train, onboard, and mentor new QA team members; - Collaborate with domain experts, delivery managers, and engineers to improve test clarity and coverage; - Maintain and improve QA checklists, SOPs, and review guidelines; - Contribute to test planning, prioritization, and quality benchmarks; - Take initiative to suggest new approaches, tools, and processes that help scale validation and analysis.
Freelance AI Consultant (German) (m/w/d)
For our client, we're looking for a German-speaking AI consultant: As a consultant, you might be invited to take part in online projects to train models in your area of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum a few hours/week) and those interested in full-time opportunities. Responsibilities: - Carefully review provided data (text, images, or videos). - Review tasks submitted by the development team and ensure quality assurance/quality control. - Label or classify content based on project guidelines. - Identify and flag factually incorrect, sensitive, inappropriate, or unclear material.
Freelance AI Consultant (Japanese) (m/f/d)
For our client we are looking for a Japanese speaking AI consultant: As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum of a few hours per week) and those interested in full-time opportunities. Responsibilities: - Carefully review provided data (text, images, or videos). - Review tasks submitted by the developer team and ensure quality assurance/quality control. - Label or classify content based on project guidelines. - Identify and flag factually incorrect, sensitive, inappropriate, or unclear material.
Freelance Data Annotator (Spanish) (m/f/d)
For an AI studio we are looking for a Spanish speaking data annotation specialist: Annotation is what helps AI make sense of the world. As a QA Annotator, you may be invited to take part in online projects such as rating AI-generated content, evaluating factual accuracy, or comparing responses — when projects are available. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities Responsibilities: - Carefully review provided data (text, images, or videos). - Review tasks submitted by the Annotators team and ensure quality assurance/quality control. - Label or classify content based on project guidelines. - Identify and flag factually incorrect, sensitive, inappropriate, or unclear material.
AI Trainer for Vibe Coding (m/f/d)
An AI Lab is looking for an AI Trainer for Vibe Coding. This role involves producing accurate, well-reasoned outputs across diverse domains, leveraging automation and AI tools. The position requires expertise in coding and optimizing Python scripts, handling large datasets, improving AI-generated content, and formatting and troubleshooting technical workflows. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Develop and optimize Python scripts for automation and AI tasks. - Handle and analyze large datasets efficiently. - Improve and refine AI-generated content for accuracy and quality. - Format and troubleshoot technical workflows to ensure smooth operations. - Collaborate with cross-functional teams to enhance AI tools and processes.
Freelance AI Consultant (Korean) (m/f/d)
For our client we are looking for a Korean speaking AI consultant: As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities. Responsibilities: - Carefully review provided data (text, images, or videos). - Review tasks submitted by the developer team and ensure quality assurance/quality control. - Label or classify content based on project guidelines. - Identify and flag factually incorrect, sensitive, inappropriate, or unclear material.
Freelance Consultant - AI Training (Portugese-Speaking)
For an AI lab we are looking for a Portugese speaking freelance consultants to train an AI model (Large Language Model - LLM) in various domains: You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities Responsibilities: - Carefully review analyze provided data by AI in your domain of expertise. - Improve the model in your domain of expertise. - Review AI results and ensure quality assurance/quality control. - Label or classify content based on project guidelines.
Freelance AI Consultant (Chinese) (m/f/d)
For our client we are looking for a Chinese-speaking AI consultant: As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum a few hours/week) and those interested in full-time opportunities. Responsibilities: - Carefully review provided data (text, images, or videos). - Review tasks submitted by the developer team and ensure quality assurance/quality control. - Label or classify content based on project guidelines. - Identify and flag factually incorrect, sensitive, inappropriate, or unclear material.
Freelance Ruby Developer (m/f/d)
For an AI lab we are looking for Ruby Developer to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities. - Code generation and code review - Prompt evaluation and complex data annotation - Training and evaluation of large language models - Benchmarking and agent-based code execution in sandboxed environments - Working across multiple programming languages (Python, JavaScript/TypeScript, Rust, SQL, etc.) - Adapting guidelines for new domains and use cases - Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
Freelance Cybersecurity Consultant for AI Red Teaming
For an AI lab we are looking for cybersecurity consultants to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities - Evaluate and red team AI models and agents and machine learning systems for vulnerabilities and safety risks. - Create offline reproducible & auto-evaluable test cases to test safety & capability of AI agents. - Develop and implement automation scripts, custom tools, environments and test harnesses. - Lead or contribute to security research initiatives, especially in AI safety, creating and implementing realistic and challenging attack scenarios for the model. - Advise on cybersecurity best practices and policy implications.
Developer for Consent Management Implementation (m/f/d)
For replacing the previously displayed consent layers by third-party CMPs on the web for our international brands, these layers will be reimplemented so they can be maintained and deployed in-house. This requires solid knowledge of Typescript, Vue.js, and classic web rendering technologies (HTML and CSS). The goal is to deliver executable code that implements all requirements and includes automated tests proving correct functionality. What exactly is the scope of the assignment: The main focus is on developing elements for a decision-making template on the approach and on implementing measures along the designed project course. Concretely, this includes the following work packages: - Implementation of code - Implementation of executable tests, which must pass for delivery, with test coverage >= 80% - Creation of documentation for the code - Creation of brand-specific cmp-config files. - Creation of a project (including asset management requirements) as a copy of the consent management platform. - Removal of netID references. - Creation of brand-specific settings and files for custom purposes/vendors. - Adding new brand-specific CSS themes (variable values, logos, etc.). - Inclusion of required official IAB GVL translations (ES, FR) in the weekly GVL synchronization. - Implementation of I18n and preparation of brand-specific data sources. - Implementation of PMC2.0 backend usage modules. - Implementation of playout logic. - Implementation of the layer initialization process (mode=default and mode=resurface). - CDN upload and release process. - Project documentation Project implementation: - The desired result should be written in Typescript and Vue.js, build via Vite, tests via Vitest.
New
Senior Web Developer (m/f/d)
- You develop modern, high-performance web frontends with React, TypeScript, HTML, and CSS - You implement responsive designs with a focus on accessibility and performance - You plan and run unit and integration tests (for example with Playwright) - Troubleshooting in development, test, or live environments
Freelance Java Developer (f/m/d)
For an AI lab we are looking for Java Developer to train an AI model (Large Language Model - LLM). You help AI to make sense of the world. As consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (minimum few hours/week) and those interested in full-time opportunities. - Code generation and code review - Prompt evaluation and complex data annotation - Training and evaluation of large language models - Benchmarking and agent-based code execution in sandboxed environments - Working across multiple programming languages - Adapting guidelines for new domains and use cases - Following project-specific rubrics and requirements - Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
Freelance Rust Developer (m/f/d)
For an AI lab we are looking for a Rust Developer to train an AI model (Large Language Model (LLM)). You help AI to make sense of the world. As a consultant, you may be invited to take part in online projects to train the model in your domain of expertise. This flexible role accommodates both experts seeking part-time engagement (at least a few hours per week) and those interested in full-time opportunities. - Code generation and code review - Prompt evaluation and complex data annotation - Training and evaluation of large language models - Benchmarking and agent-based code execution in sandboxed environments - Working across multiple programming languages (Python, JavaScript/TypeScript, Rust, SQL, etc.) - Adapting guidelines for new domains and use cases - Following project-specific rubrics and requirements - Collaborating with project leads, solution engineers, and supply managers on complex or experimental projects
Freelance Mathematics Expert for AI Model Training (m/f/d)
An AI lab is looking for a freelance mathematics experts to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in mathematics contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Evaluate AI models for mathematics applications. - Analyze model outputs and provide feedback for improvement. - Collaborate with the development team to ensure alignment with industry standards. - Document findings and recommendations for model optimization. - Conduct tests to validate model performance and reliability.
Freelance Chemistry Expert for AI Model Training (m/f/d)
An AI lab is looking for a freelance chemistry experts to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in chemistry contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Evaluate AI models for chemistry applications. - Analyze model outputs and provide feedback for improvement. - Collaborate with the development team to ensure alignment with industry standards. - Document findings and recommendations for model optimization. - Conduct tests to validate model performance and reliability.
Freelance Physics Expert for AI Model Training (m/f/d)
An AI lab is looking for a freelance physics experts to evaluate AI models. The goal of the project is to assess the performance, accuracy, and reliability of AI models applied in physics contexts. The role involves working closely with the development team to ensure the models meet industry standards and provide actionable insights. This is a remote part-time role that can be flexibly tailored to your availability – from just a few hours per week to full-time. Key responsibilities: - Evaluate AI models for physics applications. - Analyze model outputs and provide feedback for improvement. - Collaborate with the development team to ensure alignment with industry standards. - Document findings and recommendations for model optimization. - Conduct tests to validate model performance and reliability.
Physicist with Python Experience (m/w/d)
For an AI lab we are looking for physicists with Python experience to train an AI model (Large Language Model - LLM). GenAI models are improving very quickly, and one of our goals is to make them capable of addressing specialized questions and achieving complex reasoning skills. If you join the platform as a physicist, you’ll have the opportunity to collaborate on these projects. Although every project is unique, you might typically: - Design original computational physics problems that simulate real research workflows. - Create problems requiring Python programming to solve (using numpy, scipy, sympy). - Ensure problems are computationally intensive and cannot be solved manually within reasonable timeframes (days/weeks). - Develop problems requiring non-trivial reasoning chains. - Base problems on real research challenges or practical applications from physical practice. - Verify solutions using Python with standard libraries. - Document problem statements clearly and provide verified correct answers.
New
Fullstack Engineer (m/f/d)
- Product and web development in the data-driven area - Shaping the software architecture for new data products - Collaborating in interdisciplinary teams (e.g. with data scientists and business developers)
New
Electronics Technician (m/f/d)
- Independent troubleshooting and sustainable error resolution - Performing preventive maintenance tasks - Service and upkeep of machines and automated production systems with the latest technology - Documentation of work performed - Continuous optimization of our equipment and processes - Openness and appreciation for your suggestions and ideas
Frontend developer to HR platform with Angular experience
Reach out to us if you are interested in working with us on the project.
Sign up
to get access to more exciting projects that match your skills and preferences!
Time's up! We are no longer accepting applications.
Similar projects
Data Migration Specialist (m/f/d)
Industry
Information Technology (IT)
Areas
Information Technology (IT)
Quality Assurance (QA)
Project info
Period
03.03.2025 - 30.06.2025
Capacity
from 95%
Daily rate
750 - 850€
Location
Berlin, Germany
Languages
German
(Advanced)
,
English
(Advanced)
Remote
from 95%
Description
Drive migration activities for assigned objects, ensuring timely completion and quality standards.
Prepare value mappings and execute MOCK and Production data loads according to the defined timeline.
Support the data migration closure process and load file archiving activities
Request, review, analyze, and communicate regular data quality reports (washing machine) to key stakeholders.
Conduct Data Verification Tests (DVTs) and prepare necessary templates for dual maintenance activities.
Identify, document, and raise defects for bugs or new functionalities related to assigned objects.
Manage Hypercare defects and change requests within the area of responsibility.
Perform hands-on tools testing, including test data preparation and fixing data for program testing activities (e.g., user acceptance tests).
Requirements
Proven experience in data migration, including tools testing, MOCK/Production data loading, and DVT execution.
Strong understanding of data quality management and defect tracking processes.
Ability to adhere to timelines, manage multiple tasks, and prioritize responsibilities effectively.
Experience with cutover planning, Hypercare support, and security audit processes.