Evaluation Scenario Writer (m/w/d)
Project info
- Daily rate: 290 - 640 €
- Language: English (Advanced)
- Remote: 100%
Description
We’re looking for someone who can design realistic and structured evaluation scenarios for LLM-based agents. You’ll create test cases that simulate human-performed tasks and define gold-standard behavior to compare agent actions against. You’ll work to ensure each scenario is clearly defined, well-scored, and easy to execute and reuse. You’ll need a sharp analytical mindset, attention to detail, and an interest in how AI agents make decisions.
Although every project is unique, you might typically:
- Design structured test scenarios based on real-world tasks.
- Define the golden path and acceptable agent behavior.
- Annotate task steps, expected outputs, and edge cases.
- Work with developers to test your scenarios and improve clarity.
- Review agent outputs and adapt tests accordingly.
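To make the above concrete, here is a minimal sketch of what a structured scenario with a gold path and simple scoring logic could look like. All field names, task content, and the scoring rule are hypothetical illustrations, not a prescribed format:

```python
# Hypothetical evaluation scenario: a task, its gold-standard action
# sequence, and edge cases to annotate. The structure is illustrative only.
scenario = {
    "id": "book-flight-001",
    "task": "Book the cheapest direct flight from Berlin to Paris.",
    "gold_path": [
        "search_flights(origin='BER', destination='CDG', direct=True)",
        "sort_results(by='price')",
        "book_flight(result_index=0)",
    ],
    "edge_cases": ["no direct flights available", "ambiguous date"],
}

def score_trace(agent_trace: list[str], gold_path: list[str]) -> float:
    """Fraction of gold-path steps the agent performed, in order."""
    matched = 0
    for step in agent_trace:
        if matched < len(gold_path) and step == gold_path[matched]:
            matched += 1
    return matched / len(gold_path)

# An agent trace that follows the gold path exactly scores 1.0.
trace = [
    "search_flights(origin='BER', destination='CDG', direct=True)",
    "sort_results(by='price')",
    "book_flight(result_index=0)",
]
print(score_trace(trace, scenario["gold_path"]))  # 1.0
```

In practice such a scenario would typically live in a JSON or YAML file rather than inline Python, and the matching rule (exact string equality here) would be chosen per scenario.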
Requirements
- Bachelor's and/or Master's Degree in Computer Science, Software Engineering, Data Science / Data Analytics, Artificial Intelligence / Machine Learning, Computational Linguistics / Natural Language Processing (NLP), Information Systems, or other related fields.
- Background in QA, software testing, data analysis, or NLP annotation.
- Good understanding of test design principles (e.g., reproducibility, coverage, edge cases).
- Strong written communication skills in English.
- Comfortable with structured formats like JSON/YAML for scenario description.
- Can define expected agent behaviors (gold paths) and scoring logic.
- Basic experience with Python and JavaScript.
- Curious and open to working with AI-generated content, agent logs, and prompt-based behavior.
- Ready to learn new methods, able to switch between tasks and topics quickly, and comfortable working at times with challenging, complex guidelines.
- Our freelance role is fully remote, so you just need a laptop, an internet connection, available time, and enthusiasm to take on a challenge.
Nice to Have
- Experience in writing manual or automated test cases.
- Familiarity with LLM capabilities and typical failure modes.
- Understanding of scoring metrics (precision, recall, coverage, reward functions).
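For the metrics mentioned above, a rough sketch of how precision and recall might be computed over an agent's actions against a gold action set (a simplified set-based view; real scenarios may require ordered or fuzzy matching, and the action names are hypothetical):

```python
# Sketch: precision and recall of an agent's actions versus a gold set.
# Actions are treated as an unordered set for simplicity.

def precision_recall(agent_actions: set[str],
                     gold_actions: set[str]) -> tuple[float, float]:
    hits = agent_actions & gold_actions          # correctly performed actions
    precision = len(hits) / len(agent_actions) if agent_actions else 0.0
    recall = len(hits) / len(gold_actions) if gold_actions else 0.0
    return precision, recall

# Agent performed one extra, unneeded action ("email"):
p, r = precision_recall({"search", "sort", "book", "email"},
                        {"search", "sort", "book"})
print(p, r)  # 0.75 1.0
```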