AI Agent Evaluation Analyst with expertise in training Large Language Models (LLMs); reviewing evaluation tasks for logic, completeness, and realism; identifying inconsistencies; defining gold standards; annotating reasoning paths; and applying analytical thinking to complex systems and policies.
2025 © FRATCH.IO GmbH. All rights reserved.