Results-driven AI Data & QA Specialist with demonstrated expertise in LLM behavior testing, multimodal data annotation, and evaluation of AI model outputs across education, productivity, and consumer technology domains. Proven success in designing scalable annotation systems, applying robust evaluation frameworks, and ensuring 98%+ accuracy across complex multimodal datasets.
Recognized for precision and cross-functional collaboration in improving model reasoning fidelity, content safety, and task compliance. Adept at taxonomy design, cross-build consistency evaluation, rubric-driven scoring, and prompt engineering to assess and refine generative model performance. Skilled in leveraging structured QA processes, automation tools, and experimental analysis to deliver measurable improvements in data quality and model performance.
2025 © FRATCH.IO GmbH. All rights reserved.