AI Evaluation Specialist
AI & Data Science
Full-time
Hybrid
AI Evaluation Specialists design and conduct systematic assessments of AI system performance, safety, and alignment, developing benchmark suites, red-teaming frameworks, and automated evaluation pipelines that measure model capabilities and risks. They create human evaluation protocols, analyze failure modes, and produce evaluation reports that inform model development decisions and regulatory compliance. This quality assurance function for AI systems is essential at both AI development organizations and enterprise AI deployers.
Upload your CV
Get an ATS compatibility score and personalized interview practice