AI Evaluation Specialist

AI & Data Science

Full-time

Hybrid

AI Evaluation Specialists design and conduct systematic assessments of AI system performance, safety, and alignment. They develop benchmark suites, red-teaming frameworks, and automated evaluation pipelines that measure model capabilities and risks. They also create human evaluation protocols, analyze failure modes, and produce evaluation reports that inform model development decisions and support regulatory compliance. This quality assurance function is essential both at organizations that develop AI and at enterprises that deploy it.

