Back to all jobs
S

Data Scientist, AI Evaluation

Scale AI

San Francisco, CA Hybrid Full-time Mid $170k - $260k

About the Role

Design and build evaluation frameworks for frontier AI models. You will develop benchmarks, analyze model outputs at scale, create statistical methodologies for measuring AI capabilities, and work directly with enterprise customers to assess model fitness for their use cases.

Requirements

  • MS in Statistics, Data Science, or related field
  • 3+ years of experience in applied data science
  • Experience with LLM evaluation and benchmarking
  • Strong statistical analysis and experimental design skills
  • Proficiency in Python and data visualization tools

Skills

PythonLLM EvaluationStatisticsBenchmarkingData Visualization
Apply Now

Via Scale AI Careers

Job Details

RoleData Scientist
LevelMid
TypeFull-time
LocationSan Francisco, CA (Hybrid)
Salary$170k - $260k
PostedMarch 24, 2026