AIML - Head of Data Science and Insights
Apple
IT, Data Science
Cupertino, CA, USA
USD 305,000-487,200 / year + Equity
Posted on May 22, 2026
Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other’s ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It’s the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you’ll do more than join something; you’ll add something.
Apple’s AIML Evaluation team is looking for a seasoned technical leader to lead our Data Science and Insights team. The organization leads Evaluation for Apple Intelligence, Siri, and a large portfolio of other billion+ user-facing features in SWE. Successful candidates will have deep experience in traditional human evaluation methodology, logging, and A/B testing, in addition to hands-on experience building and deploying LLM-based autograders and rubrics, and using these tools to proactively drive improvements in models and agentic features.
As the head of Data Science and Insights, you'll influence the direction of a wide variety of software features, models, and platforms, in close collaboration with teams across the company. Your experience will enable you to thoughtfully balance the various tradeoffs involved in creating successful features that meet Apple's high customer expectations for both quality and privacy.
- Setting the evaluation strategy that determines how Apple measures quality for Apple Intelligence, Siri, and the broader SWE portfolio of billion+ user features.
- Leading a large team of data scientists and machine learning engineers, including recruiting, developing, and retaining strong technical talent across both disciplines.
- Driving the methodological agenda across human evaluation, logging, A/B testing, and LLM-based autograders and rubrics, and ensuring those methods translate into measurable model and agentic feature improvements.
- Partnering with Apple Intelligence, Siri, and other SWE product and engineering teams to embed evaluation into product development cycles and turn evaluation results into shipped quality gains.
- Partnering with peer leaders inside AIML Evaluation platforms (AB, Annotation, Synthetic Data), Apple Foundation Models, Machine Translation, and related teams.
- Representing the team to executive leadership across AIML and SWE, including Senior Vice Presidents.
- 10+ years of experience in data science and machine learning evaluation, including 6+ years leading large technical teams
- Advanced degree in a quantitative field such as Statistics, Computer Science, Machine Learning, or similar
- Demonstrated track record of running organizations of 50+ data scientists and/or machine learning engineers
- Deep experience in human evaluation methodology, logging, and A/B testing for consumer-facing products at scale
- Hands-on experience building and deploying LLM-based autograders and rubrics, and using them to drive proactive improvements in models and agentic features
- Strong written and verbal communication skills, able to communicate effectively with engineers and senior leaders, including Senior Vice Presidents
- Experience evaluating large consumer AI products such as conversational assistants, search systems, or agentic features
- Experience with logging infrastructure and instrumentation for AI product quality measurement
- Track record of growing senior leaders from within your organization and, where needed, recruiting senior data science and machine learning talent in competitive hiring markets
- Familiarity with evaluation frameworks for agentic systems and tool use