Machine Learning - Data Scientist Lead

Apple

Apple

Software Engineering, Data Science

Sunnyvale, CA, USA

USD 181,100-318,400 / year + Equity

Posted on Apr 30, 2026
Do you have a passion for computer vision and deep learning? Are you excited by the latest advances in multimodal models? The Video Engineering Data Analytics and Quality group is looking for a technical lead with deep expertise in evaluating machine learning and deep learning models, including foundation models and multimodal systems.
In this role, you will design robust evaluation frameworks, mentor a team of engineers and scientists, and drive alignment across Apple's research, engineering, and product teams. You will combine strong analytical thinking, Python expertise, and a deep understanding of statistical evaluation and data quality. You will also help set the technical direction for how we measure and improve the quality of some of Apple's most exciting AI experiences.
  • Lead and mentor a small team of ML evaluation engineers and data scientists, providing technical guidance, feedback, and code reviews.
  • Define team-level evaluation standards and best practices that other teams across Apple can adopt.
  • Own the technical roadmap for evaluation infrastructure, from planning through execution.
  • Onboard and ramp up new team members on tools, datasets, and workflows.
  • Design and implement scalable evaluation frameworks for machine learning and deep learning models.
  • Develop robust methodologies to assess the performance of foundation models (such as LLMs and vision-language models) across diverse tasks.
  • Leverage LLMs as judges to perform subjective and open-ended model evaluations, for example in summarization, reasoning, or multimodal generation tasks.
  • Create, curate, and manage evaluation datasets and benchmarks.
  • Drive cross-functional alignment by partnering with product managers, research leads, and engineering managers to prioritize evaluation goals.
  • Collaborate with ML engineers, data scientists, and ML infrastructure engineers to deliver great user experiences.
  • Communicate findings clearly through dashboards, presentations, and technical documentation.
  • Influence without authority across teams with different goals and priorities.
  • Conduct failure analysis and uncover edge cases to improve model robustness.
  • Contribute to internal tools and infrastructure to automate and scale evaluation processes.
  • Write clear technical reports and present findings to both technical and non-technical audiences.
  • BS and a minimum of 10 years relevant industry experience.
  • 4+ years of industry or academic experience in machine learning or data science.
  • 2+ years of experience leading technical projects or mentoring junior engineers or scientists.
  • Strong experience evaluating supervised, unsupervised, and deep learning models.
  • Hands-on experience with LLMs (such as GPT, Claude, or PaLM) and using them as scoring or judging mechanisms.
  • Familiarity with multimodal models (such as image + text or video + audio) and their evaluation challenges.
  • Proficiency in Python and libraries such as NumPy, pandas, scikit-learn, PyTorch, or TensorFlow.
  • Solid understanding of statistical testing, sampling, confidence intervals, and metrics such as precision/recall, BLEU, ROUGE, and FID.
  • M.S. or Ph.D. in Computer Science, Statistics, Machine Learning, or a related field.
  • Prior experience managing or tech-leading a team of two or more engineers or scientists.
  • Experience with open-source evaluation tools such as OpenEval, ELO-based ranking, or LLM-as-a-Judge frameworks.
  • Familiarity with prompt engineering, few-shot, or zero-shot evaluation techniques.
  • Experience evaluating generative models, such as text or image generation systems.
  • Prior contributions to ML benchmarks or public evaluations.
  • Comfort with giving and receiving feedback in a collaborative, fast-moving environment.
  • Strong communication and documentation skills, with the ability to write technical reports and present to non-technical audiences.
At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $181,100 and $318,400, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including: Comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and for formal education related to advancing your career at Apple, reimbursement for certain educational expenses — including tuition. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation. Learn more about Apple Benefits.

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.

Apple accepts applications to this posting on an ongoing basis.