ML Engineer - Evaluation Automation, Siri AI Quality Engineering
Apple
Software Engineering, Data Science, Quality Assurance
Cupertino, CA, USA
Posted on Aug 19, 2025
Apple has an extraordinary reputation for product quality. We are looking for a versatile Machine Learning Engineer with a strong background in Large Language Models (LLMs) to build the next generation ML evaluation frameworks and tools. In this role, you will use LLMs and other ML techniques to help automate large-scale data generation and evaluation job execution on server or on device, build LLM judges, detect anomalies, and streamline ML evaluation workflows. This is a high-impact role where you'll work at the intersection of AI/ML, conversational agents, information retrieval, software engineering, and ML evaluation, helping us push the boundaries of how AI can transform ML evaluation.