On-device ML Performance Infrastructure Engineer
Apple
Software Engineering, Other Engineering, Data Science
Seattle, WA, USA
Posted on May 30, 2025
The On-Device Machine Learning team at Apple is responsible for enabling the Research to Production lifecycle of cutting edge machine learning models that power magical user experiences on Apple’s hardware and software platforms. Apple is the best place to do on-device machine learning, and this team sits at the heart of that discipline, interfacing with research, SW engineering, HW engineering, and products. The team builds critical infrastructure for analyzing latency, memory and numerical correctness of the latest machine learning architectures across all Apple devices. This cross functional effort powers model developers’ decisions to get full machine performance via advanced quantization/sparsity/architecture tradeoffs. This infrastructure underpins most of Apple’s critical machine learning workflows across Camera, Siri, Health, Vision, etc., and as such is an integral part of Apple Intelligence. Our group is looking for an ML Performance Infrastructure Engineer, with a focus on ML frameworks/runtimes and system performance. The role entails building the infrastructure for giving developers actionable feedback in the world’s foremost ML graph compilation and runtime system capable of optimizing & executing ML models efficiently on Apple products and services.