On-device ML Performance Infrastructure Engineer

Apple

Apple

Software Engineering, Other Engineering, Data Science
Seattle, WA, USA
Posted on May 29, 2025
Apple is the best place to do on-device machine learning, and this team sits at the heart of that discipline, interfacing with research, SW engineering, HW engineering, and products. The team builds critical infrastructure for analyzing latency, memory, and numerical correctness of the latest machine-learning architectures across all Apple devices. This cross-functional effort enables model developers to make informed decisions about achieving optimal performance through advanced quantization, sparsity, and architecture tradeoffs. This infrastructure underpins most of Apple’s critical machine learning workflows across Camera, Siri, Health, Vision, and other areas, and as such, is an integral part of Apple Intelligence. Our group is seeking an ML Performance Infrastructure Engineer with a focus on ML frameworks and runtimes, as well as system performance. The role entails building the infrastructure for providing developers with actionable feedback in the world’s foremost ML graph compilation and runtime system, capable of optimizing and executing efficiently on Apple products and services. We promote innovation and new technology to further improve our creative output. We are seeking an innovative and passionate person to join this amazing team, if you feel this is you, we'd love to hear from you!