Model Optimization Engineer (PyTorch Infrastructure Development)

Apple

Other Engineering
Cupertino, CA, USA
Posted on May 29, 2025
Are you excited about the impact that optimizing deep learning models can have on enabling transformative user experiences? The field of ML compression research continues to grow rapidly, and new techniques such as quantization and pruning are increasingly available to be ported and adopted by the ML developer community, which is looking to ship more models within a constrained memory budget and make them run faster. We are passionate about productizing and advancing state-of-the-art model optimization algorithms to further compress and speed up the thousands of models that ship as part of Apple internal and external apps and run locally on millions of Apple devices.

We are a team that collaborates heavily with researchers, ML software and hardware architecture teams, and external and internal product teams shipping models on Apple devices. If you are excited about making a big impact and playing a critical role in growing the user base and driving the adoption of a relatively new library, this is a great opportunity for you.

We are looking for someone who is highly self-motivated and passionate about optimizing models for on-device execution. If you have a proven track record of developing and working with the internals of an ML Python library, writing high-quality code, and shipping software, we strongly encourage you to apply.