Machine Learning Research Manager, Speech and Multimodal Modeling
Apple
Software Engineering, Data Science
Cambridge, MA, USA
USD 205,200-308,400 / year + Equity
Posted on Nov 14, 2025
The Speech Team within the Siri organization drives major speech recognition, synthesis and speech to speech model changes for various features deeply embedded throughout Apple’s ecosystem. Our team owns on-device accurate and private speech recognition models across various systems on chip and hardware platforms with diverse compute restrictions, enabling prominent production user experiences. This team drives core technology advances while fulfilling major production needs, including developing speech to speech experiences and the underlying multimodal foundation model technology for current and future speech-enabled features across Apple’s software, hardware, and services ecosystem. This allows for cutting edge applied research anchored in Apple specific production needs, while improving speech interaction experiences for Apple’s customers around the world. Our technology powers speech interaction for iOS, watchOS, visionOS, macOS, tvOS, including Siri, Dictation and various speech enabled Apple Intelligence features.
We are seeking an exceptional machine learning research manager with deep technical expertise and strong cross-functional leadership skills to drive applied modeling research, deliver foundational models that support large-scale project development, and contribute to advancing our data processing, modeling, and evaluation tooling. In this role, you will lead core modeling initiatives in speech understanding and speech-to-speech interactions.
- Deliver speech models and multimodal foundation models enabling step function quality improvements and new speech experiences
- Oversee applied research and development closely, anchored in Apple’s needs around high quality, efficient and private speech to speech models
- Hands-on leadership to multiply productivity of their team by removing technical blockers
- Collaborate with cross function teams on data processing, model development tooling, and evaluation
- 5+ years of experiences working in and leading machine learning research and development
- Experience working on speech modeling, delivering research and production outcomes
- Experience with both speech understanding and generation preferred
- Prior experience leading a machine learning team
- Hands-on experiences with large scale machine learning model development
- Demonstrated ability to learn new technologies efficiently and apply them effectively
- Proven ability to define and drive processes to increase development productivity and support complex project execution
- Master’s degree or equivalent in Computer Science, Electrical Engineering or related fields
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.