Data Scientist - Multimodal Training Data & Tools - SIML
Apple
Data Science
Cupertino, CA, USA
USD 147,400-220,900 / year + Equity
Posted on Apr 1, 2026
Are you passionate about Generative AI? Are you interested in working on cutting edge generative modeling technologies to enrich billions of people? We are the Intelligence System Experience (ISE) team within Apple’s software organization. The team operates at the intersection of multimodal machine learning and system experiences. It oversees a range of experiences such as System Experience (Springboard, Settings), Image Generation, Genmoji, Writing tools, Keyboards, Pencil & Paper, Generative Shortcuts - all powered by production scale ML workflows. Our multidisciplinary ML teams focus on a broad spectrum of areas, including Visual Generation Foundation Models, Multimodal Understanding, Visual Understanding of People, Text, Handwriting, and Scenes, Personalization, Knowledge Extraction, Conversation Analysis, Behavioral Modeling for Proactive Suggestions, and Privacy-Preserving Learning. These innovations form the foundation of the seamless, intelligent experiences our users enjoy every day. We are seeking engineers experienced in building models, data pipelines and tools for prompt optimization, data synthesis, and auto grading to enable training and deploying large-scale generative models. You will be working alongside a cross functional team of engineers who own ML infrastructure & algorithms, data scientists, designers, safety and UX engineers. Industry experience in prompt optimization, large-scale data synthesis across text and images, and automated grading using GenAI is preferred. Selected references to our team’s work: https://arxiv.org/pdf/2507.13575 https://arxiv.org/pdf/2407.21075 https://www.apple.com/newsroom/2024/12/apple-intelligence-now-features-image-playground-genmoji-and-more/
We are seeking an experienced data scientist to help us build and deploy large scale generative models. Responsibilities in the role will include training ad-hoc models for data synthesis, build data pipelines and tools for large-scale data auto-grading across text and image, prompt engineering and optimization, and extracting insights from billion-scale datasets to enable the model training.
- Bachelors or Masters degree in Electrical Engineering/Computer Science or a related field (mathematics, physics or computer engineering), with a focus on data science
- 3+ years of data science or related experience, preferably in a consumer tech company
- Experience developing models for data synthesis and auto-grading to enable training generative models
- Experience in prompt engineering and optimization for LLMs
- Strong programming and problem-solving skills
- Strong problem-solving skills and ability to work in a collaborative, product-focused environment
- Ability to communicate technical results clearly and concisely
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.