The Intelligence Platform team builds scalable, production-grade systems that power high-quality, user-centric intelligence across Apple’s operating systems. We focus on designing and operating large-scale ML systems leveraging Generative AI, Large Language Models, RAG architectures, and emerging agentic AI patterns. Our goal is to deliver reliable, low-latency, and privacy-preserving AI capabilities at scale.

We are looking for a Software Engineer with strong systems and engineering expertise to build and scale LLM-powered systems in production. This role focuses on designing robust infrastructure for LLM serving, tool-use orchestration, and agentic workflows. You will work at the intersection of ML and systems engineering—translating advanced AI capabilities into efficient, scalable, and reliable systems. You will play a key role in shaping system architecture, optimizing performance, and ensuring production readiness of LLM-driven features across Apple platforms.

* Design and build scalable systems for LLM inference, orchestration, and agentic workflows (e.g., tool-use pipelines, multi-step reasoning systems).
* Productionize LLM-based solutions with a focus on latency, throughput, reliability, and scalability.
* Architect and maintain infrastructure for model serving, batching, caching, and context management.
* Develop and optimize pipelines for RAG systems, retrieval infrastructure, and data flow across components.
* Partner with modeling teams to integrate models into production systems, ensuring alignment with performance and product requirements.
* Build monitoring, evaluation, and feedback systems to ensure high-quality and robust model behavior in production.
* Drive system-level optimizations across the stack, including distributed systems, concurrency, and resource management.

* Strong software engineering background with experience building distributed systems or large-scale production services.
* Experience deploying and operating ML/LLM systems in production environments.
* Solid understanding of systems design, performance optimization, and scalability trade-offs.
* Proficiency in programming and building reliable backend systems.
* Familiarity with LLM architectures and inference workflows.

* Experience with LLM serving systems, inference optimization, batching strategies, or caching (KV/prefix).
* Experience designing agentic systems, tool orchestration frameworks, or multi-turn pipelines.
* Familiarity with RAG systems, retrieval infrastructure, and vector databases.
* Experience with on-device / hybrid ML systems and constraints (latency, memory, privacy).
* Ability to lead system design discussions and influence architecture decisions across teams.

At Apple, base pay is one part of our total compensation package and is determined within a range. This provides the opportunity to progress as you grow and develop within a role. The base pay range for this role is between $147,400 and $272,100, and your base pay will depend on your skills, qualifications, experience, and location.

Apple employees also have the opportunity to become an Apple shareholder through participation in Apple’s discretionary employee stock programs. Apple employees are eligible for discretionary restricted stock unit awards, and can purchase Apple stock at a discount if voluntarily participating in Apple’s Employee Stock Purchase Plan. You’ll also receive benefits including comprehensive medical and dental coverage, retirement benefits, a range of discounted products and free services, and reimbursement for certain educational expenses, including tuition, for formal education related to advancing your career at Apple. Additionally, this role might be eligible for discretionary bonuses or commission payments as well as relocation.

Note: Apple benefit, compensation and employee stock programs are subject to eligibility requirements and other terms of the applicable plan or program.

Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics.

Apple accepts applications to this posting on an ongoing basis.

Software Engineer — LLM Systems, Generative AI Infrastructure & Agentic Platforms