Machine Learning Inference Performance Engineer
Apple
Software Engineering, Data Science
Sunnyvale, CA, USA
USD 147,400-272,100 / year + Equity
Posted on Mar 22, 2026
At Apple, we're on the cutting edge of delivering transformative experiences through Artificial Intelligence. If you're passionate about pushing the boundaries of AI and hardware optimization, we want you to join our team. As a Machine Learning Inference Performance Engineer, you will help push the boundaries of on-device generative AI performance and efficiency, designing and implementing novel techniques to optimize large-scale machine learning workloads on the Apple Neural Engine (ANE). You will work at the intersection of machine learning, systems, and hardware architecture, shaping how next-generation AI models run across millions of Apple devices. This is a unique opportunity to contribute to technologies that directly impact the daily experience of Apple customers worldwide.
In this role, you will play a critical part in enabling state-of-the-art machine learning workloads on Apple silicon, collaborating closely with model developers, machine learning researchers, compiler engineers, and hardware architects to deliver highly optimized inference performance. Specifically, you will:
- Develop novel ML inference optimization strategies to improve performance and power efficiency on the ANE
- Analyze and identify performance bottlenecks across the full stack, including model architecture, compiler, runtime, and hardware
- Partner with Apple AI/ML, software, and silicon teams to co-design next-generation ML models and inference techniques optimized for the ANE
- Build and maintain performance profiling tools, benchmarking frameworks, and analysis infrastructure
- Drive performance characterization, modeling, and optimization for large-scale ML workloads such as LLMs, diffusion models, and computer vision models
- Strong understanding of machine learning inference workloads, including LLMs, diffusion models, or computer vision models
- Deep knowledge of computer architecture, including memory hierarchy, parallelism, dataflow, and SIMD/vector processing
- Hands-on experience optimizing machine learning inference on hardware accelerators (GPU, TPU, NPU, etc.)
- Experience analyzing system-level performance and identifying optimization opportunities across software and hardware stacks
- Strong programming skills in Python and C/C++
- BS in Computer Science, Computer Engineering, or a related field
- Minimum of 3 years of experience in system performance analysis, machine learning systems, or hardware/software optimization
- Strong debugging, performance analysis, and problem-solving skills
- MS or PhD in Computer Science, Machine Learning, Computer Architecture, or a related field
- Experience with ML system optimization, performance modeling, or architecture evaluation
- Experience developing profiling tools, benchmarking frameworks, or performance analysis tools
- Familiarity with hardware accelerators or ML compiler stacks
- Experience with large-scale ML workloads such as transformers, diffusion models, or mixture-of-experts architectures
- Strong analytical skills with the ability to analyze large datasets and communicate insights clearly
- Excellent written and verbal communication skills
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.