Test Triage & Automation Engineer, Siri
Apple
Software Engineering
Cupertino, CA, USA
USD 147,400-272,100 / year + Equity
Posted on Mar 20, 2026
Apple is where individual imaginations gather together, committing to the values that lead to great work. Every new product we build, service we create, or Apple Store experience we deliver is the result of us making each other’s ideas stronger. That happens because every one of us shares a belief that we can make something wonderful and share it with the world, changing lives for the better. It’s the diversity of our people and their thinking that inspires the innovation that runs through everything we do. When we bring everybody in, we can do the best work of our lives. Here, you’ll do more than join something — you’ll add something. As part of Siri AI Quality Engineering, we are dedicated to creating groundbreaking conversational assistant technologies for both large-scale systems and new client devices, building upon our legacy of intelligent assistant solutions that already assist millions of users worldwide. Does the opportunity to play a part in building groundbreaking technology for large-scale systems, natural language and artificial intelligence excite you? Do you want to expand the experience of Siri and other AI/ML products to new products that will help millions get things done, across the globe? Join Siri AI Quality Engineering at Apple and contribute to a highly accomplished team dedicated to releasing high-quality software, models, and products that will delight and inspire millions of people.
We are seeking a Senior Software Engineer to join our Siri AI Client Platforms Quality Engineering team. In this role, you will be dealing with high volumes of test data and evaluation pipelines requiring both technical depth and creative thinking to build solutions that can keep pace with the rapid evolution of AI technologies. You will be responsible for designing, driving, triaging, evaluating automation results that support the qualification of Siri's AI features, not just validating what features are supposed to do, but creatively measuring qualitative experiences the way a real user would. This means going beyond pass/fail metrics and thinking deeply about how Siri's responses feel, how natural the interactions are, and how well the product truly serves our customers in the real world. You will work closely with Product, Platform engineers, and program managers to understand how Siri behaves, how they evolve, and how changes in the underlying AI stack ripple through to the customer experience. Your insights and findings will directly influence product decisions, model improvements, and feature launches. We are a fast-paced, and deeply collaborative team that values the tight relationship between Quality engineering, Product engineering, and program management. We move quickly, we hold ourselves to the highest standards, and we genuinely care about the products we build. If you are someone who thrives in an environment where your work matters, where innovation is encouraged, and where you can see the direct impact of your contributions on a global scale, this is the team for you.
- Own end-to-end automation pipelines across multiple platforms, from creation and configuration to continuous monitoring, evaluation, and triage.
- Harness large-scale test datasets to perform deep analysis, identify trends, and surface actionable quality insights that inform product decisions and drive improvements across Siri's AI features
- Innovate and build new strategies, frameworks, and tooling to accelerate evaluation cycles, reduce bottlenecks, and continuously raise the efficiency and effectiveness of the quality engineering process
- Proactively identify, escalate, and track critical issues whether blocking automation evaluation or impacting the product experience
- Partner closely with Program Management teams to drive timely resolution of issues, balancing urgency with thoroughness to keep product timelines on track without compromising quality
- Operate with a self-starter mindset: take full ownership of your work, anticipate challenges before they arise, and drive initiatives forward with minimal direction in a fast-paced, ever-evolving environment
- 5+ years of experience designing, implementing, and optimizing large-scale data-driven platforms and frameworks, APIs, services, and tools.
- Thorough understanding of system, architecture and large-scale system design.
- Strong programming skills with Swift, Python and Shell scripting languages
- Experience building dashboards and analytics solutions using tools like Tableau, Grafana, Superset, or Splunk to visualize KPIs and monitor data quality.
- Demonstrated success in collaborating cross-functionally with engineering, machine learning, and data science teams to solve complex challenges.
- Ability to proactively triage, investigate, and debug difficult technical and UX problems independently as well as collaboratively
- Capacity to drive test triage products, methodologies, and processes
- Proficiency with software revision control (e.g. Git) and CI/CD systems (e.g. Jenkins)
- Highly organized with strong planning skills to estimate, update, and communicate progress
- BS/MS in Computer Science, Engineering, or a related field.
- Deep understating about large scale data validation platforms with focus on privacy.
- Experience building tooling solutions with Claude tools.
- Knowledge of statistics-based evaluation approaches, ML training pipelines, and techniques for enhancing the accuracy of ML systems.
- Strong attention to detail and the proven ability to delve into data, uncover hidden patterns, and conduct comprehensive error/deviation analysis.
Apple is an equal opportunity employer that is committed to inclusion and diversity. We seek to promote equal opportunity for all applicants without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, Veteran status, or other legally protected characteristics. Learn more about your EEO rights as an applicant.