Principal Software Development Engineer, AWS Mantle
Software Engineering
Seattle, WA, USA
Description
Are you passionate about building the infrastructure that powers the next generation of AI? We are seeking a Principal Software Development Engineer to join the AWS Mantle team and drive the technical vision for our distributed inference engine that serves millions of customers across Amazon Bedrock. In this role, you will define and solve large-scale, ambiguous technical challenges at the intersection of machine learning systems, distributed computing, and security, shaping how the world accesses foundation models.
Set the long-term technical direction for a globally distributed, high-performance ML inference platform serving models from industry-leading AI providers
Own end-to-end system design decisions that directly impact latency, reliability, and scalability for millions of customers worldwide
Influence engineering strategy across Amazon Bedrock, partnering with senior leadership to align technical investments with business outcomes
Raise the engineering bar through exemplary system design, mentorship, and contributions to the broader AWS engineering community
Navigate complex trade-offs across performance, security, and cost while maintaining the highest standards for operational excellence
Key job responsibilities
As a Principal SDE on the Mantle team, you will serve as the technical conscience and strategic thought leader for one of AWS's most critical AI infrastructure platforms. You will architect solutions that are reliable, scalable, and secure—operating at the cutting edge of distributed systems where millisecond-level latency and zero-trust security are non-negotiable.
Design and evolve the architecture of Mantle's distributed inference engine, including capacity management, model onboarding pipelines, and quality-of-service controls
Drive cross-organizational initiatives spanning multiple AWS teams to deliver seamless, OpenAI-compatible API experiences with Zero Operator Access (ZOA) security guarantees
Lead technical strategy for scaling inference to support rapid onboarding of new foundation models while maintaining global availability and performance SLAs
Author and champion technical vision documents, influence product roadmaps, and represent the team in executive-level architectural reviews
Mentor and develop senior engineers, fostering a culture of engineering excellence, innovation, and customer obsession
About the team
The AWS Mantle team is building the next-generation inference engine that powers Amazon Bedrock—providing secure, enterprise-grade access to high-performing foundation models from the world's leading AI companies. Our mission is to simplify and accelerate how models are served at global scale, with an unwavering commitment to customer trust through innovations like our Zero Operator Access architecture, designed so that no person—whether from AWS, a customer, or a model provider—can ever access customer inference data.
We operate at massive scale, serving inference requests across all major AWS regions with sophisticated automated capacity management and unified resource pools
Our team values builders who thrive in ambiguity, think long-term, and are excited to define the future of AI infrastructure from the ground up
We foster a collaborative, inclusive environment where diverse perspectives drive better solutions—and where the best ideas win regardless of where they originate
We ship fast and iterate with purpose: since launch, we have rapidly expanded to support models from OpenAI, DeepSeek, Google, Mistral, NVIDIA, and more
We believe work should be meaningful and fun—you'll join a team that takes pride in making history at the forefront of generative AI