Principal Technical Program Manager , Amazon CloudTune
Amazon
IT, Operations
Seattle, WA, USA
Description
The CloudTune Capacity Management and Planning (CaMP)-PMO team is responsible for planning, procuring, and managing cloud AWS infrastructure resources—including business-critical Gen AI resources—used by Amazon's WW Stores, Digital, and Other businesses (Alexa, Advertising, Digital Video, Retail Website Marketplaces, and more). We are a fun-loving, high-performing, and diverse team that owns SDO-wide cross-cutting programs spanning short-term capacity assurance for peak events such as Prime Day, Black Friday, and Cyber Monday, as well as longer-term capacity assurance programs such as Diversify Mega regions, while balancing and managing IMR spend.
As Gen AI becomes foundational to Amazon's customer experience and operational excellence, GPU capacity has emerged as one of our most strategically constrained resources. We are looking for a passionate, deeply technical, and results-oriented Principal Technical Program Manager to join our team and lead the mission-critical GenAI program by partnering closely with SDO-wide organizations and AWS/Bedrock teams.
You will be responsible for capacity management and assurance of highly constrained GPUs that power Amazon's Gen AI capabilities—a resource where demand consistently exceeds supply by 2-3x. This will include:
Driving supply and supportability with AWS to secure and scale GPU infrastructure
Owning engineering goals for systematic end-to-end demand-supply signals to optimize allocation
Leading migration initiatives to move inferencing workloads to Bedrock, improving efficiency and reducing infrastructure pressure
You will have a proven track record of planning and delivering complex programs with multi-year roadmaps using Agile and incremental delivery methods. You will have experience influencing architectural change across thousands of teams in highly constrained environments, and you can succinctly communicate with engineers through senior VPs to drive alignment on critical infrastructure decisions.
Key job responsibilities
* You will be responsible for the strategic roadmap and tactical milestones of the Program.
* You will contribute to the Monthly Business Reviews and leadership updates for the Program and goals , you will be required to communicate effectively verbally and in writing, to a wide range of audiences including Directors and VPs across SDO businesses
* You will be responsible to drive decisions in an ambiguous environment by proactively identifying risks and bringing them to the attention of stakeholders with mitigation plans before they become roadblocks.
* You will be responsible for diving deep into data and metrics, understanding them well to drive and influence decisions and stay connected to the details
* You will need a strong bias for action and be able to handle multiple priorities simultaneously
* Your ability to understand the big picture and plan ahead for dependencies and roadblocks is crucial.
* Throughout, you will internalize Amazon’s Leadership Principles, and live those into everyday practices to guide your programs to success.