Sr. Manager, Technical Infrastructure Program Manager, ML Capacity Delivery
Amazon
Software Engineering, Other Engineering, IT, Operations, Data Science
Seattle, WA, USA
Description
We are seeking a Senior Manager of Technical Infrastructure Program Managers (TIPMs) to lead our Machine Learning infrastructure delivery team across the Americas. As part of the Data Center Planning & Delivery (DCPD) organization, our team plans and delivers the physical data center infrastructure that powers AWS’s rapidly growing Machine Learning and Generative AI services.
You will manage a team of managers and individual contributors responsible for the capacity planning and delivery of ML data center infrastructure at multi-gigawatt scale. You will work across the entire AWS organization to develop both short- and long-range capacity plans, address infrastructure constraints, and drive projects from pipeline evaluation through site selection, design, construction, testing, and deployment.
The ideal candidate is a strong people leader and communicator who thrives in highly ambiguous, fast-paced environments. You will build and scale specialized teams delivering infrastructure on compressed timelines and in greenfield locations. You possess proven judgment, relationship-building skills, the ability to influence without authority across multiple organizations, and a track record of building organizational capabilities from scratch. Experience in data center infrastructure or large-scale capital program delivery is required.
You must be comfortable leading remote teams delivering novel solutions across multiple US regions; designing team structures and mechanisms to meet long-term business goals; managing complex stakeholder landscapes up to VP level; driving decisions across construction, energy, procurement, design, and finance teams; and navigating persistent ambiguity and change. You will build a strategy where none exists, establish organizational standards, and drive cross-functional alignment at scale.
If you are a builder who enjoys deploying technical and highly innovative projects at massive scale, and you want to help shape the future of AI infrastructure, this position is for you!
Key job responsibilities
• Build, lead, and develop a geographically dispersed team of managers and individual contributors delivering ML data center capacity across multiple US regions
• Design organizational structure including specialized sub-teams
• Execute demand planning processes that build strategic alignment across multiple stakeholders and produce infrastructure capacity strategies and plans
• Work with regional planning, design, construction, energy, and field engineering teams to facilitate site selection, qualification, and build planning
• Manage the delivery of ML-specific infrastructure including high-power-density facilities, specialized networking, and liquid cooling systems
• Drive strategic planning (OP/S&OP) processes and create scalable mechanisms for capacity forecasting and financial planning
• Solve complex problems and remove blockers to timely capacity delivery; manage escalations with high judgment
• Identify opportunities to invent and simplify processes, implementing resolutions and scalable mechanisms that become organizational standards
• Communicate ideas effectively to a wide variety of stakeholders, from execution teams to VP-level leadership, for purposes ranging from informational updates to executive approvals
• Partner cross-functionally with Networking, Public Policy, Operations, Design Engineering, Real Estate, Supply Chain, Security, and Finance
• Hire, develop, and retain top talent; provide coaching and career development to grow the next generation of infrastructure leaders