Principal Engineer Manager
Microsoft
Multiple Locations, United States
Overview
The Azure Dedicated Team is responsible for attaching large-scale and non-native workloads to our Azure Core services. These workloads range from enterprise solutions such as storage filers and bleeding-edge bare-metal databases all the way to large-scale AI training supercomputers.
We are looking for a Software Engineering Manager with a blend of technical depth and a desire to lead a team through the current wave of AI innovation. In particular, this role could be a great fit for those with prior experience in server infrastructure lifecycle management at scale. This opportunity will give you exposure to the challenges of supercomputer infrastructure for AI training and put you at the forefront of serving the biggest household names in AI each day. You will have the autonomy to deliver innovative distributed system control planes while leading a skilled team of Software Engineers.
Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.
Qualifications
Required Qualifications:
- Bachelor's Degree in Computer Science, or related technical discipline AND 6+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python
- OR equivalent experience.
- 1+ years hands-on experience with remote management of rack-mounted server infrastructure at scale, including delivery of software services to manage, monitor, and sustain in-service nodes while meeting SLAs.
Other Qualifications:
- Ability to meet Microsoft, customer, and/or government security screening requirements is required for this role. These requirements include, but are not limited to, the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Preferred Qualifications:
- Master's Degree in Computer Science or related technical field AND 8+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR Bachelor's Degree in Computer Science or related technical field AND 12+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience.
- 4+ years people management experience.
- Knowledge of the most recent NVIDIA DGX SuperPOD architecture, including management and configuration.
- Knowledge of industry standards for security, including node security, BIOS management, and out-of-band / IPMI standards.
- Knowledge and practical use of Kubernetes (K8s).
Software Engineering M5 - The typical base pay range for this role across the U.S. is USD $139,900 - $274,800 per year. There is a different range applicable to specific work locations, within the San Francisco Bay area and New York City metropolitan area, and the base pay range for this role in those locations is USD $188,000 - $304,200 per year.
Certain roles may be eligible for benefits and other compensation. Find additional benefits and pay information here: https://careers.microsoft.com/us/en/us-corporate-pay
Microsoft will accept applications for the role until October 24, 2025.
#azurecorejobs
Responsibilities
- Leads product development and scaling to customer requirements, applies best practices for meeting scaling needs and performance expectations, and holds accountability for products that do not meet expectations.
- Guides partnership with appropriate stakeholders (e.g., project manager, technical lead and customer) to determine user requirements within and across teams.
- Guides teams and leads identification of dependencies and the development of design documents for a product, application, service, or platform.
- Optimizes, debugs, refactors, and reuses code to improve performance and maintainability, effectiveness, and return on investment (ROI).
- Guides the team to drive multiple groups' project plans, release plans, and work items in coordination with appropriate stakeholders (e.g., project managers).
- Guides the team, acts as an expert for the Designated Responsible Individual (DRI) role, and monitors other engineers across product lines, working on call to monitor the system, product, or service for degradation, downtime, or interruptions.