Senior Manager, Software Engineering - Platform Team
NVIDIA
NVIDIA has been transforming computer graphics, PC gaming, and accelerated computing for more than 25 years. It’s a unique legacy of innovation that’s fueled by great technology—and amazing people. Today, we’re tapping into the unlimited potential of AI to define the next era of computing. An era in which our GPU acts as the brains of computers, robots, and self-driving cars that can understand the world. Doing what’s never been done before takes vision, innovation, and the world’s best talent. As an NVIDIAN, you’ll be immersed in a diverse, supportive environment where everyone is inspired to do their best work. Come join the team and see how you can make a lasting impact on the world.
The Automation Team seeks a Senior Software Engineering Manager to lead the development and upkeep of key platforms supporting internal systems on Configuration Management, Asset Management, and Observability. Lead a team of skilled engineers in developing core platform services, crafting the technical roadmap, and encouraging innovation.
What you'll be doing:
- Lead and manage software engineers, including hiring, mentoring, and performance evaluations. 
- Define and implement a technical roadmap that closely aligns with company goals. 
- Coordinate the creation, advancement, and implementation of crucial platform services, promoting guidelines in software development, such as code reviews, testing, and documentation. 
- Encourage a culture centered on constant growth, originality, and outstanding technical proficiency. 
- Manage project timelines, resources, and risks to ensure successful delivery. 
- Provide operational and platform support during non-US business hours for service continuity and incident response. 
What we need to see:
- Bachelor's or equivalent experience in Computer Science, Engineering, or a related field. 
- 10+ years of software development experience, with a focus on platform/backend systems, and 5+ years leading software engineering teams. 
- Demonstrated expertise in developing and rapidly growing distributed systems, cloud-native applications, and large-scale enterprise platforms for Configuration Management, Asset Management, and Observability. 
- Strong proficiency in programming languages such as Java, Python, or Go, and experience managing containerized application deployments in both cloud and on-prem environments. 
- Solid understanding of database technologies (both relational and NoSQL) and message queues. 
- Excellent communication, interpersonal, and leadership skills with a demonstrated ability to encourage, motivate, and drive technical excellence within agile development environments. 
Ways to stand out from the crowd:
- Experience with Nautobot to maintain asset lifecycle management for both On-Prem and Cloud Assets. 
- Hands-on skills in Python/Go Lang, with the ability to build and deploy containerized software on orchestration platforms. 
- Prior experience working with LLMs, RAGs, and the implementation of AIOps tools for incident diagnosis and remediation. 
- Experience in deploying Ansible Automation Platform and Salt Stack in On-Prem Hardware to manage Network and Infrastructure assets in Data Centers. 
 
              
            