Senior Manager, Software Engineering - Libraries DevOps and Infrastructure
NVIDIA
We are looking for an experienced and self-motivated software engineering manager to lead our DevOps and infrastructure development efforts for high-performance accelerated computing libraries in the AI and scientific computing domains. Around the world, leading commercial and academic organizations are revolutionizing AI, data analytics, and scientific and engineering simulations, using data centers powered by GPUs. Applications of these technologies include LLMs, computer vision, encryption, computer-aided engineering (CAE), electronic design automation (EDA), autonomous vehicles, and countless others. Did you know our team develops the GPU-accelerated libraries and SDKs that help make these possible?
In this role, you will lead an engineering team crafting, developing, and optimizing our continuous integration and delivery (CI/CD) infrastructure. Over a hundred engineers use this system to build, improve, and test innovative software libraries on the latest GPUs and CPUs. Ideal candidates will not only have experience leading engineering groups focused on software, but also be motivated to advance the state of the art. If this sounds exciting, we would love to meet you.
What you will be doing:
Lead, mentor, and grow your engineering team.
Be responsible for systems that build, test, and monitor functional and performance characteristics of our accelerated computing libraries.
Continuously adopt new technologies to improve efficiency and adapt to changes.
Work closely with NVIDIA teams responsible for the planning and hosting of infrastructure needed by our team.
Interact with libraries engineering teams to understand their use cases and requirements.
What we need to see:
Bachelor's degree in Computer Science, Engineering, Applied Mathematics, or a related field (or equivalent experience).
12+ years of overall proven experience.
4+ years of experience managing software engineering teams, including recruiting experience.
Hands-on software product development experience, including functional and performance testing.
Excellent project planning and management skills for software development in an agile environment.
Proficiency in: scripting (Shell, Perl, etc.); programming (Python, C, C++); source control (e.g., Git); builds (CMake, Make, etc.); and CI/CD systems (GitLab CI, Jenkins, etc.).
Strong collaboration, communication, and documentation habits.
Ways to stand out from the crowd:
Experience working in a globally distributed organization.
Familiarity with CPU and/or GPU compute capabilities.
Experience leading and mentoring teams in dynamic environments such as concurrent hardware and software development.
Experience deploying agentic AI solutions in software development processes.
Experience with high-performance computing (HPC) hardware systems, such as large-scale compute clusters, including system-administrator-level experience with multi-user Linux servers.
You will also be eligible for equity and benefits.
This posting is for an existing vacancy.
NVIDIA uses AI tools in its recruiting processes.
NVIDIA is committed to fostering a diverse work environment and proud to be an equal opportunity employer. As we highly value diversity in our current and future employees, we do not discriminate (including in our hiring and promotion practices) on the basis of race, religion, color, national origin, gender, gender expression, sexual orientation, age, marital status, veteran status, disability status or any other characteristic protected by law.