Hero Image

AnitaB.org Talent Network

Connecting women in tech with the best professional opportunities!

Principal GPU/CPU Systems Engineer

Oracle

Oracle

Software Engineering
India · Bengaluru, Karnataka, India
Posted on Feb 27, 2026

Required Qualifications

  • 10 or more years of experience in hardware design, system engineering, and platform bring-up.
  • Hands-on experience with market-leading GPUs or AI platforms spanning development, bring-up, test, and characterization.
  • Strong knowledge of AI/GPU and or AI/CPU platform architectures and capabilities.
  • Experience evaluating system architectures, platform definitions, and implementation paths.
  • Ability to balance hardware performance, power, cost, regulatory, and cross-functional requirements.
  • Experience with modern server platforms across x86 and ARM architectures.
  • Hardware development experience at the system, board, and FPGA levels.
  • Proficiency reviewing hierarchical schematics, advanced multilayer board layouts, and end-to-end interconnects.
  • Strong understanding of firmware and system diagnostics using BMC firmware, UEFI or BIOS, and Linux tools.
  • Experience scripting and customizing diagnostics, validation, and test workflows.
  • Experience with GPU supplier test code and open-source AI test and characterization tools.
  • Experience with system integration, validation, and performance characterization.
  • Strong understanding of high-speed buses and interconnects used in modern AI and compute platforms.
  • Demonstrated ability to debug and root-cause complex hardware and software issues.
  • Ability to document design intent and technical specifications clearly.
  • Strong communication skills with the ability to explain complex technical topics across engineering teams and executive audiences.
  • Proven ability to provide cross-functional technical leadership and collaborate effectively with internal teams and external partners.

Preferred Skills

  • Experience using hardware debuggers.
  • Experience with PCIe, DDR, Ethernet, USB, SPI, and related interfaces.
  • Experience with platform-level security technologies.
  • Experience with power circuit design and signal integrity.

Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.

True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.

We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling 1-888-404-2494 in the United States.

Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.


Oracle hardware platform development engineering is seeking a Sr. Principal GPU/CPU Systems Engineer to help define, develop, and support next-generation AI and compute platforms for Oracle Cloud Infrastructure (OCI). This role focuses on platform architecture, system integration, performance characterization, and in-service support of large-scale Cloud AI systems. You will work closely with internal hardware, firmware, software, security, manufacturing, and cloud operations teams, as well as external GPU and AI silicon partners, to deliver highly performant, secure, and scalable Cloud AI solutions.

Career Level - IC5


Platform Architecture and Definition

  • Participate in platform definition, architecture evaluation, and analysis for existing and next-generation Cloud AI platforms.
  • Evaluate system architectures, proposed implementations, and scaling and optimization strategies.
  • Review and assess third-party merchant silicon used for AI accelerator modules and GPU/CPU platforms.
  • Balance hardware performance priorities against power, cost, regulatory, and cross-functional requirements.

Platform Development and Oversight

  • Drive definition, development, integration, debug, characterization, and tuning of AI hardware platforms.
  • Provide platform development oversight for internal teams and third-party partners.
  • Work with in-house engineering experts on design reviews, schematics, board layout, and implementation decisions.
  • Document and specify design intent and technical details in collaboration with engineering teams.

System Integration, Validation, and Performance

  • Guide and support system integration, system test, qualification, and characterization.
  • Define and oversee system validation plans, diagnostics features, and test strategies.
  • Develop and expand system characterization and performance testing capabilities.
  • Utilize supplier-provided and approved open-source AI platform qualification and test tools.
  • Support definition of in-service system monitoring, error reporting, and operational health visibility.

Cross-Functional and Partner Collaboration

  • Collaborate with GPU and AI chip suppliers, system architects, firmware developers, and hardware engineers.
  • Partner with storage, networking, compute, quality, security, cloud orchestration, and manufacturing teams.
  • Support development program managers with technical assessments and planning.
  • Assist manufacturing teams to ensure hardware is secure, robustly evaluated, and production-ready.

Security, Support, and Operations

  • Participate in hardware platform security evaluations.
  • Guide internal teams and partners on scaling, monitoring, and deploying AI platforms into the cloud.
  • Serve as a senior technical advisor to Oracle hardware, software, cloud, and support teams.
  • Act as the final level of engineering support for complex deployed product issues.
  • Assist with root-cause analysis through lab replication, remote debug, and cross-team collaboration.