Staff Engineer, Gen AI
Qualcomm
Software Engineering, Data Science
Hyderabad, Telangana, India
Posted on Apr 4, 2025
Company:
Qualcomm India Private Limited
Job Area:
Engineering Group, Engineering Group > Software Engineering
General Summary:
More details below:
Join the exciting Generative AI team at Qualcomm focused on integrating cutting edge GenAI models on Qualcomm chipsets. The team uses Qualcomm chips’ extensive heterogeneous computing capabilities to allow inference of GenAI models on-device without a need for connection to the cloud. Our inference engine is designed to help developers run neural network models trained in a variety of frameworks on Snapdragon platforms at blazing speeds while still sipping the smallest amount of power. Utilize this power efficient hardware and Software stack to run Large Language Models (LLMs) and Large Vision Models (LVM) at near GPU speeds!
Responsibilities:
In this role, you will spearhead the development and commercialization of the Qualcomm AI Runtime (QAIRT) SDK on Qualcomm SoCs. As an AI inferencing expert, you'll push the limits of performance from large models. Your mastery in deploying large C/C++ software stacks using best practices will be essential. You'll stay on the cutting edge of GenAI advancements, understanding LLMs/Transformers and the nuances of edge-based GenAI deployment. Most importantly, your passion for the role of edge in AI's evolution will be your driving force.
Minimum Qualifications:
Bachelor’s degree in engineering, Computer Science, or related field and 10+ years of Systems Engineering or related work experience.
OR
Master’s degree in engineering, Computer Science, or related field and 9+ years of Systems Engineering or related work experience.
Requirements:
Strong understanding of Generative AI models – LLM, LVM, LMMs and building blocks (self-attention, cross attention, KV caching etc.)
Floating-point, Fixed-point representations and Quantization concepts.
Experience with optimizing algorithms for AI hardware accelerators (like CPU/GPU/NPU).
Hands-on experience in C/C++ programming, Design Patterns and OS concepts.
Excellent analytical and debugging skills.
Exposure to shell scripts, python scripts, understanding of Linux/Windows systems and automation scripts/environment.
Good communication skills, presentation skills and should manage his/her tasks independently.
Ability to collaborate across a globally diverse team and multiple interests.
Preferred Qualifications
Minimum Qualifications:
Master's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Software Engineering or related work experience.
OR
PhD in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience.
Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.
To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.
If you would like more information about this role, please contact Qualcomm Careers.
3073180
Qualcomm India Private Limited
Job Area:
Engineering Group, Engineering Group > Software Engineering
General Summary:
More details below:
Join the exciting Generative AI team at Qualcomm focused on integrating cutting edge GenAI models on Qualcomm chipsets. The team uses Qualcomm chips’ extensive heterogeneous computing capabilities to allow inference of GenAI models on-device without a need for connection to the cloud. Our inference engine is designed to help developers run neural network models trained in a variety of frameworks on Snapdragon platforms at blazing speeds while still sipping the smallest amount of power. Utilize this power efficient hardware and Software stack to run Large Language Models (LLMs) and Large Vision Models (LVM) at near GPU speeds!
Responsibilities:
In this role, you will spearhead the development and commercialization of the Qualcomm AI Runtime (QAIRT) SDK on Qualcomm SoCs. As an AI inferencing expert, you'll push the limits of performance from large models. Your mastery in deploying large C/C++ software stacks using best practices will be essential. You'll stay on the cutting edge of GenAI advancements, understanding LLMs/Transformers and the nuances of edge-based GenAI deployment. Most importantly, your passion for the role of edge in AI's evolution will be your driving force.
Minimum Qualifications:
Bachelor’s degree in engineering, Computer Science, or related field and 10+ years of Systems Engineering or related work experience.
OR
Master’s degree in engineering, Computer Science, or related field and 9+ years of Systems Engineering or related work experience.
Requirements:
Strong understanding of Generative AI models – LLM, LVM, LMMs and building blocks (self-attention, cross attention, KV caching etc.)
Floating-point, Fixed-point representations and Quantization concepts.
Experience with optimizing algorithms for AI hardware accelerators (like CPU/GPU/NPU).
Hands-on experience in C/C++ programming, Design Patterns and OS concepts.
Excellent analytical and debugging skills.
Exposure to shell scripts, python scripts, understanding of Linux/Windows systems and automation scripts/environment.
Good communication skills, presentation skills and should manage his/her tasks independently.
Ability to collaborate across a globally diverse team and multiple interests.
Preferred Qualifications
- Strong understanding of SIMD processor architecture and system design.
- Proficiency in object-oriented software development and familiarity
- Familiarity with Linux and Windows environment
- Strong background in kernel development for SIMD architectures.
- Familiarity with frameworks like llama.cpp, MLX, and MLC is a plus.
- Good knowledge of PyTorch, TFLite, and ONNX Runtime is preferred.
Minimum Qualifications:
- Bachelor's degree in Engineering, Information Systems, Computer Science, or related field and 4+ years of Software Engineering or related work experience.
Master's degree in Engineering, Information Systems, Computer Science, or related field and 3+ years of Software Engineering or related work experience.
OR
PhD in Engineering, Information Systems, Computer Science, or related field and 2+ years of Software Engineering or related work experience.
- 2+ years of work experience with Programming Language such as C, C++, Java, Python, etc.
Qualcomm expects its employees to abide by all applicable policies and procedures, including but not limited to security and other requirements regarding protection of Company confidential information and other confidential and/or proprietary information, to the extent those requirements are permissible under applicable law.
To all Staffing and Recruiting Agencies: Our Careers Site is only for individuals seeking a job at Qualcomm. Staffing and recruiting agencies and individuals being represented by an agency are not authorized to use this site or to submit profiles, applications or resumes, and any such submissions will be considered unsolicited. Qualcomm does not accept unsolicited resumes or applications from agencies. Please do not forward resumes to our jobs alias, Qualcomm employees or any other company location. Qualcomm is not responsible for any fees related to unsolicited resumes/applications.
If you would like more information about this role, please contact Qualcomm Careers.
3073180