BECOME A member Get involved Donate Now!

Donate Now! BECOME A member

FAQ

AnitaB.org Talent Network

Connecting women in tech with the best professional opportunities!

0

COMPANIES

0

JOBS

My job alerts

Site Reliability Engineer II

Microsoft

Software Engineering, Other Engineering

Posted on Oct 29, 2025

Apply now

Site Reliability Engineer II

Bangalore, Karnataka, India

Save

Share job

Date posted

Oct 28, 2025

Job number

1898900

Work site

3 days / week in-office

Travel

0-25 %

Role type

Individual Contributor

Profession

Software Engineering

Discipline

Software Engineering

Employment type

Full-Time

Overview

The Production Engineering and Artificial Intelligence (AI) Group, part of the Linux Systems Group within Microsoft, plays a critical role in powering Azure Cloud. This team ensures that Azure operates with the latest version of Linux software at the highest levels of quality and performance, serving as the gatekeeper for production software. The team achieves this at Azure scale through efficient automation and by leveraging artificial intelligence to reduce the human effort required for these responsibilities. This is an excellent opportunity to join the Production Engineering and AI Group and contribute to the growth of Microsoft’s Azure Cloud infrastructure.

As a Site Reliability Engineer II, you will be responsible for ensuring that software deployments follow safe rollout processes while driving operational excellence. You will leverage technical expertise, telemetry analysis, and advanced artificial intelligence to maintain reliability and performance across large-scale systems.

Microsoft’s mission is to empower every person and every organization on the planet to achieve more. As employees we come together with a growth mindset, innovate to empower others, and collaborate to realize our shared goals. Each day we build on our values of respect, integrity, and accountability to create a culture of inclusion where everyone can thrive at work and beyond.

Qualifications

Required Qualifications:

4+ years technical experience in software engineering, network engineering, or systems administration
- OR Bachelor's Degree in Computer Science, Information Technology, or related field AND 1+ year(s) technical experience in software engineering, network engineering, or systems administration
- OR Master's Degree in Computer Science, Information Technology, or related field.
1+ years experience in Cloud Infrastructure and Data Center Expertise
- Managing public cloud infrastructure or large-scale data center setups.
- Site Reliability Engineering (SRE) principles.
- Safe deployment practices in hyper-scale data centers.
- Distributed systems designed for high availability and incident handling protocols.
1+ years experience in Programming and Automation Skills
- Python and Bash or PowerShell scripting and advances in cloud technologies.

Other Qualifications:

Ability to meet Microsoft, customer and/or govenment security screening requirements are required for this role.
These requirements include, but are not limited to the following specialized security screenings:
- Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.

Preferred Qualifications:

5+ years technical experience in software engineering, network engineering,
- OR systems administration OR Bachelor's Degree in Computer Science, Information Technology,
- OR related field AND 2+ years technical experience in software engineering, network engineering,
- OR systems administration
- OR Master's Degree in Computer Science, Information Technology,
- OR related field AND 1+ year(s) technical experience in software engineering, network engineering,
1+ year(s) people management experience.

#azurecorejobs

Responsibilities

Independently write code or scripts that automate the performance of scalable operations processes (e.g., monitoring, alerting, deploying products and updates) across components and features of products.
Create, test and deploy changes through a safe deployment process (SDP) and improve the observability, security, reliability and operability of the systems operating at hyper scale.
Use tools and processes to troubleshoot problems affecting the availability, security, reliability, performance of components, leveraging the AI capabilities
Enable the team to increase the velocity in which changes can reliably and safely deployed in production and monitors the effects of these changes.
Respond to incidents during regular on-call rotations and take appropriate action to mitigate impact. You will develop alerts and automated monitoring infrastructure to notify degradation in performance or availability and draw insights from this data to manage infrastructure in an optimal way

Benefits/perks listed below may vary depending on the nature of your employment with Microsoft and the country where you work.

Industry leading healthcare

Educational resources

Discounts on products and services

Savings and investments

Maternity and paternity leave

Generous time away

Giving programs

Opportunities to network and connect

Microsoft is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to age, ancestry, citizenship, color, family or medical care leave, gender identity or expression, genetic information, immigration status, marital status, medical condition, national origin, physical or mental disability, political affiliation, protected veteran or military status, race, ethnicity, religion, sex (including pregnancy), sexual orientation, or any other characteristic protected by applicable local laws, regulations and ordinances. If you need assistance and/or a reasonable accommodation due to a disability during the application process, read more about requesting accommodations.

Apply now

See more open positions at Microsoft

Privacy policy Cookie policy