DevOps Engineer, SCOT Support India, FO Support
Amazon
Software Engineering, Customer Service
Hyderabad, Telangana, India
Description
Amazon's Supply Chain Optimization Technologies (SCOT) organization builds the systems that promise, plan, and execute the delivery of millions of packages to customers worldwide — faster and cheaper every day. Our systems handle millions of requests per second and make business decisions impacting billions of dollars annually. As we expand into new geographies and grow the complexity of our transportation network, we need engineers who thrive on solving hard problems at massive scale.
We are looking for a DevOps Engineer to join our Fulfillment Optimization support team. In this role, you will be the frontline technical expert responsible for keeping mission-critical, high-volume systems running smoothly. You will deep dive into operational issues across multiple distributed systems, identify root causes, drive resolutions, and build the automation and tooling that prevents problems from recurring.
This is not a passive monitoring role. You will develop subject matter expertise across key areas of our fulfillment stack, lead incident response, author runbooks, improve operational processes, and mentor junior engineers. You will work cross-functionally with software development teams to improve system supportability, availability, and performance. This team provides 12x7 on-call support on a rotation basis.
What You Will Do
- Troubleshoot and resolve technical issues across distributed systems, often with limited or no existing documentation
- Serve as a technical point of contact within your area of expertise for your team and partner engineering groups
- Lead incident response efforts — drive root cause analysis, author post-incident reviews, and implement preventive mechanisms
- Identify operational trends and problems before they impact customers; define and implement proactive monitoring and alerting
- Build automation, tooling, and scripts to improve operational efficiency and reduce manual toil
- Author, maintain, and review technical documentation including runbooks, SOPs, and troubleshooting guides
- Mentor junior team members and assist with onboarding and hiring activities
- Lead internal team projects and deliver on defined goals and timelines
- Partner across teams on tactical and strategic initiatives to improve system reliability and customer experience
- Use data to identify and drive development of new support mechanisms, processes, and tools
Key job responsibilities
- Maintain and operate products and systems within the scope of your team, performing change management activities independently
- Participate in 12x7 on-call rotation, managing incidents through to resolution or appropriate escalation
- Contribute to Correction of Errors (COEs) and support retrospectives
- Influence issue prioritization, best practices, and operational standards within the team