Network Development Engineer, Capacity Restoration Team
Amazon
Hyderabad, Telangana, India
Description
AWS Infrastructure Services owns the design, planning, delivery, and operation of all AWS global infrastructure. In other words, we’re the people who keep the cloud running. We support all AWS data centers and all of the servers, storage, networking, power, and cooling equipment that ensure our customers have continual access to the innovation they rely on. We work on the most challenging problems, with thousands of variables impacting the supply chain — and we’re looking for talented people who want to help.
You’ll join a diverse team of software, hardware, and network engineers, supply chain specialists, security experts, operations managers, and other vital roles. You’ll collaborate with people across AWS to help us deliver the highest standards for safety and security while providing seemingly infinite capacity at the lowest possible cost for our customers. And you’ll experience an inclusive culture that welcomes bold ideas and empowers you to own them to completion.
Network Development Engineers (NDEs) on CRT are Builders. You don't implement off-the-shelf solutions. You create new ones — solutions that go beyond traditional industry approaches.
As an NDE, you will work at the component level of CRT's automation and tooling stack. You will build, improve, and operate tools that accelerate capacity restoration. You will solve straightforward technical problems, with guidance from teammates when needed.
Your scope covers small to mid-size components, features, and processes. You will actively learn the team product architecture and apply it to build working solutions. You will create secure, stable, testable, and maintainable code with minimal defects.
This role bridges operations and engineering. You will both operate the network and write code to make operations faster. NDEs on CRT do not depend on vendors for design or support.
Key job responsibilities
- Build and improve automation components that accelerate link and device restoration across multiple network fabrics.
- Troubleshoot network failures at the component level using workflow orchestration engines, deployment automation tools, self-service automation frameworks, link - testing services, operational event monitoring systems, and network health monitoring systems.
- Develop link test plans and structured diagnostic output that enable system-guided troubleshooting.
- Extend and integrate with existing tooling to improve monitoring and alerting for out-of-service capacity.
- Create and maintain team-level operational metrics, monitors, and processes tied to CRT KPIs: MTTR, restoration success rate, and backlog depth.
- Write clean, testable code (Python preferred) with documentation that supports future engineers.
- Contribute to design reviews and operational improvement discussions.
- Participate in on-call rotations. Support follow-the-sun coverage across Seattle and Bangalore.
- Document systems, tools, and runbooks clearly. Keep documentation current as systems evolve.
- Actively learn team product architecture and apply that knowledge to deliver working solutions.
A day in the life
- Develop and improve automation scripts to accelerate capacity restoration workflows
- Troubleshoot complex network issues at the component level across RB, BB, and FNC fabrics
- Implement small-to-mid-scope network processes or tools (e.g., link test automation, health check scripts)
- Integrate with existing tooling ecosystem (workflow orchestration engines, deployment automation tools, self-service automation frameworks, link testing services)
- Participate in design reviews and operational improvement discussions
- Document systems clearly for future engineers
Participate in on-call rotations
About the team
The Capacity Restoration Team (CRT) sits within Backbone, Enterprise, Regional Engineering (BERE) organization. Our mission is to own end-to-end restoration of out-of-service inter-metro and intra-metro network capacity.
CRT reduces operational backlog, improves network health monitoring systems, and drives time-to-remediate (TTR) down. Today, a significant network capacity is out of service at any given time. The operational workload is capacity related and heavily manual and we are building the team and automation to change that.
We operate across Seattle, Dublin, Bangalore, and Sydney for follow-the-sun coverage. CRT acts as the connective layer between Engineering, Operations, Tooling, and Software teams.
ABOUT AWS:
Diverse Experiences
AWS values diverse experiences. Even if you do not meet all of the qualifications and skills listed in the job description, we encourage candidates to apply. If your career is just starting, hasn’t followed a traditional path, or includes alternative experiences, don’t let it stop you from applying.
Why AWS
Amazon Web Services (AWS) is the world’s most comprehensive and broadly adopted cloud platform. We pioneered cloud computing and never stopped innovating — that’s why customers from the most successful startups to Global 500 companies trust our robust suite of products and services to power their businesses.
Work/Life Balance
We value work-life harmony. Achieving success at work should never come at the expense of sacrifices at home, which is why flexible work hours and arrangements are part of our culture. When we feel supported in the workplace and at home, there’s nothing we can’t achieve in the cloud.
Inclusive Team Culture
Here at AWS, it’s in our nature to learn and be curious. Our employee-led affinity groups foster a culture of inclusion that empower us to be proud of our differences. Ongoing events and learning experiences, including our Conversations on Race and Ethnicity (CORE) and AmazeCon (gender diversity) conferences, inspire us to never stop embracing our uniqueness.
Mentorship and Career growth
We’re continuously raising our performance bar as we strive to become Earth’s Best Employer. That’s why you’ll find endless knowledge-sharing, mentorship and other career-advancing resources here to help you develop into a better-rounded professional.