Senior Software Engineer - Storage & Compute Stability, Pune
Bloomberg
Software Engineering
Pune, Maharashtra, India
Posted on Sep 12, 2025
About the Team
The Storage and Compute Stability Team is a trusted partner in ensuring the reliability, performance, and security of Bloomberg’s cloud storage and compute infrastructure. We operate at the intersection of infrastructure, software, and services, proactively identifying, solving, and preventing issues before they impact our users.
Our focus is on streamlining processes, driving automation, and serving as a bridge between product teams and stakeholders. This enables Bloomberg’s engineers to innovate rapidly, while maintaining stability at scale. We follow agile practices and thrive in a collaborative environment where code reviews, design discussions, and brainstorming are part of our daily rhythm. The team is driven by curiosity, creativity, and a shared passion for building efficient, resilient systems.
This isn’t just another operations role: you’ll be embedded at the core of Bloomberg’s infrastructure. Our team spans infrastructure, software, and services, supporting both short-term needs and long-term strategic investments.
You’ll have the opportunity to:
- Work on critical infrastructure and help define how it evolves
- Take on meaningful projects that balance immediate impact with sustainable improvements
- Join a culture that values innovation, automation, and continuous improvement
We'll trust you to:
- Ensure system reliability and performance by monitoring, troubleshooting, and optimizing compute and storage services
- Proactively identify issues and trends to prevent outages, reduce mean time to recovery (MTTR), and improve overall service availability
- Collaborate with product owners, developers, and infrastructure teams to deliver scalable, long-term solutions
- Automate operational processes such as deployments, monitoring, maintenance, and capacity management
- Develop and maintain runbooks, reproducers, and documentation to support knowledge-sharing and workflow efficiency
- Participate in on-call rotations to support critical infrastructure and respond to incidents
- Contribute to infrastructure lifecycle management, including capacity forecasting, proactive refresh planning, and upgrades
- Continuously explore opportunities to improve team processes and system stability
What we value:
Our work is guided by key principles that define how we operate:
- Expertise – We invest in deep technical knowledge to solve complex infrastructure challenges
- Proactivity – We anticipate issues before they occur and design systems to withstand failure
- Collaboration – We build strong relationships with product teams and stakeholders to deliver end-to-end solutions
- Efficiency – We reduce manual work through thoughtful automation and streamlined processes
- Documentation – We believe in capturing and sharing knowledge to make systems transparent and maintainable
What makes you successful:
- Strong communication and collaboration skills; the ability to explain technical concepts to diverse audiences
- Self-motivation and autonomy; you take ownership of problems and drive them to resolution
- Passion for continuous learning and working across a broad spectrum of systems and technologies
- Comfort working in an agile environment, including daily standups, sprint planning, and code reviews
- Curiosity, adaptability, and eagerness to work across the entire infrastructure stack
You'll need to have:
- 5+ years of demonstrated experience working with programming languages such as C, C++, and Python, and the willingness to work with Python as your primary language on the job
- Experience with monitoring, logging, and observability tools
- Understanding of containers and orchestration technologies
- Solid knowledge of networking, operating systems, and distributed systems concepts
- Experience participating in incident response and on-call support for production systems
We'd love to see:
- Familiarity with cloud and storage platforms (e.g., OpenStack, Ceph) and related compute/storage services
- Experience with infrastructure-as-code tools (e.g., Terraform, Ansible)
If this sounds like you:
Apply if you think we're a good match. We'll get in touch to let you know what the next steps are, but in the meantime feel free to have a look at this:
Tech at Bloomberg - https://www.bloomberg.com/company/values/tech-at-bloomberg/