Senior Cloud Software Engineer - Storage
Microsoft
Taipei, Taipei City, Taiwan
Overview
Microsoft Silicon and Cloud Hardware Infrastructure Engineering (SCHIE) is the team behind Microsoft’s expanding Cloud Infrastructure and is responsible for powering Microsoft’s “Intelligent Cloud” mission. SCHIE delivers the core infrastructure and foundational technologies for more than 200 Microsoft online businesses globally, including Bing, MSN, Office 365, Xbox Live, Skype, OneDrive and the Microsoft Azure platform, with our server and data center infrastructure, security and compliance, operations, globalization, and manageability solutions. Our focus is on smart growth, high efficiency, and delivering a trusted experience to customers and partners worldwide. We are looking for passionate, high-energy engineers to help achieve that mission.
As Microsoft's cloud business continues to grow, the ability to deploy new offerings and hardware infrastructure on time, in high volume, with high quality and at the lowest cost is of paramount importance. To achieve this goal, the Silicon and Cloud Hardware Infrastructure Engineering (SCHIE) team is instrumental in defining and delivering measures of success for hardware design, qualification, fleet support, scale, and sustainability related to Microsoft cloud hardware.
The Azure Memory and Storage Center of Excellence (AMS CoE) is part of the SCHIE organization and focuses on the memory and storage devices going into cloud hardware servers. AMS provides memory and storage solutions to Azure and drives memory and storage suppliers to deliver high-quality products that meet our requirements.
We are looking for an experienced, hands-on Software Engineer in SSD/HDD solutions for fleet health, with a strong passion for customer-focused solutions and the insight and industry knowledge to architect and specify hardware storage solutions that optimize quality, reliability, cost, and performance.
#SCHIE #HIFE
Qualifications
Required Qualifications:
- Bachelor's Degree in Computer Science or related technical field AND 4+ years technical engineering experience with coding in languages including, but not limited to, C, C++, C#, Java, JavaScript, or Python OR equivalent experience
- 5+ years of software/firmware engineering development experience
- Ability to analyze storage system solutions and drive toward recommendations based on data and objective reasoning
- Ability to lead collaborative technical projects from conception to successful implementation
- Demonstrable organizational, problem-solving and prioritization skills
- Ability to deal with ambiguity, resolve conflicts, prioritize multiple strategic and tactical options and drive issues to closure without compromising on quality
- Proven track record of successfully influencing others at all levels of the organization
Preferred Qualifications:
- Experience with computer/server storage architecture and development of SSD-based storage system solutions
- Experience with distributed/cloud computing technologies and services (Spark, Hadoop, Azure, AWS, Cosmos)
- Experience with scripting and query languages such as PowerShell, SQL and Kusto queries (Azure Data Explorer)
- Experience with visualization tools, such as Power BI
- Experience in machine learning (practical ML models with real-world application) and ML tools for modeling and analysis (PyTorch, TensorFlow, scikit-learn), including building pipelines and training and deploying models
- Experience in HDD hardware or firmware design, architecture, and development
Ability to meet Microsoft, customer and/or government security screening requirements is required for this role. These requirements include but are not limited to the following specialized security screenings: Microsoft Cloud Background Check: This position will be required to pass the Microsoft Cloud Background Check upon hire/transfer and every two years thereafter.
Responsibilities
- Design and build infrastructure for storage devices at scale
- Develop scalable live monitoring capabilities, failure detection and prediction algorithms for storage devices
- Investigate, triage and root-cause SSD/HDD-related failures in Azure solutions
- Build automation for operations of storage devices
- Collaborate with suppliers to design reliable, high-performance, high-quality storage devices
- Develop ML algorithms for failure prediction
- Analyze data to identify, prototype, and drive the implementation of technical and process improvements to increase the predictability, agility, and quality of Azure systems
- Actively support Azure service stakeholders