DevOps Engineer - Site Reliability Engineering team
SAP
We help the world run better
At SAP, we enable you to bring out your best. Our company culture is focused on collaboration and a shared passion to help the world run better. How? We focus every day on building the foundation for tomorrow and creating a workplace that embraces differences, values flexibility, and is aligned to our purpose-driven and future-focused work. We offer a highly collaborative, caring team environment with a strong focus on learning and development, recognition for your individual contributions, and a variety of benefit options for you to choose from.
Important information:
- This is a hybrid role based out of SAP Montreal office, working in-office with the team 3 days per week.
- Candidates must be legally entitled to work in Canada at the time of application. This position is not eligible for employer-sponsored work authorization (e.g., LMIA or other immigration support).
What you'll do
We are looking for an engineer to join an already established SRE team for the SAP Business Technology Platform.
As a Site Reliability Engineer, you will have the opportunity to operate and support business critical Cloud services. As part of your daily job, you will proactively monitor the service behavior and identify areas for improvement. You will participate in the development of tools for monitoring and troubleshooting cloud services built on latest open source and SAP technologies, following SRE principles.
Responsibilities:
- Act as technical expert during Live site incidents (downtimes of supported services in scope), investigate and solve incidents on a deep technical level.
- Drive root cause analysis and follow-up improvements to prevent issues from reoccurring.
- Perform in-depth troubleshooting and log analysis to identify and solve complex issues in accordance with internal and external SLAs.
- Build software-based solutions to address improvements in service reliability and stability.
- Enhance infrastructure and platform monitoring by gathering system metrics (4 Golden Signals) and implementing tools for recovery.
- Integrate and collaborate closely with development teams and work with them on outputs from Postmortems and product improvements.
- Learn new technologies and keep up to date with latest development increments.
- Create and maintain technical documentation.
- Define, advocate, apply SRE best practices.
- Participate in the on-call rotation (follow the sun approach) to react to major incidents. On-call has a special compensation package.
What you bring
- Experience with Kubernetes and good understanding of container technologies.
- 3 + years experience in SRE.
- Understanding of modern cloud architectures (experience with Cloud Platforms such as AWS, Azure, GCP are a plus).
- Scripting skills, CI/CD (Concourse, Github Actions and ArgoCD are a plus) - enthusiasm for automation - make the computers do the work for
- you.
- Working efficiently in emergency situations. Affinity to quickly analyze and solve problems in a global team setup.
- Excellent team player, passionate about his/her work, self-motivated and driven.
- Excellent communication skills - precise, based on facts.
- Fluency in English.
- Preferred Additional Skills and Competencies:
- Coding experience with Python, Bash, GO
- CKA/CKAD/CKS certifications
- Experience with Unix/Linux operating system
- Experience with modern monitoring, logging, and alerting tools (Grafana, Prometheus, Kibana, Loki, Splunk On-Call, Dynatrace)
- Security best practices for application development and operations in a public Cloud Environment
- Contribution to open-source projects
Meet the team
The Reliability Engineering organization provides multitude of products and services related to operations and continuity of business delivery.
The Site Reliability Engineering teams make the SAP Business Technology Platform run better by providing 24x7 deep technical coverage for Incident Management (Outages and other incidents with major customer impact) applying SRE principles. We share a Live Site First culture and care for the business continuity of our customers running mission critical applications in the Cloud.
Bring out your best
SAP innovations help more than four hundred thousand customers worldwide work together more efficiently and use business insight more effectively. Originally known for leadership in enterprise resource planning (ERP) software, SAP has evolved to become a market leader in end-to-end business application software and related services for database, analytics, intelligent technologies, and experience management. As a cloud company with two hundred million users and more than one hundred thousand employees worldwide, we are purpose-driven and future-focused, with a highly collaborative team ethic and commitment to personal development. Whether connecting global industries, people, or platforms, we help ensure every challenge gets the solution it deserves. At SAP, you can bring out your best.
We win with inclusion
SAP’s culture of inclusion, focus on health and well-being, and flexible working models help ensure that everyone – regardless of background – feels included and can run at their best. At SAP, we believe we are made stronger by the unique capabilities and qualities that each person brings to our company, and we invest in our employees to inspire confidence and help everyone realize their full potential. We ultimately believe in unleashing all talent and creating a better and more equitable world.
SAP is proud to be an equal opportunity workplace and is an affirmative action employer. We are committed to the values of Equal Employment Opportunity and provide accessibility accommodations to applicants with physical and/or mental disabilities. If you are interested in applying for employment with SAP and are in need of accommodation or special assistance to navigate our website or to complete your application, please send an e-mail with your request to Recruiting Operations Team: Careers@sap.com.
For SAP employees: Only permanent roles are eligible for the SAP Employee Referral Program, according to the eligibility rules set in the SAP Referral Policy. Specific conditions may apply for roles in Vocational Training.
EOE AA M/F/Vet/Disability:
Qualified applicants will receive consideration for employment without regard to their age, race, religion, national origin, ethnicity, age, gender (including pregnancy, childbirth, et al), sexual orientation, gender identity or expression, protected veteran status, or disability.
SAP believes the value of pay transparency contributes towards an honest and supportive culture and is a significant step toward demonstrating SAP’s commitment to pay equity. SAP provides the annualized compensation range inclusive of base salary and variable incentive target for the career level applicable to the posted role. The targeted combined range for this position is 92200 - 156800(CAD) CAD. The actual amount to be offered to the successful candidate will be within that range, dependent upon the key aspects of each case which may include education, skills, experience, scope of the role, location, etc. as determined through the selection process. Any SAP variable incentive includes a targeted dollar amount, and any actual payout amount is dependent on company and personal performance. Please reference this link for a summary of SAP benefits and eligibility requirements: www.SAPNorthAmericaBenefits.com.
Due to the nature of the role, which involves global interactions with SAP entities, as well as with employees and stakeholders in Canada, functional proficiency in English is required for positions based in the Quebec.
Requisition ID: 428172 | Work Area: Software-Development Operations | Expected Travel: 0 - 10% | Career Status: Professional | Employment Type: Regular Full Time | Additional Locations: #LI-Hybrid