Platform Engineering, Monitoring and Observability Lead - SRE Focus (Lead Systems Operations Engineer)

Wells Fargo

Wells Fargo

Operations, Software Engineering
Bengaluru, Karnataka, India
Posted on Sep 9, 2025

About this role:

“Wells Fargo is seeking a Lead Systems Operations Engineer. We believe in the power of working together because great ideas can come from anyone. Through collaboration, any employee can have an impact and make a difference for the entire company. Explore opportunities with us for a career in a supportive environment where you can learn and grow.”


In this role, you will:

  • Lead complex, broad impact initiatives including provision of high level systems consultation for the technology teams
  • Work as key participant in large scale planning of computer systems and network infrastructure for Systems Operations functional area
  • Review and analyze complex technical challenges, as well as escalated support issues related to core business solutions that require in depth evaluation of multiple factors, such as alternatives, enhancements, periodic systems reviews, or improvements to existing systems
  • Make decisions on technical changes and enhancements
  • Consult with engineering team on change design requiring solid understanding of technical process controls or standards that influence and drive new initiatives
  • Collaborate and consult with technical peers, colleagues, and mid to more experienced level managers to resolve systems support issues and achieve goals

Required Qualifications:

  • 5+ years of Systems Engineering, Technology Architecture experience, or equivalent demonstrated through one or a combination of the following: work experience, training, military experience, education


Desired Qualifications:

  • Lead the strategy and execution of monitoring and observability initiatives across infrastructure and applications.
  • Architect and maintain dashboards, alerts, and telemetry pipelines using tools like Grafana, Prometheus, and Elastic APM.
  • Integrate and optimize observability platforms including Splunk, AppDynamics, ThousandEyes, and ITRS Geneos.
  • Collaborate with SRE and DevOps teams to ensure system reliability, scalability, and performance.
  • Develop automation scripts in Python and Shell for data collection, analysis, and alerting.
  • Drive root cause analysis and incident response using observability data.
  • Evaluate and implement Gen AI solutions to enhance observability and predictive analytics.
  • Mentor junior engineers and promote best practices in monitoring and reliability engineering.
  • Bachelor’s or Master’s degree in Computer Science, Engineering, or related field.
  • 5+ years of experience in IT operations, with at least 3 years in a lead role focused on observability and SRE.
  • Proven expertise in tools such as:
  • Splunk, ITRS Geneos, Grafana, Prometheus, Elastic APM
  • ThousandEyes, AppDynamics
  • Strong scripting skills in:
  • Python (especially for data analytics and automation)
  • Shell scripting
  • Deep understanding of SRE principles including SLIs, SLOs, error budgets, and incident management.
  • Experience with cloud platforms (AWS, Azure, or GCP) and containerized environments (Kubernetes, Docker).
  • Certifications in observability tools or cloud platforms (e.g., Splunk Certified Admin, AWS Cloud Practitioner).
  • Experience with machine learning or Gen AI frameworks applied to observability (e.g., anomaly detection, predictive alerting).
  • Familiarity with CI/CD pipelines and infrastructure as code (Terraform, Ansible).
  • Strong analytical mindset with a passion for data-driven decision-making.
  • Excellent communication and stakeholder management skills.


Job Expectations:

  • The team operates on a 16x5 schedule, ensuring coverage across critical business hours and extended support windows.
  • Candidates must be willing to participate in weekend on-call rotations, providing support for high-priority incidents and system health checks.
  • As part of production management responsibilities, the lead is expected to be available during off-hours when necessary to support major incidents, deployments, or escalations.
  • Flexibility and responsiveness are key, especially in high-impact scenarios where rapid resolution is essential to maintaining system reliability and performance.

Posting End Date:

11 Sep 2025

*Job posting may come down early due to volume of applicants.

We Value Equal Opportunity

Wells Fargo is an equal opportunity employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, sexual orientation, gender identity, national origin, disability, status as a protected veteran, or any other legally protected characteristic.

Employees support our focus on building strong customer relationships balanced with a strong risk mitigating and compliance-driven culture which firmly establishes those disciplines as critical to the success of our customers and company. They are accountable for execution of all applicable risk programs (Credit, Market, Financial Crimes, Operational, Regulatory Compliance), which includes effectively following and adhering to applicable Wells Fargo policies and procedures, appropriately fulfilling risk and compliance obligations, timely and effective escalation and remediation of issues, and making sound risk decisions. There is emphasis on proactive monitoring, governance, risk identification and escalation, as well as making sound risk decisions commensurate with the business unit’s risk appetite and all risk and compliance program requirements.

Candidates applying to job openings posted in Canada: Applications for employment are encouraged from all qualified candidates, including women, persons with disabilities, aboriginal peoples and visible minorities. Accommodation for applicants with disabilities is available upon request in connection with the recruitment process.

Applicants with Disabilities

To request a medical accommodation during the application or interview process, visit Disability Inclusion at Wells Fargo.

Drug and Alcohol Policy

Wells Fargo maintains a drug free workplace. Please see our Drug and Alcohol Policy to learn more.

Wells Fargo Recruitment and Hiring Requirements:

a. Third-Party recordings are prohibited unless authorized by Wells Fargo.

b. Wells Fargo requires you to directly represent your own experiences during the recruiting and hiring process.