Senior EAS Observability Engineer
London, UK
Are you ready to make an impact at DTCC?
Do you want to work on innovative projects, collaborate with a dynamic and supportive team, and receive investment in your professional development? At DTCC, we are at the forefront of innovation in the financial markets. We're committed to helping our employees grow and succeed. We believe that you have the skills and drive to make a real impact. We foster a thriving internal community and are committed to creating a workplace that looks like the world that we serve.
Pay and Benefits:
- Competitive compensation, including base pay and annual incentive
- Comprehensive health and life insurance and well-being benefits
- Pension
- Paid Time Off and Personal/Family Care, and other leaves of absence when needed to support your physical, financial, and emotional well-being.
- DTCC offers a flexible/hybrid model of 3 days onsite and 2 days remote (onsite Tuesdays, Wednesdays and a third day unique to each team or employee).
The impact you will have in this role:
We are seeking a Senior Observability Engineer (Associate Director) to be part of the team leading enterprise adoption of modern observability capabilities across DTCC’s critical business applications. This is a hands-on engineering role focused on implementing observability standards, enabling application onboarding, and building the automation frameworks required to scale across a large portfolio of systems. Candidate will drive adoption of the enterprise observability platform (Grafana + Open Telemetry), lead migration from legacy monitoring tools, and ensure consistent, high-quality telemetry that enables end-to-end business process traceability.
Success in this role requires a strong foundation in observability engineering and telemetry data modeling, ensuring that metrics, logs, and traces are structured, consistent, and usable for both operational and analytical purposes.
Your Primary Responsibilities:
Observability Platform Adoption
- Lead applications onboarding to the enterprise observability platform
- Drive migration from legacy tooling to Grafana and Open Telemetry
- Enable scalable adoption across a large and diverse application portfolio
Automation & Enablement
- Design and build automation frameworks for application onboarding
- Deliver self-service capabilities, templates, and standardized configurations
- Ensure consistent implementation through reusable engineering patterns
Standards Implementation (OTel & Telemetry)
- Implement Open Telemetry instrumentation and semantic conventions
- Define and apply consistent labeling and telemetry standards to enhance APM
- Standardize metrics, logs, and traces (MELT) across applications
- Drive adoption through automation and structured onboarding processes
Telemetry Data Modeling & Observability Quality Governance
- Design and enforce structured telemetry models to ensure data is consistent and query able
- Define naming conventions, dimensions, and labeling strategies for observability data
- Enable end-to-end traceability and correlation across distributed systems
- Ensure telemetry supports analytics, reporting, and AIOps use cases
Legacy Migration & Target-State Adoption
- Support migration of applications from legacy observability configurations
- Identify and remediate gaps relative to target-state standards
- Provide targeted onboarding support for complex or high-priority workloads
Data Platform Observability
- Extend observability practices to data-centric applications and pipelines
- Partner with teams using platforms such as Snowflake to define observability patterns
- Enable monitoring of data quality, pipeline health, and analytics workflows
Technical Leadership & Influence
- Operate as a senior individual contributor influencing across teams
- Partner with application engineering, SRE, and platform teams
- Drive adoption through hands-on engineering, mentorship, and technical guidance
**NOTE: The Primary Responsibilities of this role are not limited to the details above. **
Qualifications:
- Minimum of 10 years of related experience
- Bachelor's degree preferred or equivalent experience
Talents Needed For Success:
- 10+ years of experience in distributed systems, SRE, or reliability engineering
- Strong hands-on experience with:
- Observability platforms (e.g., Grafana, Splunk, Dynatrace)
- Open Telemetry (instrumentation, traces, metrics, logs)
- Experience designing and structuring telemetry data (metrics, logs, traces, events)
- Ability to define data models, labeling strategies, and semantic standards as structured, data for observability and analytics
- Experience building automation frameworks or platform onboarding solutions
- Proven ability to drive adoption of standards across multiple teams
- Strong skills in performance analysis, troubleshooting, and system reliability practices.
- Proficiency in one or more: Python, Java, SQL, SPL, PromQL
- Strong problem-solving, communication, and collaboration skills
- Experience with Snowflake or similar data platforms, business data observability
- Experience applying data modeling and analytics principles to observability systems
- Familiarity with Grafana ecosystem (Tempo, Loki, Mimir, Grafana)
- Experience implementing Open Telemetry at scale
- Exposure to SLOs, AIOps, or observability-driven automation
- Experience in financial services or regulated environments
We offer top class training and development for you to be an asset in our organization!
We are an equal opportunity employer and value diversity at our company. We do not discriminate on the basis of race, religion, color, national origin, sex, gender, gender expression, sexual orientation, age, marital status, veteran status, or disability status. We will ensure that individuals with disabilities are provided reasonable accommodation to participate in the job application or interview process, to perform essential job functions, and to receive other benefits and privileges of employment. Please contact us to request accommodation.
DTCC proudly supports Flexible Work Arrangements favoring openness and gives people freedom to do their jobs well, by encouraging diverse opinions and emphasizing teamwork. When you join our team, you’ll have an opportunity to make meaningful contributions at a company that is recognized as a thought leader in both the financial services and technology industries. A DTCC career is more than a good way to earn a living. It’s the chance to make a difference at a company that’s truly one of a kind.
Learn more about Clearance and Settlement by clicking here.
Serves as a dedicated technology resource for advancing DTCC’s business opportunities and providing industry thought leadership for leveraging new technology. The goal of this new department is to partner internally with IT, our business and regulatory divisions and externally with clients, regulators, and fintech vendors, to help build new platforms and business models to advance DTCC’s mission to support the financial markets.
We are seeking a Senior Observability Engineer (Associate Director) to be part of the team leading enterprise adoption of modern observability capabilities across DTCC’s critical business applications. This is a hands-on engineering role focused on implementing observability standards, enabling application onboarding, and building the automation frameworks required to scale across a large portfolio of systems. Candidate will drive adoption of the enterprise observability platform (Grafana + Open Telemetry), lead migration from legacy monitoring tools, and ensure consistent, high-quality telemetry that enables end-to-end business process traceability.