Principal Software Developer - Observability Platform
Oracle
We are seeking an experienced Software Engineer 4 (IC4) to build and evolve backend services and data pipelines that power our observability platform (metrics, logs, traces, and events). You will design and implement reliable, scalable microservices and streaming components, improve operational excellence, and partner with cross-functional teams to deliver high-quality capabilities with strong performance, resilience, and security.Design, develop, troubleshoot and debug software programs for databases, applications, tools, networks etc.
Only Oracle brings together the data, infrastructure, applications, and expertise to power everything from industry innovations to life-saving care. And with AI embedded across our products and services, we help customers turn that promise into a better future for all. Discover your potential at a company leading the way in AI and cloud solutions that impact billions of lives.
True innovation starts when everyone is empowered to contribute. That’s why we’re committed to growing a workforce that promotes opportunities for all with competitive benefits that support our people with flexible medical, life insurance, and retirement options. We also encourage employees to give back to their communities through our volunteer programs.
We’re committed to including people with disabilities at all stages of the employment process. If you require accessibility assistance or accommodation for a disability at any point, let us know by emailing accommodation-request_mb@oracle.com or by calling 1-888-404-2494 in the United States.
Oracle is an Equal Employment Opportunity Employer. All qualified applicants will receive consideration for employment without regard to race, color, religion, sex, national origin, sexual orientation, gender identity, disability and protected veterans’ status, or any other characteristic protected by law. Oracle will consider for employment qualified applicants with arrest and conviction records pursuant to applicable law.
At Oracle Health, we build technology that helps clinicians, patients, and healthcare organizations deliver better outcomes. Joining the Oracle Health and Analytics organization means working on platforms that operate at enterprise scale, where reliability, performance, security, and compliance are foundational. On the Observability team, you’ll help improve the visibility and operational health of services that support critical healthcare workflows—by building systems that enable teams to detect issues early, respond effectively, and continuously improve.
Career Level - IC4
What you will do:
• Design, build, and maintain backend microservices and APIs supporting observability use cases (ingestion, processing, storage, query, and alerting).
• Develop and operate streaming/data pipeline components for high-throughput event processing (e.g., Kafka-based ingestion and processing).
• Implement reliable data handling patterns: backpressure, retries, idempotency, ordering guarantees, and dead-letter strategies.
• Integrate and optimize caching and state management using Redis (or equivalent) for latency-sensitive workflows.
• Ensure services meet SLOs through performance tuning, capacity planning, and resilience engineering (timeouts, circuit breakers, rate limiting).
• Improve developer and operational productivity through instrumentation, dashboards, logging standards, and runbooks.
• Collaborate with SRE/DevOps, product, and peer engineering teams to deliver features end-to-end, including production readiness.
• Participate in code reviews, design reviews, and incident follow-ups (postmortems) to drive continuous improvement.
• Mentor junior engineers and contribute to technical direction and best practices across the team.
Required Qualifications
• 6+ years of professional software engineering experience (or equivalent).
• Strong proficiency in Java (or similar JVM language such as Kotlin/Scala) and building backend systems.
• Hands-on experience with microservice frameworks such as Spring Boot and/or Micronaut (or comparable frameworks).
• Strong experience with Apache Kafka (producers/consumers, consumer groups, partitions, offsets, schema evolution, DLQs). Experience with OCI Streaming is a plus.
• Experience with Redis (caching, pub/sub or streams, data modeling, performance considerations). Experience with OCI Cache is a plus.
• Demonstrated experience designing and operating data pipelines (streaming and/or batch), including data validation, enrichment, and transformation.
• Solid understanding of distributed systems concepts (consistency, availability, fault tolerance, concurrency).
• Proficiency with testing practices (unit/integration, contract testing) and CI/CD fundamentals.
Preferred Qualifications
• Experience building observability platforms or components (metrics/logs/traces/events), OpenTelemetry familiarity.
• Experience with data stores and persistence patterns; experience with Oracle Autonomous Database (ADB) is a plus.
• Experience with schema management and serialization (Avro/Protobuf/JSON Schema) and schema registries.
• Containerization and orchestration experience (Docker, Kubernetes).
• Experience with infrastructure-as-code and automation (Terraform, Ansible).
• Cloud-native experience and production operations (on-call, incident management, RCA/postmortems).
• Security-aware development (authn/authz, secrets management, secure coding practices).
• Strong written design documentation skills and ability to influence technical direction.
Skills & Competencies
• Systems thinking: designs that scale, degrade gracefully, and are operable.
• Ownership mindset: drives work from design to production and support.
• Pragmatic engineering: balances correctness, delivery, and maintainability.
• Collaboration: works effectively across teams and time zones.
• Communication: clear design rationale, tradeoffs, and documentation.
Typical Tech Stack
• Languages: Java (Kotlin/Scala optional)
• Frameworks: Spring Boot / Micronaut
• Streaming: Kafka, OCI Streaming
• Cache/State: Redis, OCI Cache
• Data/Storage: Oracle Autonomous Database (ADB)
• Observability: OpenTelemetry, Prometheus/Grafana, Open Search, OCI Logging
• Deployment: Docker/Kubernetes, CI/CD pipelines
• Infrastructure as Code / Automation: Terraform, Ansible