Senior Data Engineer
Yahoo
Yahoo Mail is the ultimate consumer inbox with hundreds of millions of users. It’s the best way to access your email and stay organized from a computer, phone or tablet. With its beautiful design and lightning fast speed, Yahoo Mail makes reading, organizing, and sending emails easier than ever.
A little about Us:
As a global email provider, Yahoo Mail serves over 220 million monthly active users. We create technology that changes the internet while handling billions of inbound connections per day to manage trillions of messages requiring petabytes of efficient storage.
Yahoo Mail's vision: Be the best consumer email platform to help users run the business of their life.
The Yahoo Mail engineering team develops solutions powering our mail brands, including a next-generation infrastructure that we are moving entirely to a native public cloud architecture.
About the team:
The Mail Intelligence Org develops intelligent capabilities at scale to uncover interests, reveal habits, and personalize user journeys across Yahoo Mail and the entire Yahoo ecosystem.
We seek innovative, entrepreneurial, and passionate engineers dedicated to delivering exceptional user experiences. Your passion and ownership mindset are as vital as the high engineering standards, code quality, and world-class architectural skills we uphold.
We manage billions of mail messages with advanced backend systems and algorithms, including NLP, GenAI, and ML techniques. Our infrastructure extracts information, creates enhanced content, and integrates diverse data sources to amplify key insights. This work presents rewarding technical challenges for high-caliber engineers eager to tackle impactful problems.
Sounds exciting? Then come join us!
Responsibilities:
Contribute to the Mail Intelligence Mission: Support the development of AI/ML capabilities that enhance user personalization and insights within Yahoo Mail and the broader Yahoo ecosystem.
Innovate and Personalize: Assist in building scalable solutions that identify user interests and habits, contributing to personalized user experiences.
Data Pipeline and Insights Development: Design and implement robust data pipelines using services like Dataproc, Dataflow, and Composer. Streamline data processing and orchestration, use Data Lake and Pub/Sub for storage and streaming, and create insightful visualizations with Looker Studio to support AI/ML initiatives.
Collaboration: Work closely with cross-functional teams in the USA to integrate data solutions that enhance user experiences.
Data Governance: Implement robust data governance practices to maintain data quality and compliance across the organization.
Optimization: Continuously improve data workflows for efficiency and scalability.
Technical Strategy: Collaborate with tech leads to align data strategies with business goals.
Feedback and Recovery: Implement mechanisms to handle data processing failures gracefully.
Tradeoff Management: Balance cost, quality, and speed in data solutions to optimize performance.
A lot about YOU
Educational Background: You have a strong foundation in computer science, data engineering, or a related field, with a focus on big data technologies and cloud platforms.
Technical Skills: You are proficient in programming languages such as SQL, Python, or Java, and have experience with big data frameworks like Hadoop, Spark, or Kafka. You excel in using stream and batch processing frameworks and data visualization tools, and are skilled in implementing data governance practices.
Data Engineering Expertise: You have advanced skills in designing and implementing data pipelines, ensuring data quality and integrity.
AI Tools Usage: You have experience with AI development tools such as Cursor, Copilot, and Claude.
Problem-Solving Ability: You possess strong analytical skills and a knack for solving complex data challenges using innovative approaches.
Collaboration: You are a team player who can work effectively with cross-functional teams, including tech leads and architects, to deliver high-quality data solutions.
Passion for Innovation: You are enthusiastic about leveraging data engineering to drive innovation and enhance user experiences across Yahoo Mail and the broader ecosystem.
Attention to Detail: You maintain high standards for data quality and are committed to delivering robust, scalable solutions.
Continuous Learning: You are eager to stay updated with the latest advancements in data engineering and apply them to improve Yahoo's products and services.
Qualifications
Educational Requirement: Bachelor's degree in Computer Science, Data Engineering, or a related field; a Master's degree is a plus.
Experience: 5+ years in data engineering, big data processing, or a related field.
Technical Proficiency: Strong skills in SQL, Python, or Java, with experience in big data technologies like Hadoop, Spark, or Kafka. Proficient in stream and batch processing frameworks and data visualization tools. Skilled in implementing data governance practices.
Cloud Experience: Proficiency with public cloud platforms, especially Google Cloud Platform (GCP).
Problem-Solving Skills: Demonstrated ability to solve complex problems and implement efficient, scalable solutions.
Collaboration Skills: Experience working with international teams and cross-functional collaboration.
Communication Skills: Strong verbal and written communication skills for effective collaboration with US-based teams.
Continuous Improvement: Eagerness to learn and adapt to new data technologies and methodologies.
Preferred Qualifications
Mail Experience: Prior experience working with email systems or related technologies is a plus.
Personal Attributes: Self-driven and detail-oriented, with a passion for tackling challenges. Strong team spirit, excellent communication skills, and the ability to multitask and manage expectations effectively.
Cloud Experience: Experience with public cloud platforms, especially Google Cloud Platform (GCP), is preferred.
Technologies: Experience with technologies such as Dataproc, Dataflow, Composer, Data Lake, Pub/Sub, and Looker Studio.
Yahoo is proud to be an equal opportunity workplace. All qualified applicants will receive consideration for employment without regard to, and will not be discriminated against based on age, race, gender, color, religion, national origin, sexual orientation, gender identity, veteran status, disability or any other protected category. Yahoo will consider for employment qualified applicants with criminal histories in a manner consistent with applicable law. Yahoo is dedicated to providing an accessible environment for all candidates during the application process and for employees during their employment. If you need accessibility assistance and/or a reasonable accommodation due to a disability, please submit a request via the Accommodation Request Form (www.yahooinc.com/careers/contact-us.html) or call +1.866.772.3182. Requests and calls received for non-disability related issues, such as following up on an application, will not receive a response.
Yahoo has a high degree of flexibility around employee location and hybrid working. In fact, our flexible-hybrid approach to work is one of the things our employees rave about. Most roles don’t require specific regular patterns of in-person office attendance. If you join Yahoo, you may be asked to attend (or travel to attend) on-site work sessions, team-building, or other in-person events. When these occur, you’ll be given notice to make arrangements.
If you’re curious about how this factors into this role, please discuss with the recruiter.
Currently work for Yahoo? Please apply on our internal career site.