AnitaB.org Talent Network

Connecting women in tech with the best professional opportunities!

Data Engineer, Seller Partner Trust and Store Integrity Science

Amazon

Software Engineering, Data Science
Seattle, WA, USA
Posted on Apr 4, 2026

Description

Do you want to join an innovative team of scientists and engineers who use machine learning and artificial intelligence to help Amazon provide the best customer experience by preventing eCommerce fraud? Are you excited by the prospect of building scalable data infrastructure and pipelines that process terabytes of data, enabling state-of-the-art algorithms to solve real-world problems? Do you like to own end-to-end data systems and directly impact the team's ability to deliver insights and models that drive company profitability? Do you enjoy collaborating in a diverse team environment?

If so, you may be a great fit for the Amazon Selling Partner Trust & Store Integrity Science Team. We are looking for a talented data engineer who is passionate about building robust data platforms and pipelines that empower scientists to develop advanced machine learning systems, helping to manage the safety of millions of transactions every day and to scale our operations through automation.

Key job responsibilities

DATA INFRASTRUCTURE & PIPELINE DEVELOPMENT
- Design, build, and maintain scalable data pipelines that support multiple ML model training and inference workflows
- Develop and optimize ETL processes to ingest, transform, and prepare terabytes of data from diverse sources for model consumption
- Implement robust data quality checks and monitoring systems to ensure data integrity across all pipelines

ML OPERATIONS SUPPORT
- Build and maintain infrastructure for model training pipelines, including feature engineering, data versioning, and experiment tracking
- Design and implement scalable inference pipelines that serve predictions for millions of transactions with low latency and high reliability
- Collaborate with scientists to productionize ML models, translating research code into production-ready systems

SYSTEM PERFORMANCE & RELIABILITY
- Optimize data processing workflows for cost efficiency and performance, managing compute and storage resources effectively
- Implement monitoring, alerting, and logging systems to ensure pipeline reliability and quick issue resolution
- Maintain comprehensive documentation of data schemas, pipeline architectures, and operational procedures

CROSS-FUNCTIONAL COLLABORATION
- Partner closely with scientists to understand data requirements and translate them into technical solutions
- Work with stakeholders to define data SLAs and ensure systems meet business needs
- Provide technical guidance on data architecture decisions and best practices