Data Engineer, PXT Central Science
Amazon
Description
The PXT Central Science team is looking for a Data Engineer. This individual will join a team of economists and scientists to own and accelerate science and analytics in our rapid employee intelligence workstream. This workstream's suite of models identifies causal factors driving changes in employee sentiment, actions, and business outcomes.
Key job responsibilities
We are looking for a data engineer with expertise in complex data environments. You will be responsible for enhancing our existing data architecture to further standardize metrics and definitions, building and testing new features, developing end-to-end data engineering solutions for complex analytical problems, and collaborating with economists, data scientists, and software engineers to translate data into actionable insights. Specific responsibilities include:
- Data Pipeline Development & Management: Design and maintain scalable data pipelines using native AWS services (Glue, EMR, Lambda); Build robust monitoring and error handling systems for data workflows; Optimize pipeline performance, reliability, and cost efficiency; Develop feature engineering frameworks and automated data transformations; Create clear documentation for pipeline architecture and operations.
- Advanced Data Integration: Design and implement pipelines for diverse data types (text, image, audio); Build scalable feature extraction and processing frameworks; Develop robust data quality and validation checks; Create flexible schemas to support evolving data requirements; Shape data strategy and modeling approaches across teams.
- Cross-team Collaboration: Partner with economics and data science teams to understand analytical requirements; Work closely with software engineering teams to ensure seamless integration with existing systems; Collaborate with infrastructure teams to optimize resource utilization across interconnected AWS accounts; Participate in technical design reviews and architecture discussions.
- Analytics Platform Management: Maintain and enhance layered data systems used by economists and scientists; Build automated reporting solutions for senior leadership and other stakeholders; Implement version control and change management processes for analytical environments.
- Technical Environment: Work across multiple interconnected AWS accounts and services; Implement security best practices for cross-account data access; Design solutions that scale across multiple regions and business units.
About the team
The Central Science Team within Amazon’s People Experience and Technology org (PXTCS) uses economics, behavioral science, statistics, machine learning, and Generative AI to proactively identify mechanisms and process improvements that simultaneously improve Amazon and the lives, well-being, and value of work for Amazonians. We are an interdisciplinary team that combines talent from science, engineering, and UX to develop and deliver solutions that measurably achieve this goal.