System Development Engineer, AI/ML, Prime Video & Studios Core Tech, Prime Video & Studios Core Tech
Amazon
Description
Amazon Prime Video is a first-stop entertainment destination offering customers a vast collection of premium programming in one app available across thousands of devices. Prime members can customize their viewing experience and find their favorite movies, series, documentaries, and live sports from a vast catalog. All customers, regardless of whether they have a Prime membership or not, can rent or buy titles via the Prime Video Store, and can enjoy even more content for free with ads.
Are you interested in shaping the future of entertainment? Prime Video's technology teams are creating best-in-class digital video experience. We are building industry-leading Audio, Video and Language technology using GenAI to delight our customers and advance the state-of-the-art in the translation, transcription and generation of multingual cinematic content.
As part of the AI team in Amazon Prime Video, you will gather requirements from collaborators on research teams in computer vision, audio processing and language modeling and develop scripts and UI tools around creating data pipelines for model training and testing, data/model versioning, dataset curation, data annotations and fix tools, pipelines and QuickSight dashboards as needed. This is critical for our the core technology building. You’ll get to work on projects that are fast-paced, challenging, and varied. You’ll also be able to experiment with new possibilities, take risks, and collaborate with remarkable people.
We’ll look for you to bring your diverse perspectives, ideas, and skill-sets to make Prime Video even better for our customers. Your work will impact millions of our customers in the form of products and services that make use of speech and language technology on Prime Video content.
Key job responsibilities
a) gathering requirements from scientists
b) extracting data from various in-house and open-source domain
c) developing scripts and UI tools for creating data pipelines for model training and testing, data/model versioning to ensure model reproducibility, train/test dataset curation, data annotations, data cleaning (eliminating ground truth noise or inconsistencies in data)
d) fixing tools, pipelines and QuickSight dashboards as needed
e) creating Sagemaker batch transform scripts
f) automated monitoring of batch transform/aws batch jobs/chaining of dependent jobs
g) making science code deployment ready to hand over to engineering for deployment (creating docker images, setting up of deployment packages etc.)
h) Proficiency using numpy, pandas, kind of libraries, dealing with jsons, csv files, parquet format etc.
About the team
Amazon Prime Video is a first-stop entertainment destination offering customers a vast collection of premium programming in one app available across thousands of devices. Prime members can customize their viewing experience and find their favorite movies, series, documentaries, and live sports from a vast catalog. All customers, regardless of whether they have a Prime membership or not, can rent or buy titles via the Prime Video Store, and can enjoy even more content for free with ads. By 2026, Prime Video (PV) aspires to be the primary destination for customers to watch and the primary channel for partners to distribute premium cinematic content. The Content Reasoning, Enrichment & Localization (CoREL) science team’s mission is to develop AI to deeply understand the diverse facets of Prime Video's content across multiple languages, and to create content that power immersive, cinematic experiences. To drive this mission and delight our customers, the CoREL Science Bangalore team addresses the localization and accessiblility aspects by a) understanding fine-grained information contained in content to enable generation of localized assets in different languages, b) generating subtitles and accessibility assets like captions across different content types and languages, c) assessing quality of generated (1P) and partner submitted (3P) assets and metadata to ensure defect free customer experience on PV.