Senior Data Management Professional - Content Indexing
Bloomberg
Bloomberg runs on data. Our products are fueled by powerful information. We combine data and context to paint the whole picture for our clients, around the clock – from around the world. In Data, we are responsible for delivering this data, news and analytics through innovative technology - quickly and accurately. We apply problem-solving skills to identify innovative workflow efficiencies, and we implement technological solutions to enhance our systems, products and processes - all while providing customer support to our clients.
Our Team:
The Content News Indexing team plays a critical role in managing one of Bloomberg’s most important financial classification systems for news. Our mandate is to strengthen Bloomberg’s leadership in the global financial news market.
We leverage both proprietary and open-source technologies to automatically retrieve, parse, organize, and tag news content from a wide range of sources, including social media platforms, news feeds, and websites. This content is delivered through the Bloomberg Professional Service.
In addition, we design, build, and maintain a comprehensive taxonomy of classification tags, developed in alignment with the evolving needs of our clients. This work ensures that Bloomberg’s news products remain precise, reliable, and indispensable to financial professionals worldwide.
Our team’s work is practical, impactful, and essential to delivering the accuracy and speed that define Bloomberg’s reputation in financial news.
The Role:
In this role, you will dive deep into complex annotation outputs. This requires understanding the data requirements, specifying the modeling needs of datasets, and using the existing tech stack to build efficient data ingestion workflows and pipelines. You will implement technical solutions using programming, machine learning, AI, and human-in-the-loop approaches to make sure our annotation data is fit for purpose for our AI Engineering teams.
We’ll trust you to:
Analyse and propose improvements to the annotation pipeline and build infrastructure to make those solutions come to life
Work across numerous groups in product, engineering, and data teams to build a pipeline that will deliver high-quality annotations to train NLP models
Apply problem-solving and critical thinking, with a focus on innovation and continuous improvement, to create pipelines that track the performance of the news classification taggers
Incorporate machine learning and statistics to detect anomalies and drive quality improvement in areas such as accuracy, completeness, consistency, and reliability
Conduct a fit-for-purpose assessment of customers internal and external to Bloomberg to develop a quality strategy
Utilize technical skills and Data Science principles to conduct ad-hoc analysis that addresses business needs and uncovers trends in the news classification data
You’ll Need to Have*:
Bachelor’s degree or equivalent experience in Computational Linguistics, Computer Science, Quantitative Finance, or a related technical field or background
4+ years of programming experience in a development or production environment
Proficiency using scripting languages (preferably Python) to query and interrogate datasets
Understanding of Machine Learning, applied statistics and data analytics
Strong problem-solving skills, particularly to modify and improve processes and workflows
Excellent written and verbal communication skills to explain technical processes and solutions to business partners and management
Strong attention to detail and demonstrated decision-making and problem-solving skills
Ability to work independently and in a distributed team environment, and to influence others and lead change
We’d love to see:
Experience working with annotation schemas, edge cases, guideline development and maintenance, and semantic analysis
Experience transforming workflows into more timely and efficient processes
Experience implementing high-volume, low-latency ETL pipelines
Experience working with human-in-the-loop workflows
Experience using native language skills to capture various forms of linguistic utterances with high accuracy
Experience mentoring and teaching data skills to others