Associate Director of Platform Engineering and AI Systems
University of Chicago
Department
BSD CTD - Platform Engineering
About the Department
broader research community to use data at scale to pursue scientific inquiry and accelerate discovery. Learn more at https://gdc.cancer.gov/, https://gen3.org/, https://stats.gen3.org/, https://m3initiative.uchicago.edu/, and https://ctds.uchicago.edu/.
This at-will position is wholly or partially funded by contractual grant funding which is renewed under provisions set by the grantor of the contract. Employment will be contingent upon the continued receipt of these grant funds and satisfactory job performance.
Job Summary
The Associate Director of Platform Engineering and AI Systems leads the design, operation, and continual improvement of large-scale cloud and on-premises data infrastructure supporting advanced AI, bioinformatics, and data commons and data mesh research at the University of Chicago's Center for Translational Data Science. This role combines deep technical expertise, strategic architectural vision, and hands-on operational leadership to ensure reliable, secure, and innovative platform services. The ideal candidate will oversee engineering teams and project portfolios, drive technology alignment with scientific priorities, and collaborate with internal, external, and sponsor stakeholders to accelerate impactful data-driven discovery.
Responsibilities
Lead Platform Engineering and Cloud Operations teams supporting multiple public & private cloud systems, including an on-prem system for AI research.
Oversee day-to-day cloud operations and platform reliability across multiple production environments.
Provide technical guidance, architectural direction, and mentorship to engineering staff.
Conduct project planning and execution for infrastructure initiatives including hardware refreshes, storage expansions, and hybrid cloud deployments.
Maintain 5-year hardware refresh plans for on-prem infrastructure supporting PB-scale storage and multiple bioinformatics pipelines.
Represent the Center and its priorities to sponsors, serving as the primary liaison for platform and operations within the project portfolio.
Coordinate with hardware and SaaS vendors for procurement, negotiations, support contracts, and maintenance.
Generate monthly and annual reports on engineering and operations performance.
Prepare and oversee contract proposals related to platform and infrastructure services.
Serve as key personnel on infrastructure contracts and agreements.
Participate in security audits and annual tabletop exercises, and ensure team readiness for incident response.
Lead 24/7 incident response processes and maintain systems for continuous support coverage, serving as lead systems contact for incident response across the Center.
Ensure timely resolution of production incidents and platform issues.
Oversee design, deployment, and monitoring of infrastructure supporting scientific workflows and high-throughput genomic data processing.
Manages employees by establishing annual performance goals, allocating resources, assessing annual performance, and determining individual merit, incentive and/or promotional increases. Provides technical oversight and develops standards, guidelines, and processes for application systems.
Creates plans to translate business requirements into well-designed applications while balancing user and business needs, technical competencies, industry developments, and time constraints.
Advises decisions on project and infrastructure needs, including the evaluation of server technologies, languages, platforms, and frameworks. Develops timelines and project plans for the team.
Formulates and defines specifications for complex installations, maintenance, and upgrades. Identifies and analyzes performance and capacity issues.
Performs other related work as needed.
Minimum Qualifications
Education:
Minimum requirements include a college or university degree in a related field.
Work Experience:
Certifications:
---
Preferred Qualifications
Education:
Master’s degree in a related field.
Experience:
Experience with personnel accounting and financial concepts, especially for sponsored awards from NIH and/or NSF.
Experience with PB-scale hardware and software.
Experience with production-level bioinformatics workflow needs.
Expertise with system administration issues in a large and complex client-server environment.
Experience operating SaaS systems for cancer genomics researchers.
Experience working in a research environment as a government contractor or subcontractor.
3 years or more of supervisory experience.
Experience with Agile and Waterfall project management approaches.
Experience with privacy and security concerns and federal regulations related to human genomic data.
Experience with FISMA Moderate systems requirements.
Experience with securing and running open-source technology-based infrastructures and tools, especially those running virtual environments.
Experience with AI infrastructure needs and tools for making AI research open source.
Experience monitoring and enforcing strong network and system security policies and using information security scanning tools.
Experience managing deployment of systems at scale using Linux automatic installers.
Experience with data center operations, including business continuity planning and disaster recovery planning and testing.
Experience with data commons, meshes, and/or fabrics with controlled and open access data.
Preferred Competencies
Communicate effectively with internal parties, project sponsors, external vendors, and divisional and university leadership.
Coach and mentor junior and senior personnel regarding all aspects of their job, using influence where a direct supervisory relationship is absent.
Knowledge of configuration and management of clusters at scale.
Knowledge of backup technology and other monitoring and automated systems management technologies.
Knowledge of integration and management issues in a heterogeneous computing environment.
Outstanding deductive and investigative skills to identify and diagnose complex, non-intuitive technical problems.
Ability to apply in-depth knowledge and experience of internal or external business issues to improve products or services.
Ability to take a new perspective using existing solutions.
Ability to learn new procedures, techniques, and approaches quickly.
Integrity and credibility to work with sensitive data.
In-depth understanding of IT architectural frameworks, development methodologies, tools, and techniques.
Excellent supervisory and staff management skills.
Strong and effective oral and written communication skills.
Ability to facilitate technical discussions.
Ability to relate business issues to technology, and vice versa.
Ability to accurately monitor project progress, to keep track of effort and funds expended and committed, and to anticipate at an early stage any need for changes in project direction, scope, objectives, funding, or timeline.
In-depth knowledge of NIST SP 800-53 Rev. 5 and NIST SP 800-171.
Working Conditions
Hybrid office/work-from-home environment with frequent trips to campus data centers.
Application Documents
Resume (required)
Cover Letter (preferred)
When applying, the document(s) MUST be uploaded via the My Experience page, in the section titled Application Documents of the application.
Job Family
Role Impact
Scheduled Weekly Hours
Drug Test Required
Health Screen Required
Motor Vehicle Record Inquiry Required
Pay Rate Type
FLSA Status
Pay Range
The included pay rate or range represents the University’s good faith estimate of the possible compensation offer for this role at the time of posting.
Benefits Eligible
The University of Chicago offers a wide range of benefits programs and resources for eligible employees, including health, retirement, and paid time off. Information about the benefit offerings can be found in the Benefits Guidebook.
Posting Statement
The University of Chicago is an equal opportunity employer and does not discriminate on the basis of race, color, religion, sex, sexual orientation, gender, gender identity, or expression, national or ethnic origin, shared ancestry, age, status as an individual with a disability, military or veteran status, genetic information, or other protected classes under the law. For additional information please see the University's Notice of Nondiscrimination.
Job seekers in need of a reasonable accommodation to complete the application process should call 773-702-5800 or submit a request via the Applicant Inquiry Form.
All offers of employment are contingent upon a background check that includes a review of conviction history. A conviction does not automatically preclude University employment. Rather, the University considers conviction information on a case-by-case basis and assesses the nature of the offense, the circumstances surrounding it, the proximity in time of the conviction, and its relevance to the position.
The University of Chicago's Annual Security & Fire Safety Report (Report) provides information about University offices and programs that provide safety support, crime and fire statistics, emergency response and communications plans, and other policies and information. The Report can be accessed online at: http://securityreport.uchicago.edu. Paper copies of the Report are available, upon request, from the University of Chicago Police Department, 850 E. 61st Street, Chicago, IL 60637.