U01TR003528
Cooperative Agreement
Overview
Grant Description
Critical: Collaborative Resource for Intensive Care Translational Science, Informatics, Comprehensive Analytics, and Learning
Translational research in Artificial Intelligence (AI) has been hindered by the lack of shared data resources with sufficient depth, breadth, and diversity. There are very limited EHR datasets freely available to the general research community, especially the AI research community, through credential-based access. The MIMIC dataset is from a single institution that has a fixed and limited racial, ethnic, and geographic profile. The eICU dataset is limited in data comprehensiveness (e.g., number of kinds of lab tests ~1/5 of MIMIC), data span (1 year, 2014-2015), and data variety (e.g., no free text clinical notes) etc. Thus, MIMIC and eICU respectively have advantages and disadvantages of data depth and data breadth.
The vision of this proposal is to leverage multiple Clinical and Translational Science Awards (CTSAs) with diverse racial, ethnic, and geographic profiles in order to develop and evaluate a multi-site de-identified ICU dataset, to facilitate accelerated translational research in AI and deep learning approaches to understand, track, and predict the pathophysiological state of patients.
In this project, a group of nationwide CTSA sites will work together to build a new, more inclusive, multi-site dataset that is downloadable from NCATS Cloud by researchers with credential-based access. This project will combine the respective advantages of MIMIC (data depth) and eICU (data breadth). The created dataset will include more geographic regions, larger quantities of time-series data, including pre-, during-, and post-ICU patient information. This will incorporate not only more patient diversity but also capture regional population differences and practice variations that could have clinical impact.
Aim 1 will develop and provide credentialed access to a multi-site dataset consisting of de-identified discrete outpatient, inpatient, and ICU data for critically ill patients at respective CTSAs. Aim 2 will create a federated access dataset from and develop novel federated learning methods on the part of the multi-site ICU data consisting of unstructured clinical notes or structured data for select groups of patients at higher risks of re-identification (e.g., rare disease patients). Aim 3 will develop novel memory-network based meta-learning AI algorithms and use the multi-site dataset to answer concrete and long-standing clinical problems in critical care. Aim 4 will innovatively leverage the library network to develop and disseminate open resources for the research community and develop best practice guidelines for other CTSAs to join the effort. In particular, we aim to support and cultivate the growth of the next generation medical AI workforce for research and practice.
We aim to establish a large cross-CTSA collaborative data sharing for critical care by leveraging the existing CTSA collaborative networks. With the diversified racial, ethnic, and geographic profiles from the above CTSAs, we will be able to support fair and generalizable algorithms for advanced patient monitoring and decision support. The proposed project will provide best practice guidance to and set up exemplary examples for nationwide CTSAs. It will also support the cultivation of next generation medical AI researchers.
Translational research in Artificial Intelligence (AI) has been hindered by the lack of shared data resources with sufficient depth, breadth, and diversity. There are very limited EHR datasets freely available to the general research community, especially the AI research community, through credential-based access. The MIMIC dataset is from a single institution that has a fixed and limited racial, ethnic, and geographic profile. The eICU dataset is limited in data comprehensiveness (e.g., number of kinds of lab tests ~1/5 of MIMIC), data span (1 year, 2014-2015), and data variety (e.g., no free text clinical notes) etc. Thus, MIMIC and eICU respectively have advantages and disadvantages of data depth and data breadth.
The vision of this proposal is to leverage multiple Clinical and Translational Science Awards (CTSAs) with diverse racial, ethnic, and geographic profiles in order to develop and evaluate a multi-site de-identified ICU dataset, to facilitate accelerated translational research in AI and deep learning approaches to understand, track, and predict the pathophysiological state of patients.
In this project, a group of nationwide CTSA sites will work together to build a new, more inclusive, multi-site dataset that is downloadable from NCATS Cloud by researchers with credential-based access. This project will combine the respective advantages of MIMIC (data depth) and eICU (data breadth). The created dataset will include more geographic regions, larger quantities of time-series data, including pre-, during-, and post-ICU patient information. This will incorporate not only more patient diversity but also capture regional population differences and practice variations that could have clinical impact.
Aim 1 will develop and provide credentialed access to a multi-site dataset consisting of de-identified discrete outpatient, inpatient, and ICU data for critically ill patients at respective CTSAs. Aim 2 will create a federated access dataset from and develop novel federated learning methods on the part of the multi-site ICU data consisting of unstructured clinical notes or structured data for select groups of patients at higher risks of re-identification (e.g., rare disease patients). Aim 3 will develop novel memory-network based meta-learning AI algorithms and use the multi-site dataset to answer concrete and long-standing clinical problems in critical care. Aim 4 will innovatively leverage the library network to develop and disseminate open resources for the research community and develop best practice guidelines for other CTSAs to join the effort. In particular, we aim to support and cultivate the growth of the next generation medical AI workforce for research and practice.
We aim to establish a large cross-CTSA collaborative data sharing for critical care by leveraging the existing CTSA collaborative networks. With the diversified racial, ethnic, and geographic profiles from the above CTSAs, we will be able to support fair and generalizable algorithms for advanced patient monitoring and decision support. The proposed project will provide best practice guidance to and set up exemplary examples for nationwide CTSAs. It will also support the cultivation of next generation medical AI researchers.
Awardee
Funding Goals
NOT APPLICABLE
Grant Program (CFDA)
Awarding / Funding Agency
Place of Performance
Chicago,
Illinois
60611
United States
Geographic Scope
Single Zip Code
Related Opportunity
Analysis Notes
Amendment Since initial award the total obligations have increased 287% from $1,255,336 to $4,864,370.
Northwestern University was awarded
Collaborative ICU Dataset for AI Research
Cooperative Agreement U01TR003528
worth $4,864,370
from National Center for Advancing Translational Sciences in August 2021 with work to be completed primarily in Chicago Illinois United States.
The grant
has a duration of 4 years and
was awarded through assistance program 93.350 National Center for Advancing Translational Sciences.
The Cooperative Agreement was awarded through grant opportunity Limited Competition: Clinical and Translational Science Award (CTSA) Program: Collaborative Innovation Award, (U01 Clinical Trial Optional).
Status
(Complete)
Last Modified 9/20/24
Period of Performance
8/15/21
Start Date
7/31/25
End Date
Funding Split
$4.9M
Federal Obligation
$0.0
Non-Federal Obligation
$4.9M
Total Obligated
Activity Timeline
Subgrant Awards
Disclosed subgrants for U01TR003528
Transaction History
Modifications to U01TR003528
Additional Detail
Award ID FAIN
U01TR003528
SAI Number
U01TR003528-291541309
Award ID URI
SAI UNAVAILABLE
Awardee Classifications
Private Institution Of Higher Education
Awarding Office
75NR00 NIH NATIONAL CENTER FOR ADVANCING TRANSLATIONAL SCIENCES
Funding Office
75NR00 NIH NATIONAL CENTER FOR ADVANCING TRANSLATIONAL SCIENCES
Awardee UEI
KG76WYENL5K1
Awardee CAGE
01725
Performance District
IL-05
Senators
Richard Durbin
Tammy Duckworth
Tammy Duckworth
Budget Funding
Federal Account | Budget Subfunction | Object Class | Total | Percentage |
---|---|---|---|---|
National Center for Advancing Translational Sciences, National Institutes of Health, Health and Human Services (075-0875) | Health research and training | Grants, subsidies, and contributions (41.0) | $2,406,123 | 100% |
Modified: 9/20/24