R03OD034496
Project Grant
Overview
Grant Description
Uncovering therapeutic-associated biomarkers via machine learning and feature engineering approaches - Project summary
Identifying biomarkers that are diagnostic, robust, and generalizable across individuals while possessing therapeutic values is the most wanted endeavor in medicine. However, there are numerous challenges in the identification of such robust therapy-associated biomarkers (TABS).
For example, most of the current methods seek to achieve statistically significant differential biological signals in general patient cohorts but fail to acknowledge heterogenous genetic backgrounds and phenotypic diversity among individual patients. Our recent studies using newly developed machine learning-based feature engineering approaches and conducted in a pan-cancer study across 12 cancer types showed that biologically constrained features (named herein invariant features) are universal in disease and can be used to classify individual cancers.
Importantly, we also show that invariant features can be used to build de novo biological networks and discover network hubs that can be successfully utilized to infer the expression of associated genes. As such, invariant features can act as information encoders. Using information from drug repurposing hub, we show that these hub genes are also drug targets. Collectively, these observations suggest that invariant feature hubs can be TAB candidates.
We propose that under the new light of biological constraints, we can use a dynamic approach for biomarker discovery that encapsulates both the genetic heterogeneity and molecular fluctuation across individual patients. Our central hypothesis is that disease states show constrains in their molecular activities, and identifiable invariable features possess diagnostic and therapeutic values.
The main objective of this proposal is to uncover TABS using selected NIH Common Fund datasets (namely, exRNA, GTEx, LINCS, and IDG). In Aim 1, we will test the hypothesis that biologically constrained invariant features are universal to most if not all biological states. We will show this by finding invariant features with respect to each biological state from selected Common Fund datasets.
We will conduct comparative analyses in disease and normal states in order to dissect disease-specific invariant features. Next, in Aim 2, we will test the hypothesis that invariant feature hubs are TABS. We will show this by determining the diagnostic capability of invariant feature hubs for their "encodability" to reconstruct the expression values of their associated invariant feature genes in different individual patients diagnosed under the same disease type.
Finally, we will map these invariant feature hubs to IDG and DrugBank to determine their druggability. For those understudied hubs with no known drugs, we will perform computational analyses such as homology modeling and machine learning to characterize their druggability. We expect timely accomplishment of proposed aims, and successful completion of this project will no doubt provide added values for the selected Common Fund datasets, while providing a new paradigm shift of biomarker and therapeutic target discovery.
Identifying biomarkers that are diagnostic, robust, and generalizable across individuals while possessing therapeutic values is the most wanted endeavor in medicine. However, there are numerous challenges in the identification of such robust therapy-associated biomarkers (TABS).
For example, most of the current methods seek to achieve statistically significant differential biological signals in general patient cohorts but fail to acknowledge heterogenous genetic backgrounds and phenotypic diversity among individual patients. Our recent studies using newly developed machine learning-based feature engineering approaches and conducted in a pan-cancer study across 12 cancer types showed that biologically constrained features (named herein invariant features) are universal in disease and can be used to classify individual cancers.
Importantly, we also show that invariant features can be used to build de novo biological networks and discover network hubs that can be successfully utilized to infer the expression of associated genes. As such, invariant features can act as information encoders. Using information from drug repurposing hub, we show that these hub genes are also drug targets. Collectively, these observations suggest that invariant feature hubs can be TAB candidates.
We propose that under the new light of biological constraints, we can use a dynamic approach for biomarker discovery that encapsulates both the genetic heterogeneity and molecular fluctuation across individual patients. Our central hypothesis is that disease states show constrains in their molecular activities, and identifiable invariable features possess diagnostic and therapeutic values.
The main objective of this proposal is to uncover TABS using selected NIH Common Fund datasets (namely, exRNA, GTEx, LINCS, and IDG). In Aim 1, we will test the hypothesis that biologically constrained invariant features are universal to most if not all biological states. We will show this by finding invariant features with respect to each biological state from selected Common Fund datasets.
We will conduct comparative analyses in disease and normal states in order to dissect disease-specific invariant features. Next, in Aim 2, we will test the hypothesis that invariant feature hubs are TABS. We will show this by determining the diagnostic capability of invariant feature hubs for their "encodability" to reconstruct the expression values of their associated invariant feature genes in different individual patients diagnosed under the same disease type.
Finally, we will map these invariant feature hubs to IDG and DrugBank to determine their druggability. For those understudied hubs with no known drugs, we will perform computational analyses such as homology modeling and machine learning to characterize their druggability. We expect timely accomplishment of proposed aims, and successful completion of this project will no doubt provide added values for the selected Common Fund datasets, while providing a new paradigm shift of biomarker and therapeutic target discovery.
Awardee
Funding Goals
NOT APPLICABLE
Grant Program (CFDA)
Awarding Agency
Place of Performance
Minnesota
United States
Geographic Scope
State-Wide
Related Opportunity
Analysis Notes
Amendment Since initial award the End Date has been extended from 09/19/23 to 09/19/24 and the total obligations have increased 31799900% from $1 to $318,000.
Mayo Clinic was awarded
Project Grant R03OD034496
worth $318,000
from the National Institute of Allergy and Infectious Diseases in September 2022 with work to be completed primarily in Minnesota United States.
The grant
has a duration of 2 years and
was awarded through assistance program 93.310 Trans-NIH Research Support.
The Project Grant was awarded through grant opportunity Pilot Projects Enhancing Utility and Usage of Common Fund Data Sets (R03 Clinical Trial Not Allowed).
Status
(Complete)
Last Modified 2/20/25
Period of Performance
9/20/22
Start Date
9/19/24
End Date
Funding Split
$318.0K
Federal Obligation
$0.0
Non-Federal Obligation
$318.0K
Total Obligated
Activity Timeline
Transaction History
Modifications to R03OD034496
Additional Detail
Award ID FAIN
R03OD034496
SAI Number
R03OD034496-3603408240
Award ID URI
SAI UNAVAILABLE
Awardee Classifications
Nonprofit With 501(c)(3) IRS Status (Other Than An Institution Of Higher Education)
Awarding Office
75AGNA NIH AGGREGATE FINANCIAL ASSISTANCE DATA AWARDING OFFICE
Funding Office
75NA00 NIH OFFICE OF THE DIRECTOR
Awardee UEI
Y2K4F9RPRRG7
Awardee CAGE
5A021
Performance District
MN-90
Senators
Amy Klobuchar
Tina Smith
Tina Smith
Budget Funding
| Federal Account | Budget Subfunction | Object Class | Total | Percentage |
|---|---|---|---|---|
| Office of the Director, National Institutes of Health, Health and Human Services (075-0846) | Health research and training | Grants, subsidies, and contributions (41.0) | $317,999 | 100% |
Modified: 2/20/25