RF1AG090405
Project Grant
Overview
Grant Description
OPTIMIZING VALIDITY OF COMPARATIVE EFFECTIVENESS RESEARCH IN ALZHEIMER'S DISEASE AND RELATED DEMENTIAS USING LARGE LANGUAGE MODELS - SINCE PEOPLE LIVING WITH DEMENTIA (PLWD) ARE VULNERABLE TO MEDICATION ERRORS, DRUG-DRUG INTERACTIONS, AND A VARIETY OF ADVERSE DRUG EVENTS, THEIR PRESCRIBING DECISIONS NEED TO BE INFORMED BY SOLID EVIDENCE. PHYSICIANS’ PRESCRIBING DECISIONS OFTEN RELY ON ROUTINELY COLLECTED DATA BECAUSE RANDOMIZED CONTROLLED TRIALS (RCTS) OFTEN SEVERELY UNDERREPRESENT PLWD. ELECTRONIC HEALTH RECORDS (EHR) ARE AMONG THE MOST COMMONLY USED REAL-WORLD DATA FOR COMPARATIVE EFFECTIVENESS RESEARCH (CER) BECAUSE THEY CONTAIN RICH CLINICAL DATA. HOWEVER, THE STRUCTURED EHR DATA SUFFER FROM MISSING DATA ON KEY GERIATRIC FACTORS CRITICAL FOR CONDUCTING VALID CER AMONG PLWD, SUCH AS DEGREE OF COGNITIVE IMPAIRMENT, MENTAL AND FUNCTIONAL STATUS, AND BEHAVIORAL SYMPTOMS. MUCH OF THIS INFORMATION IS EMBEDDED IN FREE-TEXT CLINICAL NOTES AND REPORTS, BUT TRADITIONAL NATURAL LANGUAGE PROCESSING (NLP) REQUIRES A LABOR-INTENSIVE DATA ANNOTATION PROCESS FOR EACH TARGET PHENOTYPE, WHICH IS NOT SCALABLE FOR THE LARGE NUMBERS OF STUDY VARIABLES NEEDED FOR CONFOUNDING ADJUSTMENT IN A NON-RANDOMIZED CER STUDY. LARGE LANGUAGE MODELS (LLMS) HAVE SHOWN PROMISING POTENTIAL TO EXTRACT CONCEPTS AND PHENOTYPES THAT WERE NOT PREDEFINED DURING THE TRAINING STAGE. HOWEVER, THE PERFORMANCE OF EXISTING LLMS IN PREDICTING ADRD-RELEVANT PHENOTYPES IS UNKNOWN. NONE OF THE EXISTING LLMS HAVE BEEN TRAINED ON CLINICAL EHR NOTES LINKED TO EXTERNAL DATA THAT CONTAIN LONGITUDINAL GERIATRIC DATA. OUR OBJECTIVE IS TO BUILD NOVEL LLMS SPECIALIZING IN ADRD-RELEVANT CER. THEY ARE DESIGNED TO GENERATE ADRD-RELEVANT PHENOTYPES AND ARE TRAINED ON CLINICAL EHR INTEGRATED WITH MULTIPLE GERIATRIC-INFORMATION-ENRICHED EXTERNAL DATASETS.
THE GROUND TRUTH OF ALL PHENOTYPES OUR LLMS AIM TO PREDICT WILL BE PROVIDED BY LARGE-SCALE ANNOTATION AVAILABLE AS STRUCTURED DATA IN THE LINKED EXTERNAL DATASETS. OUR INTEGRATED DATASET WILL COVER >850,000 LIVES (>80,000 PLWD) IN TWO LARGE MULTI-CENTER EHR NETWORKS IN MASSACHUSETTS FROM 2000-2024. THE CENTRAL HYPOTHESIS IS THAT LLMS CAN BE USED TO SCALABLY GENERATE VALID FEATURES AND CONSISTENTLY REDUCE MISSING DATA ON KEY GERIATRIC FACTORS, ENHANCING THE ROBUSTNESS OF CAUSAL CER ANALYSES AMONG PLWD. BUILDING ON EXISTING GENERAL-PURPOSE LLMS, WE WILL DEVELOP NOVEL LLMS BY INSTRUCTION-TUNING, CONVERTING THE LINKED STRUCTURED LABELS INTO TEXT INSTRUCTIONS AND FINETUNING THE LLMS THROUGH A TEXT GENERATION FRAMEWORK, AND BY THE CHAIN-OF-THOUGHT TECHNIQUE, GUIDING LLMS TO INFER RESULTS VIA MULTIPLE REASONING STEPS. IN AIM 1, WE WILL CONTINUALLY PRE-TRAIN AND FINETUNE NOVEL LLMS TO DETERMINE EIGHT CATEGORIES OF GERIATRIC-SPECIFIC PHENOTYPES COMMONLY USED IN ADRD-RELEVANT CER. IN AIM 2, WE WILL ASSESS GENERALIZABILITY BY TESTING PERFORMANCE IN DETERMINING EIGHT ADDITIONAL PHENOTYPES NOT PREVIOUSLY TARGETED AND OPTIMIZE THE LLMS ACCORDINGLY. IN AIM 3, WE WILL COMPARE THE TREATMENT EFFECT ESTIMATION USING ONLY EHR (MIMICKING THE COMMON RESEARCH SCENARIO WHEN LINKAGE TO EXTERNAL DATA IS INFEASIBLE DUE TO PRIVACY CONCERNS) WITH VS. WITHOUT USING THE LLM-DERIVED FEATURES IN SIX HIGHLY RELEVANT EMPIRICAL DRUG SAFETY AND EFFECTIVENESS STUDIES AMONG PLWD.
Awardee
Funding Goals
TO ENCOURAGE BIOMEDICAL, SOCIAL, AND BEHAVIORAL RESEARCH AND RESEARCH TRAINING DIRECTED TOWARD GREATER UNDERSTANDING OF THE AGING PROCESS AND THE DISEASES, SPECIAL PROBLEMS, AND NEEDS OF PEOPLE AS THEY AGE. THE NATIONAL INSTITUTE ON AGING HAS ESTABLISHED PROGRAMS TO PURSUE THESE GOALS. THE DIVISION OF AGING BIOLOGY EMPHASIZES UNDERSTANDING THE BASIC BIOLOGICAL PROCESSES OF AGING. THE DIVISION OF GERIATRICS AND CLINICAL GERONTOLOGY SUPPORTS RESEARCH TO IMPROVE THE ABILITIES OF HEALTH CARE PRACTITIONERS TO RESPOND TO THE DISEASES AND OTHER CLINICAL PROBLEMS OF OLDER PEOPLE. THE DIVISION OF BEHAVIORAL AND SOCIAL RESEARCH SUPPORTS RESEARCH THAT WILL LEAD TO GREATER UNDERSTANDING OF THE SOCIAL, CULTURAL, ECONOMIC AND PSYCHOLOGICAL FACTORS THAT AFFECT BOTH THE PROCESS OF GROWING OLD AND THE PLACE OF OLDER PEOPLE IN SOCIETY. THE DIVISION OF NEUROSCIENCE FOSTERS RESEARCH CONCERNED WITH THE AGE-RELATED CHANGES IN THE NERVOUS SYSTEM AS WELL AS THE RELATED SENSORY, PERCEPTUAL, AND COGNITIVE PROCESSES ASSOCIATED WITH AGING AND HAS A SPECIAL EMPHASIS ON ALZHEIMER'S DISEASE. SMALL BUSINESS INNOVATION RESEARCH (SBIR) PROGRAM: TO EXPAND AND IMPROVE THE SBIR PROGRAM, TO INCREASE PRIVATE SECTOR COMMERCIALIZATION OF INNOVATIONS DERIVED FROM FEDERAL RESEARCH AND DEVELOPMENT, TO INCREASE SMALL BUSINESS PARTICIPATION IN FEDERAL RESEARCH AND DEVELOPMENT, AND TO FOSTER AND ENCOURAGE PARTICIPATION OF SOCIALLY AND ECONOMICALLY DISADVANTAGED SMALL BUSINESS CONCERNS AND WOMEN-OWNED SMALL BUSINESS CONCERNS IN TECHNOLOGICAL INNOVATION. 
SMALL BUSINESS TECHNOLOGY TRANSFER (STTR) PROGRAM: TO STIMULATE AND FOSTER SCIENTIFIC AND TECHNOLOGICAL INNOVATION THROUGH COOPERATIVE RESEARCH DEVELOPMENT CARRIED OUT BETWEEN SMALL BUSINESS CONCERNS AND RESEARCH INSTITUTIONS, TO FOSTER TECHNOLOGY TRANSFER BETWEEN SMALL BUSINESS CONCERNS AND RESEARCH INSTITUTIONS, TO INCREASE PRIVATE SECTOR COMMERCIALIZATION OF INNOVATIONS DERIVED FROM FEDERAL RESEARCH AND DEVELOPMENT, AND TO FOSTER AND ENCOURAGE PARTICIPATION OF SOCIALLY AND ECONOMICALLY DISADVANTAGED SMALL BUSINESS CONCERNS AND WOMEN-OWNED SMALL BUSINESS CONCERNS IN TECHNOLOGICAL INNOVATION.
Grant Program (CFDA)
Awarding / Funding Agency
Place of Performance
Massachusetts
United States
Geographic Scope
State-Wide
Related Opportunity
Brigham & Womens Hospital was awarded Advanced Language Models for Alzheimer's Research Optimization Project Grant RF1AG090405 worth $3,687,278 from National Institute on Aging in September 2025 with work to be completed primarily in Massachusetts, United States. The grant has a duration of 4 years and was awarded through assistance program 93.866 Aging Research. The Project Grant was awarded through grant opportunity NIH Research Project Grant (Parent R01 Clinical Trial Not Allowed).
Status
(Ongoing)
Last Modified 9/24/25
Period of Performance
9/15/25
Start Date
9/14/29
End Date
Funding Split
$3.7M
Federal Obligation
$0.0
Non-Federal Obligation
$3.7M
Total Obligated
Additional Detail
Award ID FAIN
RF1AG090405
SAI Number
RF1AG090405-2522330954
Award ID URI
SAI UNAVAILABLE
Awardee Classifications
Nonprofit With 501(c)(3) IRS Status (Other Than An Institution Of Higher Education)
Awarding Office
75NN00 NIH National Institute on Aging
Funding Office
75NN00 NIH National Institute on Aging
Awardee UEI
QN6MS4VN7BD1
Awardee CAGE
0W3J1
Performance District
MA-90
Senators
Edward Markey
Elizabeth Warren