Search Prime Grants

U18TR004138

Cooperative Agreement

Overview

Grant Description
Virtual Approaches to New Chemistries - Project Summary/Abstract

Two new virtual chemistry technologies will be added to the NCATS ASPIRE project as separate modules. The first module will enable new chemistries to be modelled and selected from cutting-edge (DEEP) machine learning technology using the latest structure/activity data taken directly from instruments.

The second module will be a novel informatics system for capturing chemistry-rich data in a semantic template as machine-readable reactions. This will increase the utility of chemical reactions in electronic lab notebooks and allow more precise interrogation and automation of reaction analyses (and their corresponding reaction products).

The deep learning technology in module 1 is based on our new Chemically Rich Vector (CRV) methodology, which is able to compress information about chemical structures into a vector of 64 numbers with an efficiency that allows the encoding process to be reversed. Not only can a CRV be converted back into its original structure with high success (>90% exact match), but a modified CRV can be converted into a structure that is representative of that point in chemical space.

CRVs make excellent descriptors for SAR/QSAR iteration because they contain much more chemical information in a small space, allowing the automation of structure-activity models to be more streamlined, relative to conventional descriptors. The resulting models will explore the multi-dimensional space via an interactive visual interface (human-directed) or a back-end algorithm to constantly search for new and better structures (machine-directed).

Both interactive and automated processes will be connected back into the ASPIRE automation cycle so that they can be synthesized and measured (hypothesis evaluation and iterative optimization).

The second module, machine-readable reactions, draws from our extensive experience developing the BioHarmony Annotator (formerly: BioAssay Express), which uses natural language models to assign semantic ontology terms to biological assay protocols, turning them from unstructured text into machine-readable data.

Extracting the full content of reactions from protocols and chemical structure diagrams is remarkably difficult given the unstructured nature of text, abbreviations, shortcuts, and assumptions that go into diagrams. It is further complicated by the need to connect the materials in the scheme with the reaction text description (e.g. reagents, solvents, the sequences involved in the recipe, reaction workup, and product characterization).

As an alternative, we will modularize the CDD Stoichiometric Sketcher, which will allow us to extract this data. We will work with NCATS to identify important fields to capture, creating a machine-readable chemical reaction template.
Funding Goals
NOT APPLICABLE
Place of Performance
California United States
Geographic Scope
State-Wide
Analysis Notes
Amendment Since initial award the total obligations have increased 100% from $440,000 to $880,000.
Collaborative Drug Discovery was awarded Virtual Approaches to New Chemistries Cooperative Agreement U18TR004138 worth $880,000 from National Center for Advancing Translational Sciences in June 2022 with work to be completed primarily in California United States. The grant has a duration of 2 years and was awarded through assistance program 93.350 National Center for Advancing Translational Sciences. The Cooperative Agreement was awarded through grant opportunity Virtual Approaches Towards New Chemistries for Un-drugged Targets through A Specialized Platform for Innovative Research Exploration (ASPIRE) Collaborative Research Program (U18 Clinical Trials Not Allowed).

Status
(Complete)

Last Modified 8/20/24

Period of Performance
6/6/22
Start Date
5/31/24
End Date
100% Complete

Funding Split
$880.0K
Federal Obligation
$0.0
Non-Federal Obligation
$880.0K
Total Obligated
100.0% Federal Funding
0.0% Non-Federal Funding

Activity Timeline

Interactive chart of timeline of amendments to U18TR004138

Transaction History

Modifications to U18TR004138

Additional Detail

Award ID FAIN
U18TR004138
SAI Number
U18TR004138-279550083
Award ID URI
SAI UNAVAILABLE
Awardee Classifications
Small Business
Awarding Office
75NR00 NIH NATIONAL CENTER FOR ADVANCING TRANSLATIONAL SCIENCES
Funding Office
75NR00 NIH NATIONAL CENTER FOR ADVANCING TRANSLATIONAL SCIENCES
Awardee UEI
KC1DAXHM8424
Awardee CAGE
4N2Z7
Performance District
CA-90
Senators
Dianne Feinstein
Alejandro Padilla

Budget Funding

Federal Account Budget Subfunction Object Class Total Percentage
National Center for Advancing Translational Sciences, National Institutes of Health, Health and Human Services (075-0875) Health research and training Grants, subsidies, and contributions (41.0) $880,000 100%
Modified: 8/20/24