Search Prime Grants

R44GM122196

Project Grant

Overview

Grant Description
Centralized assay datasets for modelling support of small drug discovery organizations.
Funding Goals
THE NATIONAL INSTITUTE OF GENERAL MEDICAL SCIENCES (NIGMS) SUPPORTS BASIC RESEARCH THAT INCREASES OUR UNDERSTANDING OF BIOLOGICAL PROCESSES AND LAYS THE FOUNDATION FOR ADVANCES IN DISEASE DIAGNOSIS, TREATMENT, AND PREVENTION. NIGMS ALSO SUPPORTS RESEARCH IN SPECIFIC CLINICAL AREAS THAT AFFECT MULTIPLE ORGAN SYSTEMS: ANESTHESIOLOGY AND PERI-OPERATIVE PAIN, CLINICAL PHARMACOLOGY ?COMMON TO MULTIPLE DRUGS AND TREATMENTS, AND INJURY, CRITICAL ILLNESS, SEPSIS, AND WOUND HEALING.? NIGMS-FUNDED SCIENTISTS INVESTIGATE HOW LIVING SYSTEMS WORK AT A RANGE OF LEVELSFROM MOLECULES AND CELLS TO TISSUES AND ORGANSIN RESEARCH ORGANISMS, HUMANS, AND POPULATIONS. ADDITIONALLY, TO ENSURE THE VITALITY AND CONTINUED PRODUCTIVITY OF THE RESEARCH ENTERPRISE, NIGMS PROVIDES LEADERSHIP IN SUPPORTING THE TRAINING OF THE NEXT GENERATION OF SCIENTISTS, ENHANCING THE DIVERSITY OF THE SCIENTIFIC WORKFORCE, AND DEVELOPING RESEARCH CAPACITY THROUGHOUT THE COUNTRY.
Place of Performance
North Carolina United States
Geographic Scope
State-Wide
Analysis Notes
Amendment Since initial award the End Date has been extended from 07/31/20 to 07/31/24 and the total obligations have increased 338% from $749,323 to $3,285,580.
Collaborations Pharmaceuticals was awarded Project Grant R44GM122196 worth $3,285,580 from the National Institute of General Medical Sciences in January 2016 with work to be completed primarily in North Carolina United States. The grant has a duration of 7 years 6 months and was awarded through assistance program 93.859 Biomedical Research and Research Training. The Project Grant was awarded through grant opportunity PHS 2020-2 Omnibus Solicitation of the NIH, CDC and FDA for Small Business Innovation Research Grant Applications (Parent SBIR [R43/R44] Clinical Trial Not Allowed).

SBIR Details

Research Type
SBIR Phase II
Title
Centralized assay datasets for modelling support of small drug discovery organizations
Abstract
Project Summary Collaborations Pharmaceuticals, Inc. was formed after identifying a need for software to assist academics and smaller companies in curating their data and discovery of new hits or lead optimisation. In the past two years the continued importance of artificial intelligence (AI) is apparent from the explosive growth in number of these companies and the increasing number of multi-million dollar deals with pharma using Machine Learning (ML) to assist in drug discovery. There is a heavy focus by these companies on the drug discovery modeling aspect but there is a continued unmet need and bottleneck in the curation of quality in vitro and in vivo data ADME/Tox data for ML as well as prospective testing to validate the technologies. In Phase I, we developed a prototype of Assay CentralÒ software and used this with a wide variety of structure activity data from sources both public and private, formatted and unformatted, with ~14 collaborators working on neglected, rare or common disease targets as well as used it for our internal drug discovery projects. In Phase I we also created error checking and correction software. We also built and validated Bayesian models with the datasets that were collected and cleaned. And, in addition, we developed new data visualization tools. The software can be used to create selections of these models for sharing with collaborators as needed and for scoring new molecules and visualizing the multiple outputs in various formats. In Phase II, we have developed Assay CentralÒ into a production tool which is easy to deploy, built on industry standard technologies, provided graphical display of models and information on model applicability. Importantly, we identified that customers wanted us to provide them with the results! We developed our fee-for-service consulting services model using Assay CentralÒ to solve their problems and this has expanded our revenues annually. In Phase II we evaluated additional ML algorithms and molecular descriptors with manually curated datasets as well as compared algorithms across over 5000 auto-curated datasets from ChEMBL. This illustrated the utility of access to multiple algorithms and how the Bayesian algorithm was generally comparable to these other ML algorithms. This also motivated us to develop new software to integrate these algorithms. We have also explored finding rare disease datasets and applying our data curation and ML approach to them. With these and additional collaborations, as well as internal projects on Alzheimer’s disease (through a NIH NIGMS supplement) we have been able to repurpose already approved drugs for several targets for this and other diseases. For multiple projects we have performed several rounds of model building and fed data back into the models to enable improved predictions. Finally, we have developed prototype tools to enable us to develop automated molecule designs, assess their synthesizability and perform retrosynthetic analysis. These combined efforts dramatically increased the number of projects we were able to work on (and ultimately publish to raise our visibility), created new spin off products as collections of models (MegaTransÒ, MegaToxÒ and MegaPredictÒ), molecule related IP, and generated employment. In Phase IIB we now propose a focus on steps to aid commercialization and further development of these technologies. We have identified that developing auto-curation software for dealing with complex biological data in unstructured databases will be a competitive advantage. We have also recognized that for many diseases we can have a complete or near complete collection of targets which may enable us to understand how a molecule may interfere with biological pathways from structure alone and this can be applied to complex diseases and “adverse outcome pathways” in toxicology. We also propose integrating state of the art multi-objective generative models for molecule design into our Assay Central computational software in order to complement our analog generation and retrosynthesis tools created in Phase II and aid in molecule optimization. We will validate this capability using some of the hit molecules identified in Phase II for different targets including human acetylcholinesterase. Assay Central would then have a full suite of integrated capabilities from data curation through to molecule design and retrosynthetic analysis and will enable us to attract larger deals with companies.Narrative In Phase II Collaborations Pharmaceuticals Inc. developed Assay CentralÒ software, applied it to build thousands of models using public and private drug discovery and ADME/Tox datasets and identified that the Bayesian algorithm performs comparably to other more advanced Machine learning (ML) algorithms. We have applied this modeling approach to rare and common diseases to identify new molecules to test and have identified active molecules through numerous prospective validations, leading to many patents and publications. We have also applied the trademarked Assay CentralÒ technology on consulting projects with pharmaceutical, consumer product and preclinical CRO companies, demonstrating it meets an unmet need and in the process generated revenues. As we have developed software to propose novel compounds for testing, that integrates our machine learning, molecule design and retrosynthesis we now propose extending our auto-curation to unstructured databases, integrating machine learning models for a disease pathway and the addition of multi- objective generative models in order to build out our Assay Central capabilities.
Topic Code
400
Solicitation Number
PA20-260

Status
(Complete)

Last Modified 3/20/25

Period of Performance
1/1/17
Start Date
7/31/24
End Date
100% Complete

Funding Split
$3.3M
Federal Obligation
$0.0
Non-Federal Obligation
$3.3M
Total Obligated
100.0% Federal Funding
0.0% Non-Federal Funding

Activity Timeline

Interactive chart of timeline of amendments to R44GM122196

Subgrant Awards

Disclosed subgrants for R44GM122196

Transaction History

Modifications to R44GM122196

Additional Detail

Award ID FAIN
R44GM122196
SAI Number
R44GM122196-1701739314
Award ID URI
SAI UNAVAILABLE
Awardee Classifications
Small Business
Awarding Office
75NS00 NIH National Institute of General Medical Sciences
Funding Office
75NS00 NIH National Institute of General Medical Sciences
Awardee UEI
EFJCLL37KKH3
Awardee CAGE
7BGW7
Performance District
NC-90
Senators
Thom Tillis
Ted Budd

Budget Funding

Federal Account Budget Subfunction Object Class Total Percentage
National Institute of General Medical Sciences, National Institutes of Health, Health and Human Services (075-0851) Health research and training Grants, subsidies, and contributions (41.0) $3,628,711 91%
National Institute on Aging, National Institutes of Health, Health and Human Services (075-0843) Health research and training Grants, subsidies, and contributions (41.0) $377,518 9%
Modified: 3/20/25