Search Prime Grants

2403911

Cooperative Agreement

Overview

Grant Description
Apto: Constructing causal knowledge graphs for assessing and predicting technology outcomes - Assessing and predicting technology outcomes (APTO) is crucial for evaluating the impact of R&D investments on innovation, economic growth, and national competitiveness.

Tackling this complex task requires appropriate datasets and effective data creation tools.

The latest natural language processing (NLP) technologies have reached human-level performance in certain crucial information extraction tasks, as evidenced by results from community-organized challenges.

This project will leverage an award-winning pipeline for constructing knowledge graphs (KGs) by expanding it substantially to include a diverse set of technology-related entities and their relations.

KGs comprise entities like diseases, genes, drugs, etc., and their relations, including associations, bindings, positive correlations, etc.

The enhanced KG will be ready for developing models for predicting technology outcomes in healthcare and drug discovery.

In addition to the dataset, the project team will also develop an end-to-end toolkit for fellow APTO teams to perform information extraction tasks and construct KGs in their own domains.

By utilizing the latest advancements in NLP and predictive modeling, the project will provide a comprehensive assessment of the capabilities and applications of biomedical technologies.

This will not only inform R&D investments but will also contribute to informed decision-making in healthcare and technology policy, as well as address the disparities between healthcare spending and outcomes.

Furthermore, the project's approach of extracting vast amounts of information from text to build predictive models can be applied to other sectors, advancing research and knowledge across various fields.

Ultimately, this project has the potential to drive strategic investments in technology and innovation, improving health outcomes and fostering economic prosperity on a national and global scale.

This project will leverage a pipeline recently developed that won the NIH-organized Litcoin NLP Challenge, a competition that evaluated methods for constructing biomedical knowledge graphs (KGs) by extracting entities and their relations from biomedical texts.

Using this pipeline, the project team created a large-scale KG by extracting information from all PubMed abstracts.

The KG, named IKRAPH, contains substantially more information than that in public databases.

To adapt IKRAPH for causal inference, the project team annotated direction information for the relations in the Litcoin dataset and developed models to predict the direction of relations, which enabled the construction of a causal KG capable of inferring causality between indirectly connected entities.

In this project, IKRAPH will be enhanced by adding a diverse set of technology-related entities and their relations such as equipment, technology, technology features, feature values, problems, methods, data types, datasets, and geographical entities, etc.

The project team will extract relevant information from unstructured text including PubMed abstracts, PubMed Central full-text articles, patents, marketing reports, and Wikipedia articles.

Relevant data from public databases will also be integrated into IKRAPH.

The toolkit for constructing KGs, designed for end-to-end annotation and model building, will utilize an AI-assisted methodology.

This approach incorporates AI models at every stage of the annotation process to enhance quality and significantly improve efficiency.

Finally, the project team will conduct a case study on advanced manufacturing technologies (AMT) for the production of generic off-patent drugs using the constructed KG.

This award reflects NSF's statutory mission and has been deemed worthy of support through evaluation using the foundation's intellectual merit and broader impacts review criteria.

Subawards are not planned for this award.
Awardee
Funding Goals
THE GOAL OF THIS FUNDING OPPORTUNITY, "ASSESSING AND PREDICTING TECHNOLOGY OUTCOMES", IS IDENTIFIED IN THE LINK: HTTPS://WWW.NSF.GOV/PUBLICATIONS/PUB_SUMM.JSP?ODS_KEY=NSF23600
Awarding / Funding Agency
Place of Performance
Tallahassee, Florida 32312-5701 United States
Geographic Scope
Single Zip Code
Analysis Notes
Amendment Since initial award the total obligations have increased 100% from $1,932,818 to $3,865,636.
Insilicom was awarded Predicting Technology Outcomes with Enhanced Knowledge Graphs Cooperative Agreement 2403911 worth $3,865,636 from National Science Foundation in August 2024 with work to be completed primarily in Tallahassee Florida United States. The grant has a duration of 3 years and was awarded through assistance program 47.084 NSF Technology, Innovation, and Partnerships. The Cooperative Agreement was awarded through grant opportunity Assessing and Predicting Technology Outcomes.

Status
(Ongoing)

Last Modified 8/12/25

Period of Performance
8/1/24
Start Date
7/31/27
End Date
39.0% Complete

Funding Split
$3.9M
Federal Obligation
$0.0
Non-Federal Obligation
$3.9M
Total Obligated
100.0% Federal Funding
0.0% Non-Federal Funding

Activity Timeline

Interactive chart of timeline of amendments to 2403911

Transaction History

Modifications to 2403911

Additional Detail

Award ID FAIN
2403911
SAI Number
None
Award ID URI
SAI EXEMPT
Awardee Classifications
Small Business
Awarding Office
491501 TECHNOLOGY FRONTIERS
Funding Office
491501 TECHNOLOGY FRONTIERS
Awardee UEI
UEZ1NQZ388J6
Awardee CAGE
702F0
Performance District
FL-02
Senators
Marco Rubio
Rick Scott
Modified: 8/12/25