Research Topic Description, including Problem Statement:

Cheminformatics is a relatively new field of science that brings together chemistry, computer science, and data analysis *1. While cheminformatics has been applied extensively in drug discovery and pharmaceutical development, this may be only the tip of the iceberg in terms of how these tools could be used by researchers and the Intelligence Community. By harnessing advanced analytical tools, such as AI and machine learning, cheminformatics enables researchers to predict molecular properties, design new compounds, and characterize those compounds in new and insightful ways *2, *3, *4.

The pharmaceutical industry uses cheminformatics tools in drug discovery, as described in the 2016 review article by Lawless et al. *5. These models use chemical properties such as solubility, permeability, metabolic stability, and structure to identify likely candidates for particular bioactivities. Building on a similar framework, cheminformatics tools can be used to predict a number of molecular and spectral properties. Identifying novel materials, understanding their potential synthesis pathways, and characterizing their spectra are three tasks that can be time-consuming and/or expensive using traditional laboratory methods. Cheminformatics may be able to greatly reduce the time and cost of these tasks *6.

A key aspect of all cheminformatics tools is the libraries of data they draw from. Developing new methods, or augmenting existing methods, for parsing large and diverse datasets is a critical component of developing the cheminformatics tools of the future.


Example Approaches:

Researchers responding to this topic should seek to develop new cheminformatics tools or apply existing tools to new applications, such as:

  • Prediction of chemical structure of novel materials
  • Prediction of synthesis pathways for novel materials
  • Novel methods for spectral modeling of existing materials
  • Characterization of molecular properties through computational methods

Through these studies, it is hoped that the academic and intelligence communities will gain an enhanced understanding of what is possible through cheminformatics and how to apply these tools to new and evolving threat materials. While the focus of the effort should be on computational chemistry and modeling tools, collaboration with experimental scientists and/or laboratories is encouraged.

Relevance to the Intelligence Community:

This research topic aligns directly with the S&T priority AI and Machine Learning.

 – Develop/enhance agile, scalable, accessible, and reliable methods of processing disparate data. (Category Zero)
 – Develop/enhance capabilities to flag anomalies within massive data sets. (Category Zero)
 – Develop/enhance understanding of global dual-use technologies that may be used for chemical, biological, radiological, and nuclear weapons programs. (Category One)
 – Develop/enhance methods to discriminate offensive biological activities from defensive programs or other non-offensive research. (Category Two)
 – Develop/enhance methods to detect, assess, and/or evaluate current and emerging threats that cause death, disease, or other biological malfunction of the neurological system. (Category Two)
 – Develop/enhance capabilities to detect and identify chemical agents and associated delivery systems. (Category Two)
 2N051 – Develop/enhance detection and monitoring of chemical weapon program activities, including Schedule 1 chemicals or precursors. (Category Two)
 – Develop/enhance detection capabilities to identify chemical weaponization. (Category Two)

Key Words: cheminformatics, chemoinformatics, computational chemistry, chemical sensing, predictive modeling, spectral modeling, molecular modeling, chemical structure, data mining, mathematical modeling, database development


