Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Sep-Oct;21(5):1529-1541.
doi: 10.1109/TCBB.2024.3402675. Epub 2024 Oct 9.

Molecular Design Based on Integer Programming and Splitting Data Sets by Hyperplanes

Molecular Design Based on Integer Programming and Splitting Data Sets by Hyperplanes

Jianshen Zhu et al. IEEE/ACM Trans Comput Biol Bioinform. 2024 Sep-Oct.

Abstract

A novel framework for designing the molecular structure of chemical compounds with a desired chemical property has recently been proposed. The framework infers a desired chemical graph by solving a mixed integer linear program (MILP) that simulates the computation process of two functions: a feature function defined by a two-layered model on chemical graphs and a prediction function constructed by a machine learning method. To improve the learning performance of prediction functions in the framework, we design a method that splits a given data set C into two subsets C(i),i=1,2 by a hyperplane in a chemical space so that most compounds in the first (resp., second) subset have observed values lower (resp., higher) than a threshold θ. We construct a prediction function ψ to the data set C by combining prediction functions ψi,i=1,2 each of which is constructed on C(i) independently. The results of our computational experiments suggest that the proposed method improved the learning performance for several chemical properties to which a good prediction function has been difficult to construct.

PubMed Disclaimer

References

Publication types