Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2020 Oct 27:11:592430.
doi: 10.3389/fmicb.2020.592430. eCollection 2020.

RNMFMDA: A Microbe-Disease Association Identification Method Based on Reliable Negative Sample Selection and Logistic Matrix Factorization With Neighborhood Regularization

Affiliations

RNMFMDA: A Microbe-Disease Association Identification Method Based on Reliable Negative Sample Selection and Logistic Matrix Factorization With Neighborhood Regularization

Lihong Peng et al. Front Microbiol. .

Abstract

Microbes with abnormal levels have important impacts on the formation and development of various complex diseases. Identifying possible Microbe-Disease Associations (MDAs) helps to understand the mechanisms of complex diseases. However, experimental methods for MDA identification are costly and time-consuming. In this study, a new computational model, RNMFMDA, was developed to find possible MDAs. RNMFMDA contains two main processes. First, Reliable Negative MDA samples were selected based on Positive-Unlabeled (PU) learning and random walk with restart on the heterogeneous microbe-disease network. Second, Logistic Matrix Factorization with Neighborhood Regularization (LMFNR) was developed to compute the association probabilities for all microbe-disease pairs. To evaluate the performance of the proposed RNMFMDA method, we compared RNMFMDA with five state-of-the-art MDA prediction methods based on five-fold cross-validations on microbes, diseases, and MDAs. As a result, RNMFMDA obtained the best AUCs of 0.6332, 0.8669, and 0.9081, respectively for the three five-fold cross validations, significantly outperforming other models. The promising prediction performance may be attributed to the following three features: highly quality negative MDA sample selection, LMFNR-based MDA prediction model, and various biological information integration. In addition, a few predicted microbe-disease pairs with high association scores are worthy of further experimental validation.

Keywords: logistic matrix factorization with neighborhood regularization; microbe-disease associations; positive-unlabeled learning; random walk with restart; reliable negative samples.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Performance comparison of RNMFMDA with other five methods under CV1.
Figure 2
Figure 2
Performance comparison of RNMFMDA with other five methods under CV2.
Figure 3
Figure 3
Performance comparison of RNMFMDA with other five methods under CV3.
Figure 4
Figure 4
The performance comparison under different negative MDA selection ratios under CV1.
Figure 5
Figure 5
The performance comparison under different negative MDA selection ratios under CV2.
Figure 6
Figure 6
The performance comparison under different negative MDA selection ratios under CV3.

References

    1. Abu-Ali G. S., Mehta R. S., Lloyd-Price J., Mallick H., Branck T., Ivey K. L., et al. . (2018). Metatranscriptome of human faecal microbial communities in a cohort of adult men. Nat. Microbiol. 3:356. 10.1038/s41564-017-0084-4 - DOI - PMC - PubMed
    1. Bao W., Jiang Z., Huang D.-S. (2017). Novel human microbe-disease association prediction using network consistency projection. BMC Bioinformatics 18:543. 10.1186/s12859-017-1968-2 - DOI - PMC - PubMed
    1. Byrd A. L., Belkaid Y., Segre J. A. (2018). The human skin microbiome. Nat. Rev. Microbiol. 16:143. 10.1038/nrmicro.2017.157 - DOI - PubMed
    1. Chen L., Zhang Y.-H., Huang G., Pan X., Wang S., Huang T., et al. . (2018). Discriminating cirRNAs from other lncRNAs using a hierarchical extreme learning machine (H-ELM) algorithm with feature selection. Mol. Genet. Genomics 293, 137–149. 10.1007/s00438-017-1372-7 - DOI - PubMed
    1. Chen X., Huang Y.-A., You Z.-H., Yan G.-Y., Wang X.-S. (2017). A novel approach based on Katz measure to predict associations of human microbiota with non-infectious diseases. Bioinformatics 33, 733–739. 10.1093/bioinformatics/btw715 - DOI - PubMed

LinkOut - more resources