Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jul 2;20(1):370.
doi: 10.1186/s12859-019-2969-0.

Exploring the druggable space around the Fanconi anemia pathway using machine learning and mechanistic models

Affiliations

Exploring the druggable space around the Fanconi anemia pathway using machine learning and mechanistic models

Marina Esteban-Medina et al. BMC Bioinformatics. .

Abstract

Background: In spite of the abundance of genomic data, predictive models that describe phenotypes as a function of gene expression or mutations are difficult to obtain because they are affected by the curse of dimensionality, given the disbalance between samples and candidate genes. And this is especially dramatic in scenarios in which the availability of samples is difficult, such as the case of rare diseases.

Results: The application of multi-output regression machine learning methodologies to predict the potential effect of external proteins over the signaling circuits that trigger Fanconi anemia related cell functionalities, inferred with a mechanistic model, allowed us to detect over 20 potential therapeutic targets.

Conclusions: The use of artificial intelligence methods for the prediction of potentially causal relationships between proteins of interest and cell activities related with disease-related phenotypes opens promising avenues for the systematic search of new targets in rare diseases.

Keywords: Big data; Fanconi anemia; Genomics; Machine learning; Mathematical models; Signaling pathways.

PubMed Disclaimer

Conflict of interest statement

The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Fanconi anemia curated map, based in the KEGG FA pathway. There are two protein complexes: RPA, composed of RPA1, RPA2, RPA3 and RPA4, and Core, composed of FANCM, FANCG, FANCL, FAAP100, FANCA, FANCB, UBE2T, STRA13, FANCC, FAAP24, HES1, FANCE, FANCF, BLM, RMI1, RMI2 and TOP3A. At the end of the effector nodes, whose names are taken for the circuits, a description of the main functionalities triggered by the signaling circuits can be found
Fig. 2
Fig. 2
Schema of the procedure followed for the analysis
Fig. 3
Fig. 3
Observed distribution of circuit activities in the comparison between healthy and FA bone marrow cells
Fig. 4
Fig. 4
Observed distribution of circuit activities in blood, a tissue affected by the disease, two tissues with a high rate of cell replication (skin and gastrointestinal), where DNA reparation is expected to play a relevant role and another tissue with low rate of cell replication (brain)
Fig. 5
Fig. 5
Distributions of the cross-validation of the relevance values for the top 50 most relevant genes ordered by their mean. Above the relevance value of 0.006 the relevance rendered by the ML procedure and the means obtained from the cross-validation are consistent. Then this value is taken as a threshold
Fig. 6
Fig. 6
the distribution of the R2 score for each signaling circuit of the FA pathway across all the training/test splits. The R2 score goes from -infinite to 1, where 0 represents a model that always predicts the mean for each task and a perfect model has a score of 1
Fig. 7
Fig. 7
Enrichment analysis with GO terms and rare diseases

References

    1. Kahvejian A, Quackenbush J, Thompson JF. What would you do if you could sequence everything? Nat Biotechnol. 2008;26(10):1125–1133. doi: 10.1038/nbt1494. - DOI - PMC - PubMed
    1. Mardis ER. DNA sequencing technologies: 2006–2016. Nat Protoc. 2017;12(2):213. doi: 10.1038/nprot.2016.182. - DOI - PubMed
    1. Stephens ZD, Lee SY, Faghri F, Campbell RH, Zhai C, Efron MJ, Iyer R, Schatz MC, Sinha S, Robinson GE. Big data: astronomical or genomical? PLoS Biol. 2015;13(7):e1002195. doi: 10.1371/journal.pbio.1002195. - DOI - PMC - PubMed
    1. Topol EJ. High-performance medicine: the convergence of human and artificial intelligence. Nat Med. 2019;25(1):44. doi: 10.1038/s41591-018-0300-7. - DOI - PubMed
    1. Lindsey R, Daluiski A, Chopra S, Lachapelle A, Mozer M, Sicular S, Hanel D, Gardner M, Gupta A, Hotchkiss R. Deep neural network improves fracture detection by clinicians. Proc Natl Acad Sci. 2018;115(45):11591–11596. doi: 10.1073/pnas.1806905115. - DOI - PMC - PubMed