Evaluation of deep learning approaches for modeling transcription factor sequence specificity
- PMID: 34534646
- DOI: 10.1016/j.ygeno.2021.09.009
Evaluation of deep learning approaches for modeling transcription factor sequence specificity
Abstract
As a key component of gene regulation, transcription factors (TFs) play an important role in a number of biological processes. To fully understand the underlying mechanism of TF-mediated gene regulation, it is therefore critical to accurately identify TF binding sites and predict their affinities. Recently, deep learning (DL) algorithms have achieved promising results in the prediction of DNA-TF binding, however, various deep learning architectures have not been systematically compared, and the relative merit of each architecture remains unclear. To address this problem, we applied four different deep learning architectures to SELEX-seq and HT-SELEX data, covering three species and 35 families. We evaluated and compared the performance of different deep neural models using 10-fold cross-validation. Our results indicate that the hybrid CNN + DNN model shows the best performances. We expect that our study will be broadly applicable to modeling and predicting TF binding specificity when more high-throughput affinity data are available.
Keywords: Binding specificity; Convolutional neural network; Deep learning; Deep neural network; Transcription factor.
Copyright © 2021 Elsevier Inc. All rights reserved.
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Research Materials
Miscellaneous