Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Sep 30;38(19):4636-4638.
doi: 10.1093/bioinformatics/btac572.

MMGraph: a multiple motif predictor based on graph neural network and coexisting probability for ATAC-seq data

Affiliations

MMGraph: a multiple motif predictor based on graph neural network and coexisting probability for ATAC-seq data

Shuangquan Zhang et al. Bioinformatics. .

Abstract

Motivation: Transcription factor binding sites (TFBSs) prediction is a crucial step in revealing functions of transcription factors from high-throughput sequencing data. Assay for Transposase-Accessible Chromatin using sequencing (ATAC-seq) provides insight on TFBSs and nucleosome positioning by probing open chromatic, which can simultaneously reveal multiple TFBSs compare to traditional technologies. The existing tools based on convolutional neural network (CNN) only find the fixed length of TFBSs from ATAC-seq data. Graph neural network (GNN) can be considered as the extension of CNN, which has great potential in finding multiple TFBSs with different lengths from ATAC-seq data.

Results: We develop a motif predictor called MMGraph based on three-layer GNN and coexisting probability of k-mers for finding multiple motifs from ATAC-seq data. The results of the experiment which has been conducted on 88 ATAC-seq datasets indicate that MMGraph has achieved the best performance on area of eight metrics radar score of 2.31 and could find 207 higher-quality multiple motifs than other existing tools.

Availability and implementation: MMGraph is wrapped in Python package, which is available at https://github.com/zhangsq06/MMGraph.git.

Supplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
The whole MMGraph workflow consists of three steps

Similar articles

Cited by

References

    1. Alipanahi B. et al. (2015) Predicting the sequence specificities of DNA-and RNA-binding proteins by deep learning. Nat. Biotechnol., 33, 831–838. - PubMed
    1. Bentsen M. et al. (2020) ATAC-seq footprinting unravels kinetics of transcription factor binding during zygotic genome activation. Nat. Commun., 11, 1–11. - PMC - PubMed
    1. Colonnese S. et al. (2021) Protein-Protein Interaction Prediction via Graph Signal Processing. In: IEEE Access, vol. 9, pp. 142681–142692. https://doi.org/10.1109/ACCESS.2021.3119569.
    1. Fletez-Brant C. et al. (2013) kmer-SVM: a web server for identifying predictive regulatory sequence features in genomic data sets. Nucleic Acids Res., 41, W544–W556. - PMC - PubMed
    1. Norouzi M. et al. (2012) Hamming distance metric learning. In: Advances in Neural Information Processing Systems, vol. 25, MIT Press.

Publication types

MeSH terms