Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review

WeMine Aligned Pattern Clustering System for Biosequence Pattern Analysis

In: Bioinformatics [Internet]. Brisbane (AU): Exon Publications; 2021 Mar 20. Chapter 8.
Affiliations
Free Books & Documents
Review

WeMine Aligned Pattern Clustering System for Biosequence Pattern Analysis

En-Shiun Annie Lee et al.
Free Books & Documents

Excerpt

A major challenge in bioinformatics is discovering functional regions in biosequences. These regions may correspond to folded structures, physicochemical functionality, or mutation hotspots. The identification of functional regions in biosequences is essential to better understand biological mechanisms, design new drugs, and uncover novel knowledge concerning sporadic and genetic diseases. Pattern analysis and WeMine aligned pattern clustering (APC) systems enable the discovery of conserved regions with adaptive width and mutations, including frameshift, without relying on prior knowledge or exhaustive search. They align and rank patterns in local and distant correlated regions with statistical support within, and between, sequences. This chapter provides an overview of the WeMine APC and its utility in identifying functional regions such as protein binding sites, predicting pairwise interactions between protein-DNA and protein-protein network, and finding correlations among patterns and residues with class labels. Pattern analysis and WeMine APC could play an important role in personalized medicine, gene therapy, biomarker identification and drug discovery.

PubMed Disclaimer

References

    1. Bailey, Johnson J, Grant CE, Noble WS. The MEME suite. Nucl Acids Res. 2015;43(W1):W39–W49. https://doi.org/10.1093/nar/gkv416 . - DOI - PMC - PubMed
    1. Frith MC, Saunders NF, Kobe B, Bailey TL. Discovering sequence motifs with arbitrary insertions and deletions. PLoS Comput Biol. 2008;4(5):e1000071. https://doi.org/10.1371/journal.pcbi.1000071 . - DOI - PMC - PubMed
    1. Altschul SF, Madden TL, Schaffer AA, Zhang J, Zhang Z, Miller W, et al. Gapped BLAST and PSI-BLAST: a new generation of protein database search programs. Nucl Acids Res. 1997;25(17):3389–3402. https://doi.org/10.1093/nar/25.17.3389 . - DOI - PMC - PubMed
    1. Basic Local Alignment Search Tool. [Online] Available from: https://blast.ncbi.nlm.nih.gov/Blast.cgi?CMD=Web&PAGE_TYPE=BlastHome.
    1. Chen J. Contiguous item sequential pattern mining using UpDown Tree. Intell Data Anal. 2008;12(1):25–49. https://doi.org/10.3233/IDA-2008-12103 . - DOI

LinkOut - more resources