PatMatch: a program for finding patterns in peptide and nucleotide sequences
- PMID: 15980466
- PMCID: PMC1160129
- DOI: 10.1093/nar/gki368
PatMatch: a program for finding patterns in peptide and nucleotide sequences
Abstract
Here, we present PatMatch, an efficient, web-based pattern-matching program that enables searches for short nucleotide or peptide sequences such as cis-elements in nucleotide sequences or small domains and motifs in protein sequences. The program can be used to find matches to a user-specified sequence pattern that can be described using ambiguous sequence codes and a powerful and flexible pattern syntax based on regular expressions. A recent upgrade has improved performance and now supports both mismatches and wildcards in a single pattern. This enhancement has been achieved by replacing the previous searching algorithm, scan_for_matches [D'Souza et al. (1997), Trends in Genetics, 13, 497-498], with nondeterministic-reverse grep (NR-grep), a general pattern matching tool that allows for approximate string matching [Navarro (2001), Software Practice and Experience, 31, 1265-1312]. We have tailored NR-grep to be used for DNA and protein searches with PatMatch. The stand-alone version of the software can be adapted for use with any sequence dataset and is available for download at The Arabidopsis Information Resource (TAIR) at ftp://ftp.arabidopsis.org/home/tair/Software/Patmatch/. The PatMatch server is available on the web at http://www.arabidopsis.org/cgi-bin/patmatch/nph-patmatch.pl for searching Arabidopsis thaliana sequences.
Figures

Similar articles
-
Gene structure prediction from consensus spliced alignment of multiple ESTs matching the same genomic locus.Bioinformatics. 2004 May 1;20(7):1157-69. doi: 10.1093/bioinformatics/bth058. Epub 2004 Feb 5. Bioinformatics. 2004. PMID: 14764557
-
MEME: discovering and analyzing DNA and protein sequence motifs.Nucleic Acids Res. 2006 Jul 1;34(Web Server issue):W369-73. doi: 10.1093/nar/gkl198. Nucleic Acids Res. 2006. PMID: 16845028 Free PMC article.
-
GeneSeqer@PlantGDB: Gene structure prediction in plant genomes.Nucleic Acids Res. 2003 Jul 1;31(13):3597-600. doi: 10.1093/nar/gkg533. Nucleic Acids Res. 2003. PMID: 12824374 Free PMC article.
-
Patscanui: an intuitive web interface for searching patterns in DNA and protein data.Nucleic Acids Res. 2018 Jul 2;46(W1):W205-W208. doi: 10.1093/nar/gky321. Nucleic Acids Res. 2018. PMID: 29722870 Free PMC article.
-
TAIR: a resource for integrated Arabidopsis data.Funct Integr Genomics. 2002 Nov;2(6):239-53. doi: 10.1007/s10142-002-0077-z. Epub 2002 Oct 3. Funct Integr Genomics. 2002. PMID: 12444417 Review.
Cited by
-
Comprehensive Tissue-Specific Transcriptome Analysis Reveals Distinct Regulatory Programs during Early Tomato Fruit Development.Plant Physiol. 2015 Aug;168(4):1684-701. doi: 10.1104/pp.15.00287. Epub 2015 Jun 22. Plant Physiol. 2015. PMID: 26099271 Free PMC article.
-
GRAS-domain transcription factor PAT1 regulates jasmonic acid biosynthesis in grape cold stress response.Plant Physiol. 2021 Jul 6;186(3):1660-1678. doi: 10.1093/plphys/kiab142. Plant Physiol. 2021. PMID: 33752238 Free PMC article.
-
AthaMap web tools for the analysis and identification of co-regulated genes.Nucleic Acids Res. 2007 Jan;35(Database issue):D857-62. doi: 10.1093/nar/gkl1006. Epub 2006 Dec 5. Nucleic Acids Res. 2007. PMID: 17148485 Free PMC article.
-
Candidate regulators and target genes of drought stress in needles and roots of Norway spruce.Tree Physiol. 2021 Jul 5;41(7):1230-1246. doi: 10.1093/treephys/tpaa178. Tree Physiol. 2021. PMID: 33416078 Free PMC article.
-
Plant MetGenMAP: an integrative analysis system for plant systems biology.Plant Physiol. 2009 Dec;151(4):1758-68. doi: 10.1104/pp.109.145169. Epub 2009 Oct 9. Plant Physiol. 2009. PMID: 19819981 Free PMC article.
References
-
- Huala E., Dickerman A., Garcia-Hernandez M., Weems D., Reiser L., LaFond F., Hanley D., Kiphart D., Zhuang J., Huang W., et al. The Arabidopsis Information Resource (TAIR): a comprehensive database and web-based information retrieval, analysis, and visualization system for a model plant. Nucleic Acids Res. 2001;29:102–105. - PMC - PubMed
-
- Rhee S.Y., Beavis W., Berardini T.Z., Chen G., Dixon D., Doyle A., Garcia-Hernandez M., Huala E., Lander G., Montoya M., et al. The Arabidopsis Information Resource (TAIR): a model organism database providing a centralized, curated gateway to Arabidopsis biology, research materials and community. Nucleic Acids Res. 2003;31:224–228. - PMC - PubMed
-
- Navarro G. NR-grep: a fast and flexible pattern matching tool. Software Practice and Experience. 2001;31:1265–1312.
-
- D'Souza M., Larsen N., Overbeek R. Searching for patterns in genomic data. Trends Genet. 1997;13:597–498. - PubMed
Publication types
MeSH terms
Substances
LinkOut - more resources
Full Text Sources
Other Literature Sources