Mining for single nucleotide polymorphisms and insertions/deletions in maize expressed sequence tag data
- PMID: 12746514
- PMCID: PMC166954
- DOI: 10.1104/pp.102.019422
Mining for single nucleotide polymorphisms and insertions/deletions in maize expressed sequence tag data
Abstract
We have developed a computer based method to identify candidate single nucleotide polymorphisms (SNPs) and small insertions/deletions from expressed sequence tag data. Using a redundancy-based approach, valid SNPs are distinguished from erroneous sequence by their representation multiple times in an alignment of sequence reads. A second measure of validity was also calculated based on the cosegregation of the SNP pattern between multiple SNP loci in an alignment. The utility of this method was demonstrated by applying it to 102,551 maize (Zea mays) expressed sequence tag sequences. A total of 14,832 candidate polymorphisms were identified with an SNP redundancy score of two or greater. Segregation of these SNPs with haplotype indicates that candidate SNPs with high redundancy and cosegregation confidence scores are likely to represent true SNPs. This was confirmed by validation of 264 candidate SNPs from 27 loci, with a range of redundancy and cosegregation scores, in four inbred maize lines. The SNP transition/transversion ratio and insertion/deletion size frequencies correspond to those observed by direct sequencing methods of SNP discovery and suggest that the majority of predicted SNPs and insertion/deletions identified using this approach represent true genetic variation in maize.
Figures




Similar articles
-
Redundancy based detection of sequence polymorphisms in expressed sequence tag data using autoSNP.Bioinformatics. 2003 Feb 12;19(3):421-2. doi: 10.1093/bioinformatics/btf881. Bioinformatics. 2003. PMID: 12584131
-
Mining single nucleotide polymorphisms from EST data of silkworm, Bombyx mori, inbred strain Dazao.Insect Biochem Mol Biol. 2004 Jun;34(6):523-30. doi: 10.1016/j.ibmb.2004.02.004. Insect Biochem Mol Biol. 2004. PMID: 15147754
-
High-throughput identification, database storage and analysis of SNPs in EST sequences.Genome Inform. 2001;12:194-203. Genome Inform. 2001. PMID: 11791238
-
Applications of single nucleotide polymorphisms in crop genetics.Curr Opin Plant Biol. 2002 Apr;5(2):94-100. doi: 10.1016/s1369-5266(02)00240-6. Curr Opin Plant Biol. 2002. PMID: 11856602 Review.
-
Small insertions and deletions (INDELs) in human genomes.Hum Mol Genet. 2010 Oct 15;19(R2):R131-6. doi: 10.1093/hmg/ddq400. Epub 2010 Sep 21. Hum Mol Genet. 2010. PMID: 20858594 Free PMC article. Review.
Cited by
-
Development of PCR-based SNP markers for rice blast resistance genes at the Piz locus.Theor Appl Genet. 2004 May;108(7):1212-20. doi: 10.1007/s00122-003-1553-0. Epub 2004 Jan 23. Theor Appl Genet. 2004. PMID: 14740086
-
Genome-Wide Discovery of InDel Markers in Sesame (Sesamum indicum L.) Using ddRADSeq.Plants (Basel). 2020 Sep 24;9(10):1262. doi: 10.3390/plants9101262. Plants (Basel). 2020. PMID: 32987937 Free PMC article.
-
QualitySNP: a pipeline for detecting single nucleotide polymorphisms and insertions/deletions in EST data from diploid and polyploid species.BMC Bioinformatics. 2006 Oct 9;7:438. doi: 10.1186/1471-2105-7-438. BMC Bioinformatics. 2006. PMID: 17029635 Free PMC article.
-
SNPs discovery and CAPS marker conversion in soybean.Mol Biol Rep. 2011 Mar;38(3):1841-6. doi: 10.1007/s11033-010-0300-2. Epub 2010 Sep 22. Mol Biol Rep. 2011. PMID: 20859693
-
Mapping Ds insertions in barley using a sequence-based approach.Mol Genet Genomics. 2004 Sep;272(2):181-93. doi: 10.1007/s00438-004-1035-3. Epub 2004 Jul 30. Mol Genet Genomics. 2004. PMID: 15449176
References
-
- Adams MD, Kerlavage AR, Fleischmann RD, Fuldner RA, Bult CJ, Lee NH, Kirkness EF, Weinstock KG, Gocayne JD, White O et al. Initial assessment of human gene diversity and expression patterns based upon 83-million nucleotides of cDNA sequence. Nature. 1995;377:3. - PubMed
-
- Bhattramakki D, Dolan M, Hanafey M, Wineland R, Vaske D, Register JC, III, Tingey SV, Rafalski A. Insertion-deletion polymorphisms in 3′ regions of maize genes occur frequently and can be used as highly informative genetic markers. Plant Mol Biol. 2002;48:539–547. - PubMed
-
- Buetow KH, Edmonson MN, Cassidy AB. Reliable identification of large numbers of candidate SNPs from public EST data. Nat Genet. 1999;21:323–325. - PubMed
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources