Analysis of the largest tandemly repeated DNA families in the human genome
- PMID: 18992157
- PMCID: PMC2588610
- DOI: 10.1186/1471-2164-9-533
Analysis of the largest tandemly repeated DNA families in the human genome
Abstract
Background: Tandemly Repeated DNA represents a large portion of the human genome, and accounts for a significant amount of copy number variation. Here we present a genome wide analysis of the largest tandem repeats found in the human genome sequence.
Results: Using Tandem Repeats Finder (TRF), tandem repeat arrays greater than 10 kb in total size were identified, and classified into simple sequence e.g. GAATG, classical satellites e.g. alpha satellite DNA, and locus specific VNTR arrays. Analysis of these large sequenced regions revealed that several "simple sequence" arrays actually showed complex domain and/or higher order repeat organization. Using additional methods, we further identified a total of 96 additional arrays with tandem repeat units greater than 2 kb (the detection limit of TRF), 53 of which contained genes or repeated exons. The overall size of an array of tandem 12 kb repeats which spanned a gap on chromosome 8 was found to be 600 kb to 1.7 Mbp in size, representing one of the largest non-centromeric arrays characterized. Several novel megasatellite tandem DNA families were observed that are characterized by repeating patterns of interspersed transposable elements that have expanded presumably by unequal crossing over. One of these families is found on 11 different chromosomes in >25 arrays, and represents one of the largest most widespread megasatellite DNA families.
Conclusion: This study represents the most comprehensive genome wide analysis of large tandem repeats in the human genome, and will serve as an important resource towards understanding the organization and copy number variation of these complex DNA families.
Figures





Similar articles
-
[Tandem repeats in rodents genome and their mapping].Tsitologiia. 2015;57(2):102-10. Tsitologiia. 2015. PMID: 26035967 Russian.
-
Human megasatellite DNA RS447: copy-number polymorphisms and interspecies conservation.Genomics. 1998 Nov 15;54(1):39-49. doi: 10.1006/geno.1998.5545. Genomics. 1998. PMID: 9806828
-
Tandemly repeated DNA families in the mouse genome.BMC Genomics. 2011 Oct 28;12:531. doi: 10.1186/1471-2164-12-531. BMC Genomics. 2011. PMID: 22035034 Free PMC article.
-
Complex structure of knobs and centromeric regions in maize chromosomes.Tsitol Genet. 2000 Mar-Apr;34(2):11-5. Tsitol Genet. 2000. PMID: 10857197 Review.
-
Satellite DNAs between selfishness and functionality: structure, genomics and evolution of tandem repeats in centromeric (hetero)chromatin.Gene. 2008 Feb 15;409(1-2):72-82. doi: 10.1016/j.gene.2007.11.013. Epub 2007 Dec 4. Gene. 2008. PMID: 18182173 Review.
Cited by
-
Oncogenic ETS fusions promote DNA damage and proinflammatory responses via pericentromeric RNAs in extracellular vesicles.J Clin Invest. 2024 Mar 26;134(9):e169470. doi: 10.1172/JCI169470. J Clin Invest. 2024. PMID: 38530366 Free PMC article.
-
Function and evolution of local repeats in the Firre locus.Nat Commun. 2016 Mar 24;7:11021. doi: 10.1038/ncomms11021. Nat Commun. 2016. PMID: 27009974 Free PMC article.
-
Condensin controls cellular RNA levels through the accurate segregation of chromosomes instead of directly regulating transcription.Elife. 2018 Sep 19;7:e38517. doi: 10.7554/eLife.38517. Elife. 2018. PMID: 30230473 Free PMC article.
-
Characterization of DXZ4 conservation in primates implies important functional roles for CTCF binding, array expression and tandem repeat organization on the X chromosome.Genome Biol. 2011;12(4):R37. doi: 10.1186/gb-2011-12-4-r37. Epub 2011 Apr 13. Genome Biol. 2011. PMID: 21489251 Free PMC article.
-
Genetic structures of copy number variants revealed by genotyping single sperm.PLoS One. 2009;4(4):e5236. doi: 10.1371/journal.pone.0005236. Epub 2009 Apr 22. PLoS One. 2009. PMID: 19384415 Free PMC article.
References
Publication types
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous