Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Jun 15;316(5831):1586-91.
doi: 10.1126/science.1139815.

The Release 5.1 annotation of Drosophila melanogaster heterochromatin

Affiliations

The Release 5.1 annotation of Drosophila melanogaster heterochromatin

Christopher D Smith et al. Science. .

Erratum in

  • Science. 2007 Sep 7;317(5843):1325

Abstract

The repetitive DNA that constitutes most of the heterochromatic regions of metazoan genomes has hindered the comprehensive analysis of gene content and other functions. We have generated a detailed computational and manual annotation of 24 megabases of heterochromatic sequence in the Release 5 Drosophila melanogaster genome sequence. The heterochromatin contains a minimum of 230 to 254 protein-coding genes, which are conserved in other Drosophilids and more diverged species, as well as 32 pseudogenes and 13 noncoding RNAs. Improved methods revealed that more than 77% of this heterochromatin sequence, including introns and intergenic regions, is composed of fragmented and nested transposable elements and other repeated DNAs. Drosophila heterochromatin contains "islands" of highly conserved genes embedded in these "oceans" of complex repeats, which may require special expression and splicing mechanisms.

PubMed Disclaimer

Figures

Fig. 1
Fig. 1
Computational pipeline results used for the Release 5.1 annotation. An Apollo (36) screenshot of the evidence (black region) and Release 5.1 annotations (light blue region) for scaffold CP000218, which was produced by merging, extending, and finishing the Release 3 WGS scaffolds (AABU01002750 and AABU010027230) (9). New cDNA evidence was used to merge the Release 3.2b annotations CG40067, CG17443, and CG40109 into one Release 5.1 gene (CG41520) and to identify an alternative exon for CG41250-RB (asterisk). ncRNA CG40375 is shown on the opposite strand to illustrate that it is nested within CG41520. CG40388 represents a Release 3.2b gene that is now annotated as a TE fragment. Complete annotation and evidence are shown in (11).
Fig. 2
Fig. 2
Evidence of D. melanogaster heterochromatin protein-coding gene annotations. Venn diagrams show the percentage of protein-coding genes supported by gene prediction (pink), EST or cDNA (yellow), and/or BLASTX/TBLASTN comparative genomic evidence (blue). (A) Multi-exon genes are likely to be complete, whereas (B) single-exon genes are likely to represent genes that are fragmented across multiple scaffolds. The number of genes measured for each class are indicated, as well as the number and percent of genes with putative orthologs in melanogaster group species, nonmelanogaster group Drosophilids, or other insect species.
Fig. 3
Fig. 3
Comparative analysis summary for D. melanogaster heterochromatin genes. (A) Number of heterochromatin protein-coding genes with a predicted ortholog in a given species, ordered (left to right) by increasing evolutionary distance from D. melanogaster.(B) Frequency histograms showing the percentage of heterochromatin protein-coding genes with a predicted ortholog in the 16 insect species tested. (C)Scatter plots of intron lengths(bp) for euchromatin and heterochromatin protein-coding gene introns conserved in either D. erecta or D. virilis. Each data point refers to a single conserved intron. Correlation coefficients (r) for intron lengths are indicated.
Fig. 4
Fig. 4
Density of repeat and gene features across the heterochromatin. The average percentages of indicated annotation types are shown for each chromosome region (total length in boxes below the x axis). Euchromatin is an average of the noncentric regions of arms 2, 3, and X only. 2Lh, 2Rh, 3Lh, 3Rh, and Xh describe heterochromatic regions that are contiguous with the chromosome arms, whereas the Het regions are mapped to arms and ordered, but not necessarily in the correct orientation (8, 9). The average percentages of sequences for LTR- and LINE-like retrotransposons (red), DNA transposons (green), other and unknown repeats (blue), and TRF tandem repeats and satellite sequences (gray) are indicated. The average number of genes per 100 kb is shown above each histogram.
Fig. 5
Fig. 5
Overrepresented GO terms in heterochromatin versus euchromatin genes. The percentages of GO molecular-function domains for genes in heterochomatin (red) and euchromatin (blue) are shown. Numbers in parentheses indicate the actual number of domains in heterochromatin. P-value significance scores are shown to the right. Complete GO analyses are presented in the SOM text.

Similar articles

Cited by

  • Recurrent Amplification of the Heterochromatin Protein 1 (HP1) Gene Family across Diptera.
    Helleu Q, Levine MT. Helleu Q, et al. Mol Biol Evol. 2018 Oct 1;35(10):2375-2389. doi: 10.1093/molbev/msy128. Mol Biol Evol. 2018. PMID: 29924345 Free PMC article.
  • Chromatin Organization and Function in Drosophila.
    Llorens-Giralt P, Camilleri-Robles C, Corominas M, Climent-Cantó P. Llorens-Giralt P, et al. Cells. 2021 Sep 8;10(9):2362. doi: 10.3390/cells10092362. Cells. 2021. PMID: 34572010 Free PMC article. Review.
  • FlyVar: a database for genetic variation in Drosophila melanogaster.
    Wang F, Jiang L, Chen Y, Haelterman NA, Bellen HJ, Chen R. Wang F, et al. Database (Oxford). 2015 Aug 19;2015:bav079. doi: 10.1093/database/bav079. Print 2015. Database (Oxford). 2015. PMID: 26289428 Free PMC article.
  • Silencing of Transposable Elements by piRNAs in Drosophila: An Evolutionary Perspective.
    Luo S, Lu J. Luo S, et al. Genomics Proteomics Bioinformatics. 2017 Jun;15(3):164-176. doi: 10.1016/j.gpb.2017.01.006. Epub 2017 Jun 8. Genomics Proteomics Bioinformatics. 2017. PMID: 28602845 Free PMC article. Review.
  • Retrotransposons Are the Major Contributors to the Expansion of the Drosophila ananassae Muller F Element.
    Leung W, Shaffer CD, Chen EJ, Quisenberry TJ, Ko K, Braverman JM, Giarla TC, Mortimer NT, Reed LK, Smith ST, Robic S, McCartha SR, Perry DR, Prescod LM, Sheppard ZA, Saville KJ, McClish A, Morlock EA, Sochor VR, Stanton B, Veysey-White IC, Revie D, Jimenez LA, Palomino JJ, Patao MD, Patao SM, Himelblau ET, Campbell JD, Hertz AL, McEvilly MF, Wagner AR, Youngblom J, Bedi B, Bettincourt J, Duso E, Her M, Hilton W, House S, Karimi M, Kumimoto K, Lee R, Lopez D, Odisho G, Prasad R, Robbins HL, Sandhu T, Selfridge T, Tsukashima K, Yosif H, Kokan NP, Britt L, Zoellner A, Spana EP, Chlebina BT, Chong I, Friedman H, Mammo DA, Ng CL, Nikam VS, Schwartz NU, Xu TQ, Burg MG, Batten SM, Corbeill LM, Enoch E, Ensign JJ, Franks ME, Haiker B, Ingles JA, Kirkland LD, Lorenz-Guertin JM, Matthews J, Mittig CM, Monsma N, Olson KJ, Perez-Aragon G, Ramic A, Ramirez JR, Scheiber C, Schneider PA, Schultz DE, Simon M, Spencer E, Wernette AC, Wykle ME, Zavala-Arellano E, McDonald MJ, Ostby K, Wendland P, DiAngelo JR, Ceasrine AM, Cox AH, Docherty JEB, Gingras RM, Grieb SM, Pavia MJ, Personius CL, Polak GL, Beach DL, Cerritos HL, Horansky EA, Sharif KA, Moran R, Parrish S, Bickford K, Bland J, Broussard J, … See abstract for full author list ➔ Leung W, et al. G3 (Bethesda). 2017 Aug 7;7(8):2439-2460. doi: 10.1534/g3.117.040907. G3 (Bethesda). 2017. PMID: 28667019 Free PMC article.

References

    1. Hoskins RA, et al. Genome Biol. 2002;3:RESEARCH0085. - PMC - PubMed
    1. The Arabidopsis Genome Initiative. Nature. 2000;408:796. - PubMed
    1. Sullivan BA, Blower MD, Karpen GH. Nat. Rev. Genet. 2001;2:584. - PubMed
    1. Brennecke J, et al. Cell. 2007;128:1089. - PubMed
    1. Elgin SC, Grewal SI. Curr. Biol. 2003;13:R895. - PubMed

Publication types

MeSH terms