Global identification of transcription start sites in the genome of Apis mellifera using 5'LongSAGE
- PMID: 21695780
- DOI: 10.1002/jez.b.21421
Abstract
The precise identification of the transcription start sites (TSSs) of genes in the honeybee genome will be helpful for inferring start codons and for determining promoter elements. The 5'SAGE approach provides a powerful tool for identifying TSSs in the sequenced genome. The main purpose of this study is to identify the actual TSSs of expressed genes, as well as the usage of different TSSs, in the Apis mellifera genome. We performed a 5'LongSAGE (5'LS) analysis of the adult drone head, and the TSSs of the expressed genes were determined by mapping the 5'LS tag sequences to the honeybee genome. A total of 8,280 unique 19 bp 5'LS tag sequences were identified that corresponded to 3,655 predicted genes. Of these tags, 4,998 (60.4%) were mapped to a region from -1,000 bp to +100 bp of the start codon of 2,301 reference coding sequences. Notably, we observed that 28-47% of the 3,655 honeybee genes initiated transcription from alternative TSSs. The TSS consensus pattern of the honeybee genes, DT(rich) PyPu(G(rich))(T/A)(T(rich))(3), was obtained by aligning the sequences flanking the 5'LS-TSSs. We also identified three new genes in the regions downstream of 5'LS tags and validated 21 TSSs using RT-PCR amplification. Additionally, 17 genes identified by the 5'LS tags were associated with the Gene Ontology term "behavior." Mapping of the 5'LS tags on the genome not only provided direct evidence of expression for in silico predicted genes but also allowed for the identification of previously unrecognized, novel exons and alternative TSSs.
Copyright © 2011 Wiley Periodicals, Inc.
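The window criterion described in the abstract (tags falling between -1,000 bp and +100 bp of a start codon) can be illustrated with a minimal sketch. This is not the authors' pipeline: the `Gene` record, the function names, and the coordinates in the usage note are hypothetical, and the offset is assumed to be a simple coordinate difference flipped for minus-strand genes.

```python
from dataclasses import dataclass

UPSTREAM = 1000    # window start: -1,000 bp relative to the start codon
DOWNSTREAM = 100   # window end: +100 bp relative to the start codon


@dataclass
class Gene:
    name: str          # hypothetical gene identifier
    strand: str        # "+" or "-"
    start_codon: int   # genomic coordinate of the first base of the start codon


def relative_position(tag_pos: int, gene: Gene) -> int:
    """Offset of a tag's 5' end from the start codon, in transcript orientation.

    Negative values are upstream of the start codon. Assumes a plain
    coordinate difference, sign-flipped for genes on the minus strand.
    """
    offset = tag_pos - gene.start_codon
    return offset if gene.strand == "+" else -offset


def in_window(tag_pos: int, gene: Gene) -> bool:
    """True if the tag lies within -1,000 bp to +100 bp of the start codon."""
    return -UPSTREAM <= relative_position(tag_pos, gene) <= DOWNSTREAM


def percent(part: int, whole: int) -> float:
    """Percentage rounded to one decimal place."""
    return round(100 * part / whole, 1)
```

For example, with a hypothetical plus-strand gene whose start codon is at coordinate 5,000, a tag at 4,200 sits at -800 and is inside the window, while a tag at 3,900 sits at -1,100 and is outside it. The same helper reproduces the reported fraction: `percent(4998, 8280)` gives 60.4.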