GISSD: Group I Intron Sequence and Structure Database

Yu Zhou¹, Chen Lu, Qi-Jia Wu, Yu Wang, Zhi-Tao Sun, Jia-Cong Deng, Yi Zhang

Affiliations

PMID: 17942415
PMCID: PMC2238919
DOI: 10.1093/nar/gkm766

GISSD: Group I Intron Sequence and Structure Database

Yu Zhou et al. Nucleic Acids Res. 2008 Jan.

. 2008 Jan;36(Database issue):D31-7.

doi: 10.1093/nar/gkm766. Epub 2007 Oct 16.

Authors

Yu Zhou¹, Chen Lu, Qi-Jia Wu, Yu Wang, Zhi-Tao Sun, Jia-Cong Deng, Yi Zhang

Affiliation

¹ State Key Laboratory of Virology, College of Life Sciences, Wuhan University, Wuhan, Hubei 430072, People's Republic of China.

PMID: 17942415
PMCID: PMC2238919
DOI: 10.1093/nar/gkm766

Abstract

Group I Intron Sequence and Structure Database (GISSD) is a specialized and comprehensive database for group I introns, focusing on the integration of useful group I intron information from available databases and providing de novo data that is essential for understanding these introns at a systematic level. This database presents 1789 complete intron records, including the nucleotide sequence of each annotated intron plus 15 nt of the upstream and downstream exons, and the pseudoknots-containing secondary structures predicted by integrating comparative sequence analyses and minimal free energy algorithms. These introns represent all 14 subgroups, with their structure-based alignments being separately provided. Both structure predictions and alignments were done manually and iteratively adjusted, which yielded a reliable consensus structure for each subgroup. These consensus structures allowed us to judge the confidence of 20 085 group I introns previously found by the INFERNAL program and to classify them into subgroups automatically. The database provides intron-associated taxonomy information from GenBank, allowing one to view the detailed distribution of all group I introns. CDSs residing in introns and 3D structure information are also integrated if available. About 17 000 group I introns have been validated in this database; approximately 95% of them belong to the IC3 subgroup and reside in the chloroplast tRNA(Leu) gene. The GISSD database can be accessed at http://www.rna.whu.edu.cn/gissd/

PubMed Disclaimer

Figures

**Figure 1.**
GISSD pipeline. Green blocks indicate foreign databases, blue blocks highlight the core data of GISSD and the orange block indicates the local taxonomy data in GISSD. CM: covariance model, which is computed from intron alignments by the INFERNAL software package.

**Figure 2.**
The schematic representation of secondary structure prediction and alignment. (I) Locate the core components in the intron by using conservative sequence patterns. The order is as: (1) find J6/7-P7, which was very conservative; (2) find P3′, which is usually 2 or 3 nt after P7, if no insertion sequences; (3) find J8/7-P7′, according to the base pairing of P7 and P7′ and the conservation of J8/7 and (4) find P3, paired with P3′. (II) Partition the sequence into four parts. In the first part, 5′ exon and 3′ exon sequences were used to find P1′ and P10, and the rest of the sequence before P3 was used to identify P2 and P2.1. (**III**) The four parts were folded by *RNAstructure* 4.11 (18) separately. Besides the minimum energy structure, other suboptimal structures were also checked. By comparing the folded structure to known structures, the structure having similar pattern was chosen manually. (IV) The whole structure of an intron was completed by integrating the structures of the subsequences. (V) When structure prediction was finished for certain numbers of introns in a subgroup, the alignment process was started. The core components and peripheral elements were sequentially aligned manually based on their structures. A point to emphasize is that the aligning process and the structure prediction procedure were iteratively done. Once one part of an intron ran into difficulty in the alignment with other sequences, the corresponding structure was reselected from the candidate structures of *RNAstructure*. Sometimes the core components needed to be reconsidered, and the whole process of structure prediction was redone for that particular intron.

**Figure 3.**
Screenshots of GISSD. (A) Intron search page, (B) search result page, (C) sequence, structure and alignment page, (D) intron distribution page and (E) gIRfam page.

See this image and copyright information in PMC

Cited by

A Phylogenetic Approach to Structural Variation in Organization of Nuclear Group I Introns and Their Ribozymes.
Furulund BMN, Karlsen BO, Babiak I, Johansen SD. Furulund BMN, et al. Noncoding RNA. 2021 Jul 22;7(3):43. doi: 10.3390/ncrna7030043. Noncoding RNA. 2021. PMID: 34449660 Free PMC article.
LAHEDES: the LAGLIDADG homing endonuclease database and engineering server.
Taylor GK, Petrucci LH, Lambert AR, Baxter SK, Jarjour J, Stoddard BL. Taylor GK, et al. Nucleic Acids Res. 2012 Jul;40(Web Server issue):W110-6. doi: 10.1093/nar/gks365. Epub 2012 May 8. Nucleic Acids Res. 2012. PMID: 22570419 Free PMC article.
RNArchitecture: a database and a classification system of RNA families, with a focus on structural information.
Boccaletto P, Magnus M, Almeida C, Zyla A, Astha A, Pluta R, Baginski B, Jankowska E, Dunin-Horkawicz S, Wirecki TK, Boniecki MJ, Stefaniak F, Bujnicki JM. Boccaletto P, et al. Nucleic Acids Res. 2018 Jan 4;46(D1):D202-D205. doi: 10.1093/nar/gkx966. Nucleic Acids Res. 2018. PMID: 29069520 Free PMC article.
Single molecule fluorescence approaches shed light on intracellular RNAs.
Pitchiaya S, Heinicke LA, Custer TC, Walter NG. Pitchiaya S, et al. Chem Rev. 2014 Mar 26;114(6):3224-65. doi: 10.1021/cr400496q. Epub 2014 Jan 8. Chem Rev. 2014. PMID: 24417544 Free PMC article. Review. No abstract available.
Convergent evolution of twintron-like configurations: One is never enough.
Hafez M, Hausner G. Hafez M, et al. RNA Biol. 2015;12(12):1275-88. doi: 10.1080/15476286.2015.1103427. RNA Biol. 2015. PMID: 26513606 Free PMC article. Review.

See all "Cited by" articles

References

1. Burke J.M., Belfort M., Cech T.R., Davies R.W., Schweyen R.J., Shub D.A., Szostak J.W., Tabak H.F. Structural conventions for group I introns. Nucleic Acids Res. 1987;15:7217–7221. - PMC - PubMed
1. Cech T.R. Self-splicing of group I introns. Annu. Rev. Biochem. 1990;59:543–568. - PubMed
1. Vicens Q., Cech T.R. Atomic level architecture of group I introns revealed. Trends Biochem. Sci. 2006;31:41–51. - PubMed
1. Michel F., Westhof E. Modelling of the three-dimensional architecture of group I catalytic introns based on comparative sequence analysis. J. Mol. Biol. 1990;216:585–610. - PubMed
1. Li Z.J., Zhang Y. Predicting the secondary structures and tertiary interactions of 211 group I introns in IE subgroup. Nucleic Acids Res. 2005;33:2118–2128. - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
Research Materials
- NCI CPTC Antibody Characterization Program

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

GISSD: Group I Intron Sequence and Structure Database

Affiliation

GISSD: Group I Intron Sequence and Structure Database

Authors

Affiliation

Abstract

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Other Literature Sources

Research Materials