Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Jan 6;53(D1):D331-D339.
doi: 10.1093/nar/gkae1018.

ASpdb: an integrative knowledgebase of human protein isoforms from experimental and AI-predicted structures

Affiliations

ASpdb: an integrative knowledgebase of human protein isoforms from experimental and AI-predicted structures

Yuntao Yang et al. Nucleic Acids Res. .

Abstract

Alternative splicing is a crucial cellular process in eukaryotes, enabling the generation of multiple protein isoforms with diverse functions from a single gene. To better understand the impact of alternative splicing on protein structures, protein-protein interaction and human diseases, we developed ASpdb (https://biodataai.uth.edu/ASpdb/), a comprehensive database integrating experimentally determined structures and AlphaFold 2-predicted models for human protein isoforms. ASpdb includes over 3400 canonical isoforms, each represented by both experimentally resolved and predicted structures, and >7200 alternative isoforms with AlphaFold 2 predictions. In addition to detailed splicing events, 3D structures, sequence variations and functional annotations, ASpdb uniquely offers comparative analyses and visualization of structural alterations among isoforms. This resource is invaluable for advancing research in alternative splicing, structural biology and disease mechanisms.

PubMed Disclaimer

Figures

Graphical Abstract
Graphical Abstract
Figure 1.
Figure 1.
Overview of the ASpdb database. (A) Summary of gene isoform structure, gene expression and AS information for 3490 human genes. The left bar plot displays the distribution of AS region lengths in canonical isoforms, while the right bar plot illustrates the distribution of the number of AS isoforms per gene. The information provided for gene summary includes: gene overview, gene isoform structure and gene expression across TCGA and GTEx. The information provided for AS summary include AS knowledge, isoform amino acid sequences, MSAs and functional domain retention. (B) Experimentally solved and AF2-predicted 3D structures of protein isoforms, including active sites and reliability using pLDDT distribution, MSA heatmap and Ramachandran plot. CM: Cryo-electron microscopy. (C) Comparison of 3D structures between canonical and alternative isoforms using TM-scores and statistical tests on changes in structure features. RAS: Relative accessible surface. (D) Overview of protein–protein interactions, clinically important variants, related drugs and associated human diseases.
Figure 2.
Figure 2.
An illustration of the comprehensive analysis results of NF2 gene. The detailed information for each panel can be found in Supplementary Tables S1–S5 and Supplementary Figures S1–S2. (A) Gene summary, including gene name, ID and description. (B) Gene ontology terms with evidence from Entrez and associated PubMed IDs. (C) AS and isoform information for three NF2 isoforms, listing canonical and alternative spliced isoforms and modifications. (D) Conversion tables of UniProt, Ensembl and RefSeq IDs, specific to the three isoforms. (E) Gene structures of canonical and alternatively spliced genes, along with their expression levels across GTEx tissues, visualized as heatmaps.
Figure 3.
Figure 3.
Detailed structural information of APH-1 gene and its isoforms. (A) AF2-predicted structure of the long isoform APH-1aL (Q96BI3-1) and its pLDDT distribution. (B) AF2-predicted structure of the short isoform APH-1aS (Q96BI3-2) and its pLDDT distribution. (C) Superimposed structure of APH-1aL (Q96BI3-1) and APH-1aS (Q96BI3-2) with the corresponding TM-score. (D) Comparison of secondary structure and RAS area changes between APH-1aL (Q96BI3-1) and APH-1aS (Q96BI3-2). (E) Drugs targeting APH1A isoforms.

References

    1. Zheng C.L., Fu X.D., Gribskov M.. Characteristics and regulatory elements defining constitutive splicing and different modes of alternative splicing in human and mouse. RNA. 2005; 11:1777–1787. - PMC - PubMed
    1. Pan Q., Shai O., Lee L.J., Frey B.J., Blencowe B.J.. Deep surveying of alternative splicing complexity in the human transcriptome by high-throughput sequencing. Nat. Genet. 2008; 40:1413–1415. - PubMed
    1. UniProt Consortium UniProt: the universal protein knowledgebase in 2021. Nucleic Acids Res. 2021; 49:D480–D489. - PMC - PubMed
    1. Gallego-Paez L.M., Bordone M.C., Leote A.C., Saraiva-Agostinho N., Ascensão-Ferreira M., Barbosa-Morais N.. Alternative splicing: the pledge, the turn and the prestige: the key role of alternative splicing in human biological systems. Hum. Genet. 2017; 136:1015–1042. - PMC - PubMed
    1. Sreenivasamurthy S.K., Madugundu A.K., Patil A.H., Dey G., Mohanty A.K., Kumar M., Patel K., Wang C., Kumar A., Pandey A.et al. .. Mosquito-borne diseases and omics: tissue-restricted expression and alternative splicing revealed by transcriptome profiling of Anopheles stephensi. OMICS. 2017; 21:488–497. - PubMed

Substances

LinkOut - more resources