Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Dec 6:9:620.
doi: 10.3389/fgene.2018.00620. eCollection 2018.

VarQ: A Tool for the Structural and Functional Analysis of Human Protein Variants

Affiliations

VarQ: A Tool for the Structural and Functional Analysis of Human Protein Variants

Leandro Radusky et al. Front Genet. .

Abstract

Understanding the functional effect of Single Amino acid Substitutions (SAS), derived from the occurrence of single nucleotide variants (SNVs), and their relation to disease development is a major issue in clinical genomics. Despite the existence of several bioinformatic algorithms and servers that predict if a SAS is pathogenic or not, they give little or no information at all on the reasons for pathogenicity prediction and on the actual predicted effect of the SAS on the protein function. Moreover, few actual methods take into account structural information when available for automated analysis. Moreover, many of these algorithms are able to predict an effect that no necessarily translates directly into pathogenicity. VarQ is a bioinformatic pipeline that incorporates structural information for the detailed analysis and prediction of SAS effect on protein function. It is an online tool which uses UniProt id and automatically analyzes known and user provided SAS for their effect on protein activity, folding, aggregation and protein interactions, among others. We show that structural information, when available, can improve the SAS pathogenicity diagnosis and more important explain its causes. We show that VarQ is able to correctly reproduce previous analysis of RASopathies related mutations, saving extensive and time consuming manual curation. VarQ assessment was performed over a set of previously manually curated RASopathies (diseases that affects the RAS/MAPK signaling pathway) related variants, showing its ability to correctly predict the phenotypic outcome and its underlying cause. This resource is available online at http://varq.qb.fcen.uba.ar/. Supporting Information & Tutorials may be found in the webpage of the tool.

Keywords: FoldX; bioinformatics; single amino acid substitutions; single amino acid substitutions classification; variation diagnosis; web server.

PubMed Disclaimer

Figures

FIGURE 1
FIGURE 1
VarQ general pipeline. Each known structural conformation of the input protein is analyzed independently to aid the user in variant effect interpretation.
FIGURE 2
FIGURE 2
An Example of VarQ output for the HRAS gene (PDB id 3DDC). On top left, we show the structural features derived from crystallographic data. Target gene and possible interacting genes are shown in green and gray respectively. Structural coverage, PFam family assignment and location of the variant are mapped over the UniProt original sequence.
FIGURE 3
FIGURE 3
Change in folding energy (ΔΔG) computed by the FoldX software for all mined variants in analyzed RASopathies related proteins. Pathogenic and non-pathogenic variants are shown separately.
FIGURE 4
FIGURE 4
Receiver Operating Characteristic Curves for the diagnosis of Kiel & Serrano manually curated pathogenic variants that were mapped into structure. Pathogenic mutations are considered true positives in Clinvar (black curve) data set when they are labeled as “pathogenic” or “likely pathogenic.” For VarQ (green curve) true positives are considered for all the labels except “no effect”.

Similar articles

Cited by

References

    1. Bateman A., Coin L., Durbin R., Finn R. D., Hollich V., Griffiths-Jones S., et al. (2004). The Pfam protein families database. Nucleic Acids Res. 32D138–D141. - PMC - PubMed
    1. Bromberg Y., Rost B. (2007). SNAP: predict effect of non-synonymous polymorphisms on function. Nucleic Acids Res. 35 3823–3835. - PMC - PubMed
    1. Day I. N. M. (2010). dbSNP in the detail and copy number complexities. Hum. Mutat. 31 2–4. 10.1002/humu.21149 - DOI - PubMed
    1. De Baets G., Van Durme J., Reumers J., Maurer-Stroh S., Vanhee P., Dopazo J., et al. (2012). SNPeffect 4.0: on-line prediction of molecular and structural effects of protein-coding variants. Nucleic Acids Res. 40 D935–D939. 10.1093/nar/gkr996 - DOI - PMC - PubMed
    1. DeLano L. W. (2002). The PyMOL Molecular Graphics System. Available at: http://pymol.org