Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2019 Jun 28;20(1):363.
doi: 10.1186/s12859-019-2919-x.

In silico analysis of missense mutations in exons 1-5 of the F9 gene that cause hemophilia B

Affiliations

In silico analysis of missense mutations in exons 1-5 of the F9 gene that cause hemophilia B

Lennon Meléndez-Aranda et al. BMC Bioinformatics. .

Abstract

Background: Missense mutations in the first five exons of F9, which encodes factor FIX, represent 40% of all mutations that cause hemophilia B. To address the ongoing debate regarding in silico identification of disease-causing mutations at these exons, we analyzed 215 missense mutations from www.factorix.org using six in silico prediction tools, which are the most common used programs for analysis prediction of impact of mutations on the protein structure and function, with further advantage of using similar approaches. We developed different algorithms to integrate multiple predictions from such tools. In order to approach a structural analysis on FIX we performed a modeling of five selected pathogenic mutations.

Results: SIFT, PolyPhen-2 HumDiv, SNAP2, and MutationAssessor were the most successful in identifying true non-causative and causative mutations. A proposed function integrating these algorithms (wgP4) was the most sensitive (90.1%), specific (22.6%), and accurate (87%) than similar functions, and identified 187 variants as deleterious. Clinical phenotype was significantly associated with predicted causative mutations at all five exons. However, PolyPhen-2 HumDiv was more successful in linking clinical severity to specific exons, while functions that integrate 4-6 predictions were more successful in linking phenotype to genotypes at the light chain (exons 3-5). The most important value of integrating multiple predictions is the inclusion of scores derived from different approaches. Modeling of protein structure showed the effects of pathogenic nsSNPs on structure and function of FIX.

Conclusions: A simple function that integrates information from different in silico programs yields the best prediction of mutated phenotypes. However, the specificity, sensitivity, and accuracy of genotype-phenotype predictions depend on specific characteristics of the protein domain and the disease of interest as we validated by the structural analysis of selected pathogenic F9 mutations. The proposed function integrating algorithm (wgP4) might be useful for the analysis of nsSNPs impact on other genes.

Keywords: F9 exons 1–5; Genotype-phenotype correlation; Hemophilia B; In silico analysis.

PubMed Disclaimer

Conflict of interest statement

We have no competing interest at public or private institutions.

Figures

Fig. 1
Fig. 1
Formulas for combined predictions. (1 – SIFT), as SIFT scores are inverse to PolyPhen-2 scores, they were scaled by subtracting from 1. PolyPhen, score obtained from PolyPhen-2 HumDiv. (SNAP2/100)2, SNAP2 scores may be positive and negative percentages, they were scaled to PolyPhen-2 scores by dividing by 100 and squaring. MutationAssessor, scores range from 4 to − 2. Mutations scoring below 1.9 are considered benign, and so are coded as 1. Predicted values were log-transformed at base 5 to obtain values between 0 and 1. PANTHER and PROVEAN, predictions are categorized as deleterious or benign, and are coded 1 and 0 respectively. n, number of programs used in combined analysis. In the functions wgP6 and wgP4, n is substituted by the weight for each program. In B and C, predicted values in the numerator are multiplied by the weight
Fig. 2
Fig. 2
Sensitivity, specificity, and accuracy for five factor IX domains. The first five domains encoded by exons 1–5 were analyzed as one unit using individual tools. See text for more details
Fig. 3
Fig. 3
Sensitivity, specificity, and accuracy for each factor IX domain. The (a) signal peptide at exon 1, (b) propeptide at exon 2, (c) Gla domain at exon 3, (d) EGF-1 domain at exon 4, and (e) EGF-2 domain at exon 5 were analyzed by individual tools. See text for more details
Fig. 4
Fig. 4
Analysis of factor IX secondary structure by the FFPRED tool in PSIPRED. Analysis of predicted changes in (a) percentage alpha helix, extended strand, and random coil, as well as in (b) aliphatic index, hydrophobicity, surface area, and addition or deletion of phosphorylation sites. Domains are depicted in different shades of gray
Fig. 5
Fig. 5
Comparison of the local environment of severe and mild mutations in the EGF domains of FIX. The protein backbone is shown in silver ribbons, interacting amino acids as a black licorice and the calcium ion as a white sphere. a, c, e, g, i correspond to wild type FIX. b, d, f, h, j correspond to mutant FIX. a Location of Gly105 in EGF-1 (from PDB ID 1PFX); the N-terminus of the domain is labeled. b Location of Asp105 in EGF-1; the N-terminus of the domain is labeled c Neighboring residues for Gln143 (from PDB ID 6MV4), labeled. d Neighboring residues for Arg143, clashing with the disulfide bond between Cys157 and Cys170, Tyr161 and Phe423. e Salt bridge between Glu124 in EGF-1 and Arg140 in EGF-2; neighboring positive residue also labeled (from PDB ID 1PFX). f Group of nearby positive charges in the Glu124Lys mutant. g Selected residues close to Val153 (from PDB ID 1PFX). h Residues that clash with Met153. i Residues coordinating the calcium ion in EGF-1; the residues that contribute their side chains are labeled (from PDB ID 1EDM). j Location of Pro96 as a first coordination shell residue for calcium

Similar articles

Cited by

References

    1. Rallapalli P. M., Kemball-Cook G., Tuddenham E. G., Gomez K., Perkins S. J. An interactive mutation database for human coagulation factor IX provides novel insights into the phenotypes and genetics of hemophilia B. Journal of Thrombosis and Haemostasis. 2013;11(7):1329–1340. doi: 10.1111/jth.12276. - DOI - PubMed
    1. Blanchette VS, Key NS, Ljung LR, Manco-Johnson MJ, van den Berg HM, Srivastava A. Definitions in hemophilia: communication from the SSC of the ISTH. J Thromb Haemost. 2014;12(11):1935–1939. doi: 10.1111/jth.12672. - DOI - PubMed
    1. Anson DS, Choo KH, Rees DJ, Giannelli F, Gould K, Huddleston JA, et al. The gene structure of human anti-haemophilic factor IX. EMBO J. 1984;3(5):1053–1060. doi: 10.1002/j.1460-2075.1984.tb01926.x. - DOI - PMC - PubMed
    1. Von Heijne G. The signal peptide. J Membr Biol. 1990;115(3):195–201. doi: 10.1007/BF01868635. - DOI - PubMed
    1. Jorgensen MJ, Cantor AB, Furie BC, Brown CL, Shoemaker CB, Furie B. Recognition site directing vitamin K-dependent gamma-carboxylation resides on the propeptide of factor IX. Cell. 1987;48(2):185–191. doi: 10.1016/0092-8674(87)90422-3. - DOI - PubMed

LinkOut - more resources