. 2024 Dec 23;19(12):e0312733.

doi: 10.1371/journal.pone.0312733. eCollection 2024.

Comprehensive characterization of high-risk coding and non-coding single nucleotide polymorphisms of human CXCR4 gene

Bonoshree Sarkar¹, Muhammad Safiul Alam Mondal¹, Taibur Rahman¹, Md Ismail Hosen², Atiqur Rahman¹

Affiliations

¹ Infection Biology Laboratory, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka, Bangladesh.
² Clinical Biochemistry and Translational Medicine Laboratory, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka, Bangladesh.

PMID: 39715225
PMCID: PMC11665994
DOI: 10.1371/journal.pone.0312733

Comprehensive characterization of high-risk coding and non-coding single nucleotide polymorphisms of human CXCR4 gene

Bonoshree Sarkar et al. PLoS One. 2024.

. 2024 Dec 23;19(12):e0312733.

doi: 10.1371/journal.pone.0312733. eCollection 2024.

Authors

Bonoshree Sarkar¹, Muhammad Safiul Alam Mondal¹, Taibur Rahman¹, Md Ismail Hosen², Atiqur Rahman¹

Affiliations

¹ Infection Biology Laboratory, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka, Bangladesh.
² Clinical Biochemistry and Translational Medicine Laboratory, Department of Biochemistry and Molecular Biology, University of Dhaka, Dhaka, Bangladesh.

PMID: 39715225
PMCID: PMC11665994
DOI: 10.1371/journal.pone.0312733

Abstract

CXCR4, a chemokine receptor known as Fusin or CD184, spans the outer membrane of various human cells, including leukocytes. This receptor is essential for HIV infection as well as for many vital cellular processes and is implicated to be associated with multiple pathologies, including cancers. This study employs various computational tools to investigate the molecular effects of disease-vulnerable germ-line missense and non-coding SNPs of the CXCR4 gene. In this investigation, the tools SIFT, PROVEAN, PolyPhen-2, PANTHER, SNAP 2.0, PhD-SNP, and SNPs&GO were used to predict potentially harmful and disease-causing nsSNPs in CXCR4. Additionally, their impact on protein stability was examined by I-mutant 3.0, MUpro, Consurf, and Netsurf 2.0, combined with conservation and solvent accessibility analyses. Structural analysis with normal and mutant residues of the protein harboring these disease-associated functional SNPs was conducted using TM-align and SWIS MODEL, with visualization aided by PyMOL and the BIOVINA Discovery Studio Visualizer. The functional impact of wild-type and mutated CXCR4 variants was evaluated through molecular docking with its natural ligand CXCR4-modulator 1, using the PyRx tool. Non-coding SNPs in the 3' -UTR were investigated for their regulatory effects on miRNA binding sites using PolymiRTS. Five non-coding SNPs were identified in the 3'-UTR that can disrupt existing miRNA binding sites or create new ones. Non-coding SNPs in the 5' and 3'-UTRs, as well as in intronic regions, were also examined for their potential roles in gene expression regulation. Furthermore, RegulomeDB databases were employed to assess the regulatory potential of these non-coding SNPs based on chromatin state and protein binding regulation. In the mostly annotated variant (ENSP00000241393) of the CXCR4 gene, we found 23 highly deleterious and pathogenic nsSNPs and these were selected for in-depth analysis. Among the 23 nsSNPs, five (G55V, H79P, L80P, H113P, and P299L) displayed notable structural alternation, with elevated RMSD values and reduced TM (TM-score) values. A molecular docking study revealed the significant impact of the H113P variant on the protein-ligand binding affinity, supported by MD simulation over 100 nanoseconds, which highlighted substantial stability differences between wild-type and H113P mutated proteins during ligand binding. This comprehensive analysis shed light on the potential functional consequences of genetic variation in the CXCR genes, offering valuable insights into the implications of disease susceptibility and may pave the way for future therapeutic interventions.

Copyright: © 2024 Sarkar et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Fig 1. Flowchart of the methodology used in this study.**
All the coding SNPs obtained from the NCBI and ENSEMBL databases were filtered to find out the most deleterious and disease-causing SNPs using seven different bioinformatics tools. Then the filtered nsSNPs were assessed for their effect on the secondary and tertiary structure of the protein. The effect of a drug was predicted against nsSNPs that have a massive effect on the protein structure. The consequences of the non-coding SNPs were also investigated.

**Fig 2. Most deleterious and disease-causing nsSNPs estimated by different in silico tools.**
Five in silico tools–SIFT, Polyphen2.0, PROVEAN, PANTHER, and SNAP2.0 were used to predict the most deleterious nsSNPs, and two tools–PhD-SNP and SNPs & GO–were used to predict disease-causing nsSNPs. Among the deleterious and disease-causing nsSNPs, 23 were common.

**Fig 3. Schematic representation of the relative surface accessibility and conservation of the high-risk SNPs predicted by NetSurfP-2.0 and Consurf web tool.**
In NetSurfP2.0, a threshold of 25% was utilized, meaning SNPs with more than 25% RSA are expected to be exposed on the protein surface.

**Fig 4. Effects of the filtered nsSNPs within CXCR4 on post-translational modification.**
(a) Within the wild-type of CXCR4 protein, K38, and Q314 do not exhibit acetylation or Pyrrolidone carboxylic acid modification, while (b) in the context of mutated protein with F36C and P299L substitution, K38 becomes acetylated and Q314 undergoes pyrrolidone carboxylic acid modification.

**Fig 5. Superimposed structures of wild and mutant variants of CXCR4.**
(a) G55V, (b) H79P, (c) L80P, (d) H113P, and (e) P299L. Here, the blue-colored residues represent the native amino acids, and the red residues represent mutated residues. The superimposed position of the native and mutated amino acids shows that the distinguished structure of the side chain of these amino acids likely have impact on the alteration of the 3D structure of the protein.

**Fig 6. Structural analysis of wild type and mutated variants of CXCR4.**
(a) G55 in the wild-type protein and Val55 in mutated protein forms 2 H-bonds with Thr51 and Val59; (b) His79 in wild type protein engages in 4 H-bonds with Lys75, Tyr76, Val82, and Ala83, while the mutated Pro79 forms only 1 H-bond with Ala83; (c) H-bond number again changed with Leu80 (with Tyr76, Ala83, and Asn84) mutation from 3 to 2 with Pro80 (with Ala83 and Ans84); meanwhile, (d) H-bond number decreased from 4 to 2 in the mutated version of H113 (Cys109, Cys109, Thr117, Asp171) to P113 (Tyr116, Thr117); (e) P299 and L299 both create 2 H-bond but with different amino acids, P299 with Tyr302, Cys295, and L299 with Tyr302, Ala303, respectively. The changes in H-bond in the 3D structure of the CXCR4 protein likely have a significant impact on the structure functions of the protein.

**Fig 7. The structure of the protein shows the docking-related and mutation-prone residues.**
Docking-related residues (denoted with red) and the mutation-prone residues (denoted with blue) from both (a) side view and (b) top view. In both images, the residue H113, positioned in the docking site and susceptible to mutation is indicated by a red circle.

**Fig 8. Molecular docking results of CXCR4-modulator 1 with CXCR4 protein using PyRx software.**
All the mutant variants of the ligand showed greater affinity with their receptor during the simulated docking process. However, a significant decrease in the affinity was observed for the H113P mutant variant.

**Fig 9. Molecular dynamics simulation.**
The MD simulation results show key structural properties for both the wild type (depicted in green) and the mutant (depicted in blue). (a) RMSD of the wild type and the mutant, with time in nanoseconds (ns) on the X-axis and the RMSD values in nanometers (nm) on the Y-axis represent. (b) RMSF of the wild type and the mutant are compared, using amino acid residues on the X-axis and the RMSF values in nm on the Y-axis. (c) The radius of Gyration of the wild type and the mutant, representing the time in picoseconds (ps) on the X-axis and the area in squire nanometer (nm²) on the Y-axis. (d) The SASA of the wild type and the mutant, with time in picoseconds (ps) on the and the SASA values in nm on the Y-axis.

See this image and copyright information in PMC

References

1. Collins FS, Patrinos A, Jordan E, Chakravarti A, Gesteland R, Walters LR. New goals for the U.S. Human Genome Project: 1998–2003. Science (80-). 1998;282(5389):682–9. - PubMed
1. Bhardwaj V, Purohit R. Computational investigation on effect of mutations in PCNA resulting in structural perturbations and inhibition of mismatch repair pathway. J Biomol Struct Dyn [Internet]. 2020;38(7):1963–74. Available from: 10.1080/07391102.2019.1621210 - DOI - PubMed
1. Giral H, Landmesser U, Kratzer A. Into the Wild: GWAS Exploration of Non-coding RNAs. Front Cardiovasc Med. 2018;5(December). - PMC - PubMed
1. Xiao X, Chang H, Li M. Molecular mechanisms underlying noncoding risk variations in psychiatric genetic studies. Mol Psychiatry. 2017;22(4):497–511. - PMC - PubMed
1. Singh A, Thakur M, Singh SK, Sharma LK, Chandra K. Exploring the effect of nsSNPs in human YPEL3 gene in cellular senescence. Sci Rep [Internet]. 2020;10(1):1–11. Available from: 10.1038/s41598-020-72333-8 - DOI - PMC - PubMed

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions

Substances

Actions
Actions

LinkOut - more resources

Full Text Sources
- PubMed Central
- Public Library of Science

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Comprehensive characterization of high-risk coding and non-coding single nucleotide polymorphisms of human CXCR4 gene

Affiliations

Comprehensive characterization of high-risk coding and non-coding single nucleotide polymorphisms of human CXCR4 gene

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

References

MeSH terms

Substances

LinkOut - more resources

Full Text Sources

Abstract

Conflict of interest statement

Figures

Similar articles

References

MeSH terms

Substances

Related information

LinkOut - more resources

Full Text Sources