Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Sep 1;41(9):btaf522.
doi: 10.1093/bioinformatics/btaf522.

qcCHIP: an R package to identify clonal hematopoiesis variants using cohort-specific data characteristics

Affiliations

qcCHIP: an R package to identify clonal hematopoiesis variants using cohort-specific data characteristics

Xiang Liu et al. Bioinformatics. .

Abstract

Summary: Clonal hematopoiesis (CH) is a molecular biomarker associated with various adverse outcomes in both healthy individuals and those with underlying conditions, including cancer. Detecting CH usually involves genomic sequencing of individual blood samples followed by robust bioinformatics data filtering. We report an R package, qcCHIP, a bioinformatics pipeline that implements permutation-based parameter optimization to guide quality control filtering and cohort-specific CH identification. We benchmark qcCHIP under various data settings, including different sequencing depths, ranges of cohort sizes, with and without normal-tumor paired samples, and across different cancer types. We show that qcCHIP allows users to customize analysis needs to generate CH calls based on cohort-specific data characteristics.

Availability and implementation: qcCHIP R package is freely accessible at GitHub https://github.com/tenglab/qcCHIP and DOI: 10.5281/zenodo.16421861.

PubMed Disclaimer

Figures

Figure 1.
Figure 1.
Permutation-based approach to optimize clonal hematopoiesis (CH) quality control metric cutoffs. (a) Quality metrics to filter CH mutations by qcCHIP. (b) Permutation analysis and (c) precision and recall at different variant allele frequency (VAF) cutoffs using blood-only whole exome sequencing data from the ORIEN breast cancer cohort. Permutation consistency is the proportion of CH calls from the full cohort that are also called in the permuted subsets. Each point represents average consistency over 100 permutations. Colors indicate different permuted sample sizes. (d) Permutation analysis and (e) precision and recall at different VAF cutoffs using blood-tumor paired samples for the same breast cancer cohort. (f) Permutation analysis and (g) precision-VAF curves using blood-only ultra-high-depth targeted exon sequencing from an independent cohort of patients with breast cancer. Minor effects on recall are showed in Fig. 2, available as supplementary data at Bioinformatics online. The point sizes represent the number of common CH mutations between qcCHIP identification and manually curated CH mutations.

References

    1. Bolton KL, Gillis NK, Coombs CC et al. Managing clonal hematopoiesis in patients with solid tumors. J Clin Oncol 2019;37:7–11. - PMC - PubMed
    1. Chan ICC, Panchot A, Schmidt E et al. ArCH: improving the performance of clonal hematopoiesis variant calling and interpretation. Bioinformatics 2024;40:btae121. - PMC - PubMed
    1. Comen EA, Bowman RL, Selenica P et al. Evaluating clonal hematopoiesis in tumor-infiltrating leukocytes in breast cancer and secondary hematologic malignancies. J Natl Cancer Inst 2020;112:107–10. - PMC - PubMed
    1. Coombs CC, Gillis NK, Tan X et al. Identification of clonal hematopoiesis mutations in solid tumor patients undergoing unpaired next-generation sequencing assays. Clin Cancer Res 2018;24:5918–24. - PMC - PubMed
    1. Coombs CC, Zehir A, Devlin SM et al. Therapy-related clonal hematopoiesis in patients with non-hematologic cancers is common and associated with adverse clinical outcomes. Cell Stem Cell 2017;21:374–82.e4. - PMC - PubMed