Measuring, visualizing, and diagnosing reference bias with biastools
- PMID: 38641647
- PMCID: PMC11027314
- DOI: 10.1186/s13059-024-03240-8
Measuring, visualizing, and diagnosing reference bias with biastools
Abstract
Many bioinformatics methods seek to reduce reference bias, but no methods exist to comprehensively measure it. Biastools analyzes and categorizes instances of reference bias. It works in various scenarios: when the donor's variants are known and reads are simulated; when donor variants are known and reads are real; and when variants are unknown and reads are real. Using biastools, we observe that more inclusive graph genomes result in fewer biased sites. We find that end-to-end alignment reduces bias at indels relative to local aligners. Finally, we use biastools to characterize how T2T references improve large-scale bias.
Keywords: Pangenomics; Reference bias; Sequence alignment.
© 2024. The Author(s).
Conflict of interest statement
The authors declare that they have no competing interests.
Figures
Update of
-
Measuring, visualizing and diagnosing reference bias with biastools.bioRxiv [Preprint]. 2024 Feb 15:2023.09.13.557552. doi: 10.1101/2023.09.13.557552. bioRxiv. 2024. Update in: Genome Biol. 2024 Apr 19;25(1):101. doi: 10.1186/s13059-024-03240-8. PMID: 37745608 Free PMC article. Updated. Preprint.
References
-
- Baid G, Nattestad M, Kolesnikov A, Goel S, Yang H, Chang PC, et al. Google Brain Genomics Sequencing Dataset for Benchmarking and Development. Dataset. 2020. https://console.cloud.google.com/storage/browser/brain-genomics-public/r.... Accessed 15 Apr 2024.
Publication types
MeSH terms
Grants and funding
LinkOut - more resources
Full Text Sources
