RabbitBAM: Accelerating BAM File Manipulation on Multi-Core Platforms
- PMID: 40811285
- DOI: 10.1109/TCBBIO.2025.3590412
Abstract
With the continuous advancement of sequencing technology, the scale of biological data has rapidly increased. The BAM format, widely used for storing aligned sequence data, is popular due to its ease of use and good compression ratio. However, existing BAM-format file I/O libraries often fail to fully leverage the computational power of modern multi-core platforms, resulting in low CPU utilization. To address this, we introduce RabbitBAM, a fast BAM-format file I/O library. RabbitBAM employs pre-parsing and parallel parsing techniques to eliminate parsing bottlenecks and improve parallel efficiency. Additionally, we optimize multi-threaded data handling through the use of dedicated lock-free queues and memory pools. RabbitBAM achieves 2.1-3.3x speedups on next-generation sequencing data and 1-2.2x speedups on third-generation sequencing data compared to state-of-the-art SAMtools (HTSlib). We also present two case studies (BAM file quality control and sorting) using RabbitBAM, demonstrating 1.4-2.4x speedups compared to other implementations. RabbitBAM and the case studies are freely available at https://github.com/RabbitBio, with usage documentation at https://rabbitbam.readthedocs.io/en/latest.
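To make the idea of pre-parsing followed by parallel processing concrete, here is a minimal Python sketch. It is not RabbitBAM's implementation or API, and it does not use lock-free queues or memory pools. It relies only on the public BGZF layout used by BAM files: each block is a gzip member whose `BC` extra subfield stores the block size minus one. A cheap sequential scan can therefore find every block boundary before worker threads inflate the blocks independently. All function names (`make_block`, `scan_blocks`, `inflate_block`, `parallel_inflate`) are hypothetical, and the sketch assumes `BC` is the only extra subfield, as in files written by HTSlib.

```python
import struct
import zlib
from concurrent.futures import ThreadPoolExecutor

# Fixed 18-byte BGZF block header: gzip magic, CM, FLG, MTIME, XFL, OS,
# XLEN, then the 'B','C' subfield with SLEN and BSIZE (block size - 1).
HEADER = struct.Struct("<4BI2BH2BHH")
FOOTER = struct.Struct("<II")  # CRC32 and uncompressed size


def make_block(payload: bytes) -> bytes:
    """Build one BGZF block (helper for the demo; payload must be small)."""
    comp = zlib.compressobj(6, zlib.DEFLATED, -15)  # raw deflate stream
    cdata = comp.compress(payload) + comp.flush()
    bsize = HEADER.size + len(cdata) + FOOTER.size - 1
    header = HEADER.pack(31, 139, 8, 4, 0, 0, 255, 6, 66, 67, 2, bsize)
    return header + cdata + FOOTER.pack(zlib.crc32(payload), len(payload))


def scan_blocks(buf: bytes) -> list[tuple[int, int]]:
    """Pre-parse step: read headers only, return (offset, size) per block."""
    blocks, pos = [], 0
    while pos < len(buf):
        f = HEADER.unpack_from(buf, pos)
        # Assumption: the BC subfield is the first and only extra subfield.
        if (f[0], f[1], f[8], f[9]) != (31, 139, 66, 67):
            raise ValueError(f"not a BGZF block at offset {pos}")
        size = f[11] + 1
        blocks.append((pos, size))
        pos += size
    return blocks


def inflate_block(buf: bytes, offset: int, size: int) -> bytes:
    """Decompress one block and verify its CRC32 and length."""
    end = offset + size - FOOTER.size
    data = zlib.decompress(buf[offset + HEADER.size:end], -15)
    crc, isize = FOOTER.unpack_from(buf, end)
    if zlib.crc32(data) != crc or len(data) != isize:
        raise ValueError(f"corrupt block at offset {offset}")
    return data


def parallel_inflate(buf: bytes, workers: int = 4) -> bytes:
    """Inflate all blocks on a thread pool; map() preserves block order."""
    blocks = scan_blocks(buf)
    with ThreadPoolExecutor(max_workers=workers) as pool:
        parts = pool.map(lambda b: inflate_block(buf, *b), blocks)
        return b"".join(parts)
```

For example, concatenating a few blocks built with `make_block` and passing the result to `parallel_inflate` returns the original payloads joined in order. Threads are a reasonable choice for this sketch because CPython's `zlib` releases the global interpreter lock while decompressing, so the blocks can be inflated concurrently.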