COSAP: Comparative Sequencing Analysis Platform
- PMID: 38532317
- PMCID: PMC10967217
- DOI: 10.1186/s12859-024-05756-z
Abstract
Background: Recent improvements in sequencing technologies have enabled detailed profiling of genomic features. These technologies mostly rely on short reads, which are merged and compared to a reference genome for variant identification. Because of the size and complexity of the data, these operations must be performed computationally. The need for analysis software has resulted in many programs for the mapping, variant calling and annotation steps. Currently, most programs are either expensive enterprise software with proprietary code, which makes access and verification very difficult, or open-access programs that are mostly command-line based and lack user interfaces and extensive documentation. Moreover, multiple studies have observed a high level of disagreement among popular mapping and variant calling algorithms, which makes relying on a single algorithm risky. Given the growth of sequencing technologies, user-friendly open-source software tools that offer comparative analysis are an important need.
Results: Here, we propose the Comparative Sequencing Analysis Platform (COSAP), an open-source platform that provides popular sequencing algorithms for SNV, indel and structural variant calling, copy number variation, microsatellite instability and fusion analysis, together with their annotations. COSAP is packaged with a fully functional, user-friendly web interface and a backend server, which allows fully independent deployment at both individual and institutional scales. COSAP is developed as a workflow management system and designed to enhance cooperation among scientists with different backgrounds. It is publicly available at https://cosap.bio and https://github.com/MBaysanLab/cosap/ . The source code of the backend and frontend services can be found at https://github.com/MBaysanLab/cosap-webapi/ and https://github.com/MBaysanLab/cosap_frontend/ , respectively. All services are also packaged as Docker containers. Pipelines that combine algorithms can be customized, and new algorithms can be added with minimal coding thanks to the modular structure.
Conclusions: COSAP simplifies and speeds up DNA sequencing analyses by providing commonly used algorithms for SNV, indel and structural variant calling, copy number variation, microsatellite instability and fusion analysis, as well as their annotations. COSAP is packaged with a fully functional, user-friendly web interface and a backend server, which allows fully independent deployment at both individual and institutional scales. Standardized implementations of popular algorithms in a modular platform make it much easier to compare alternative pipelines and assess their impact, which is crucial for establishing the reproducibility of sequencing analyses.
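To make the idea of comparing alternative pipelines concrete, the following is a minimal sketch of how agreement between variant call sets could be quantified. It does not use COSAP's API and is not the authors' method; the pipeline labels, the variant key (chromosome, position, ref, alt) and the function names jaccard, pairwise_concordance and consensus are all assumptions introduced for illustration.

```python
from itertools import combinations

# Hypothetical variant key: (chromosome, 1-based position, ref allele, alt allele).
Variant = tuple[str, int, str, str]


def jaccard(a: set[Variant], b: set[Variant]) -> float:
    """Return |a & b| / |a | b|, or 1.0 when both call sets are empty."""
    union = a | b
    return len(a & b) / len(union) if union else 1.0


def pairwise_concordance(
    calls: dict[str, set[Variant]],
) -> dict[tuple[str, str], float]:
    """Jaccard index for every unordered pair of pipelines, keyed by sorted names."""
    return {
        (x, y): jaccard(calls[x], calls[y])
        for x, y in combinations(sorted(calls), 2)
    }


def consensus(calls: dict[str, set[Variant]], min_support: int = 2) -> set[Variant]:
    """Variants reported by at least `min_support` pipelines."""
    counts: dict[Variant, int] = {}
    for variants in calls.values():
        for variant in variants:
            counts[variant] = counts.get(variant, 0) + 1
    return {variant for variant, n in counts.items() if n >= min_support}


# Made-up example data; the pipeline labels are placeholders, not real results.
v1 = ("chr1", 1000, "A", "G")
v2 = ("chr1", 2000, "C", "T")
v3 = ("chr2", 3000, "G", "GA")
v4 = ("chr3", 4000, "T", "C")

example_calls = {
    "pipeline_a": {v1, v2, v3},
    "pipeline_b": {v2, v3, v4},
    "pipeline_c": {v3},
}

concordance = pairwise_concordance(example_calls)
shared = consensus(example_calls, min_support=2)
```

A low pairwise index between two pipelines on the same sample flags the kind of disagreement the Background describes, while the consensus set keeps only variants supported by several pipelines. Real call sets would first need normalization (for example, left-aligning indels) before keys like these are comparable.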
Keywords: Copy number variation; Microsatellite instability; NGS Analysis; Variant annotation; Variant classification.
© 2024. The Author(s).
Conflict of interest statement
The authors declare that they have no competing interests.