AIRRSHIP: simulating human B cell receptor repertoire sequences
- PMID: 37279738
- PMCID: PMC10272706
- DOI: 10.1093/bioinformatics/btad365
AIRRSHIP: simulating human B cell receptor repertoire sequences
Abstract
Summary: Adaptive Immune Receptor Repertoire Sequencing is a rapidly developing field that has advanced understanding of the role of the adaptive immune system in health and disease. Numerous tools have been developed to analyse the complex data produced by this technique but work to compare their accuracy and reliability has been limited. Thorough, systematic assessment of their performance is dependent on the ability to produce high quality simulated datasets with known ground truth. We have developed AIRRSHIP, a flexible and fast Python package that produces synthetic human B cell receptor sequences. AIRRSHIP uses a comprehensive set of reference data to replicate key mechanisms in the immunoglobulin recombination process, with a particular focus on junctional complexity. Repertoires generated by AIRRSHIP are highly similar to published data and all steps in the sequence generation process are recorded. These data can be used to not only determine the accuracy of repertoire analysis tools but can also, by tuning of the large number of user-controllable parameters, give insight into factors that contribute to inaccuracies in results.
Availability and implementation: AIRRSHIP is implemented in Python. It is available via https://github.com/Cowanlab/airrship and on PyPI at https://pypi.org/project/airrship/. Documentation can be found at https://airrship.readthedocs.io/.
© The Author(s) 2023. Published by Oxford University Press.
Conflict of interest statement
None declared.
Figures

References
-
- Bolotin DA, Poslavsky S, Mitrophanov I. et al. MiXCR: software for comprehensive adaptive immunity profiling. Nat Methods 2015;12:380–1. - PubMed