Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2024 Mar 29;40(4):btae168.
doi: 10.1093/bioinformatics/btae168.

A machine-readable specification for genomics assays

Affiliations

A machine-readable specification for genomics assays

Ali Sina Booeshaghi et al. Bioinformatics. .

Abstract

Motivation: Understanding the structure of sequenced fragments from genomics libraries is essential for accurate read preprocessing. Currently, different assays and sequencing technologies require custom scripts and programs that do not leverage the common structure of sequence elements present in genomics libraries.

Results: We present seqspec, a machine-readable specification for libraries produced by genomics assays that facilitates standardization of preprocessing and enables tracking and comparison of genomics assays.

Availability and implementation: The specification and associated seqspec command line tool is available at https://www.doi.org/10.5281/zenodo.10213865.

PubMed Disclaimer

Conflict of interest statement

None declared.

Figures

Figure 1.
Figure 1.
The structure of molecules in genomic libraries. Sequencing libraries are constructed by combining Atomic Regions to form an adapter-insert-adapter construct. The seqspec for the assay annotates the construct with Regions and meta Regions.
Figure 2.
Figure 2.
Uniform processing enabled with seqspec. The seqspec index command produces a technology string that identifies appropriate sequence elements and can be passed into processing tools.

Update of

Similar articles

Cited by

References

    1. Cao J, Cusanovich DA, Ramani V. et al. Joint profiling of chromatin accessibility and gene expression in thousands of single cells. Science 2018;361:1380–5. - PMC - PubMed
    1. Chen X. Collections of library structure and sequence of popular single cell genomic methods. GitHub. 2020. https://github.com/Teichlab/scg_lib_structs.
    1. Cheow LF, Courtois ET, Tan Y. et al. Single-cell multimodal profiling reveals cellular epigenetic heterogeneity. Nat Methods 2016;13:833–6. - PubMed
    1. He D, Zakeri M, Sarkar H. et al. Alevin-fry unlocks rapid, accurate and memory-frugal quantification of single-cell RNA-seq data. Nat Methods 2022;19:316–22. - PMC - PubMed
    1. Healey HM, Bassham S, Cresko WA.. Single-cell iso-sequencing enables rapid genome annotation for scRNAseq analysis. Genetics 2022;220. https://academic.oup.com/genetics/article/220/3/iyac017/6526397. - PMC - PubMed