Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Dec 15;32(24):3829-3832.
doi: 10.1093/bioinformatics/btw602. Epub 2016 Sep 25.

LongISLND: in silico sequencing of lengthy and noisy datatypes

Affiliations

LongISLND: in silico sequencing of lengthy and noisy datatypes

Bayo Lau et al. Bioinformatics. .

Abstract

LongISLND is a software package designed to simulate sequencing data according to the characteristics of third generation, single-molecule sequencing technologies. The general software architecture is easily extendable, as demonstrated by the emulation of Pacific Biosciences (PacBio) multi-pass sequencing with P5 and P6 chemistries, producing data in FASTQ, H5, and the latest PacBio BAM format. We demonstrate its utility by downstream processing with consensus building and variant calling.

Availability and implementation: LongISLND is implemented in Java and available at http://bioinform.github.io/longislnd CONTACT: hugo.lam@roche.comSupplementary information: Supplementary data are available at Bioinformatics online.

PubMed Disclaimer

Figures

Fig. 1.
Fig. 1.
(a) Number of 7-mers binned with respect to accuracy, determined within 1% as discussed in the Supplementary material. A context-independent error profile would yield a delta peaking function centered at the global accuracy. (b) Fraction of samples of a certain sequence length aligned to a homopolymer of true length 6. Compared to the analytical expression derived in the Supplementary Material, G/C deletion bias is observed in both P5 and P6 chemistries

References

    1. Berlin K. et al. (2015) Assembling large genomes with single-molecule sequencing and locality-sensitive hashing. Nat. Biotechnol., 33, 623–630. - PubMed
    1. Chaisson M.J., Tesler G. (2012) Mapping single molecule sequencing reads using basic local alignment with successive refinement (BLASR): application and theory. BMC Bioinformatics, 13, 238.. - PMC - PubMed
    1. Chin C.S. et al. (2013) Nonhybrid, finished microbial genome assemblies from long-read SMRT sequencing data. Nat. Methods, 10, 563–569. - PubMed
    1. Eid J. et al. (2009) Real-time DNA sequencing from single polymerase molecules. Science, 323, 133–138. - PubMed
    1. English A.C. et al. (2014) Pbhoney: identifying genomic variants via long-read discordance and interrupted mapping. BMC Bioinformatics, 15, 180.. - PMC - PubMed

MeSH terms