Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2015 Feb 11;16(1):30.
doi: 10.1186/s13059-015-0596-2.

Bridger: a new framework for de novo transcriptome assembly using RNA-seq data

Bridger: a new framework for de novo transcriptome assembly using RNA-seq data

Zheng Chang et al. Genome Biol. .

Abstract

We present a new de novo transcriptome assembler, Bridger, which takes advantage of techniques employed in Cufflinks to overcome limitations of the existing de novo assemblers. When tested on dog, human, and mouse RNA-seq data, Bridger assembled more full-length reference transcripts while reporting considerably fewer candidate transcripts, hence greatly reducing false positive transcripts in comparison with the state-of-the-art assemblers. It runs substantially faster and requires much less memory space than most assemblers. More interestingly, Bridger reaches a comparable level of sensitivity and accuracy with Cufflinks. Bridger is available at https://sourceforge.net/projects/rnaseqassembly/files/?source=navbar.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Flowchart of Bridger. (a) The algorithm takes RNA-seq reads (single or paired) to assemble splicing graphs, each of which provides a complete representation of all alternative splicing transcripts for each locus. (b-d) Each splicing graph is processed independently. (b) Each edge in a splicing graph represents one splice junction. In this example, edges 1 and 3 are compatible, while edges 3 and 4 are not compatible. (c) A compatibility graph. (d) A minimum path cover model is applied to recover a minimal set of transcripts that could be tiled together through overlapping sequence reads and ‘explain’ all observed junctions in a splicing graph.
Figure 2
Figure 2
Splicing graph construction. (a) Splicing graph after branch extension. The red k-mer (k = 5) ATCAG on the left is a bifurcation 5-mer because there is an unused 5-mer TCAGC in the hash table that provides an alternative extension. Extend this 5-mer to a new contig until it cannot be further extended. We check the last 4-mer of this branch to see if there is a matching 4-mer in the current splicing graph. If so, another bifurcation 5-mer is found (for example, the red 5-mer CTAGC). (b) A modified splicing graph by merging the k-1 overlapping nucleotides (4-mer CTAG) and adding a new directed edge between two bifurcation k-mers.
Figure 3
Figure 3
Comparison of the number of full-length reconstructed reference transcripts for (a) dog, (b) human, and (c) mouse.
Figure 4
Figure 4
Comparison of accuracy for (a) dog, (b) human, and (c) mouse.
Figure 5
Figure 5
Run time and RAM usage for each assembler in (a) dog, (b) human, and (c) mouse. Same parameter values are used for all assemblers: k = 25 and CPU = 6.
Figure 6
Figure 6
CPU time for each assembler in (a) dog, (b) human, and (c) mouse.
Figure 7
Figure 7
A novel gene containing 10 exons was assembled by all assemblers. Interestingly, all de novo assemblers captured longer UTR than the reference-based assembler Cufflinks.

References

    1. Ozsolak F, Milos PM. RNA sequencing: advances, challenges and opportunities. Nat Rev Genet. 2010;12:87–98. doi: 10.1038/nrg2934. - DOI - PMC - PubMed
    1. Wang Z, Gerstein M, Snyder M. RNA-Seq: a revolutionary tool for transcriptomics. Nat Rev Genet. 2009;10:57–63. doi: 10.1038/nrg2484. - DOI - PMC - PubMed
    1. Marguerat S, Bähler J. RNA-seq: from technology to biology. Cell Mol Life Sci. 2010;67:569–79. doi: 10.1007/s00018-009-0180-6. - DOI - PMC - PubMed
    1. Wilhelm BT, Landry J-R. RNA-Seq–quantitative measurement of expression through massively parallel RNA-sequencing. Methods. 2009;48:249–57. doi: 10.1016/j.ymeth.2009.03.016. - DOI - PubMed
    1. Marioni JC, Mason CE, Mane SM, Stephens M, Gilad Y. RNA-seq: an assessment of technical reproducibility and comparison with gene expression arrays. Genome Res. 2008;18:1509–17. doi: 10.1101/gr.079558.108. - DOI - PMC - PubMed

Publication types