Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012;13 Suppl 17(Suppl 17):S24.
doi: 10.1186/1471-2105-13-S17-S24. Epub 2012 Dec 13.

TranSeqAnnotator: large-scale analysis of transcriptomic data

Affiliations

TranSeqAnnotator: large-scale analysis of transcriptomic data

Ranjeeta Menon et al. BMC Bioinformatics. 2012.

Abstract

Background: The transcriptome of an organism can be studied with the analysis of expressed sequence tag (EST) data sets that offers a rapid and cost effective approach with several new and updated bioinformatics approaches and tools for assembly and annotation. The comprehensive analyses comprehend an organism along with the genome and proteome analysis. With the advent of large-scale sequencing projects and generation of sequence data at protein and cDNA levels, automated analysis pipeline is necessary to store, organize and annotate ESTs.

Results: TranSeqAnnotator is a workflow for large-scale analysis of transcriptomic data with the most appropriate bioinformatics tools for data management and analysis. The pipeline automatically cleans, clusters, assembles and generates consensus sequences, conceptually translates these into possible protein products and assigns putative function based on various DNA and protein similarity searches. Excretory/secretory (ES) proteins inferred from ESTs/short reads are also identified. The TranSeqAnnotator accepts FASTA format raw and quality ESTs along with protein and short read sequences and are analysed with user selected programs. After pre-processing and assembly, the dataset is annotated at the nucleotide, protein and ES protein levels.

Conclusion: TranSeqAnnotator has been developed in a Linux cluster, to perform an exhaustive and reliable analysis and provide detailed annotation. TranSeqAnnotator outputs gene ontologies, protein functional identifications in terms of mapping to protein domains and metabolic pathways. The pipeline is applied to annotate large EST datasets to identify several novel and known genes with therapeutic experimental validations and could serve as potential targets for parasite intervention. TransSeqAnnotator is freely available for the scientific community at http://estexplorer.biolinfo.org/TranSeqAnnotator/.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Schematic diagram of TranSeqAnnotator workflow.
Figure 2
Figure 2
TranSeqAnnotator data submission page.

Similar articles

Cited by

References

    1. Rudd S. Expressed sequence tags: alternative or complement to whole genome sequences? Trends Plant Sci. 2003;8(7):321–329. doi: 10.1016/S1360-1385(03)00131-6. - DOI - PubMed
    1. Dong Q, Kroiss L, Oakley FD, Wang BB, Brendel V. Comparative EST analyses in plant systems. Methods Enzymol. 2005;395:400–418. - PubMed
    1. Jongeneel CV. Searching the expressed sequence tag (EST) databases: panning for genes. Brief Bioinform. 2000;1(1):76–92. doi: 10.1093/bib/1.1.76. - DOI - PubMed
    1. Adams MD, Kelley JM, Gocayne JD, Dubnick M, Polymeropoulos MH, Xiao H, Merril CR, Wu A, Olde B, Moreno RF. et al.Complementary DNA sequencing: expressed sequence tags and human genome project. Science. 1991;252(5013):1651–1656. doi: 10.1126/science.2047873. - DOI - PubMed
    1. Moreno Y, Gros PP, Tam M, Segura M, Valanparambil R, Geary TG, Stevenson MM. Proteomic analysis of excretory-secretory products of Heligmosomoides polygyrus assessed with next-generation sequencing transcriptomic information. PLoS neglected tropical diseases. 2011;5(10):e1370. doi: 10.1371/journal.pntd.0001370. - DOI - PMC - PubMed

Publication types

MeSH terms