Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2012 Mar 7;30(3):226-9.
doi: 10.1038/nbt.2134.

Detecting and annotating genetic variations using the HugeSeq pipeline

Detecting and annotating genetic variations using the HugeSeq pipeline

Hugo Y K Lam et al. Nat Biotechnol. .
No abstract available

PubMed Disclaimer

Conflict of interest statement

COMPETING FINANCIAL INTERESTS

The authors declare competing financial interests: details accompany the full-text HTML version of the paper at http://www.nature.com/naturebiotechnology.

Figures

Figure 1
Figure 1
A MapReduce approach for detecting genetic variants from high-throughput genome sequencing. Phase 1 is the mapping phase including sequence alignment. Phase 2 is the sorting phase including sorting alignments by mapped chromosomes. Phase 3 is the reduction phase including variant detection. chr1, chromosome one; chrM, chromosome M; SV, structural variation; SR, split-read analysis; RD, read-depth analysis; RP, read-pair mapping; JM, junction mapping; VCF, variant call format; GFF, general feature format.
Figure 2
Figure 2
Accuracy and sensitivity of variant detection. (a) Concordant and specific calls in SNP detection by GATK and SAMtools. Original calls, total number of calls from methods; specific calls, total number of method-specific calls. Validated calls, number of calls validated by the array. Ti/Tv, transition-to-transversion rate. (b) Overall sensitivity of SNP and structural variation or CNV detection over different sequencing coverage. X is the average number of reads representing a given nucleotide in a haploid human genome. 50% indicates that detected structural variation calls overlap ≥50% the array calls.

References

    1. Dean J, Ghemawat S. MapReduce: simplified data processing on large clusters. OSDI’04 Proceedings of the 6th Symposium on Operating Systems Design and Implementation; San Francisco. 2004.
    1. Li H, Durbin R. Bioinformatics. 2009;25:1754–1760. - PMC - PubMed
    1. Li H, et al. Bioinformatics. 2009;25:2078–2079. - PMC - PubMed
    1. McKenna A, et al. Genome Res. 2010;20:1297–1303. - PMC - PubMed
    1. Albers CA, et al. Genome Res. 2011;21:961–973. - PMC - PubMed

Publication types