Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014;15 Suppl 11(Suppl 11):S3.
doi: 10.1186/1471-2105-15-S11-S3. Epub 2014 Oct 21.

Leveraging the new with the old: providing a framework for the integration of historic microarray studies with next generation sequencing

Leveraging the new with the old: providing a framework for the integration of historic microarray studies with next generation sequencing

Michael A Bauer et al. BMC Bioinformatics. 2014.

Abstract

Next Generation Sequencing (NGS) methods are rapidly providing remarkable advances in our ability to study the molecular profiles of human cancers. However, the scientific discovery offered by NGS also includes challenges concerning the interpretation of large and non-trivial experimental results. This task is potentially further complicated when a multitude of molecular profiling modalities are available, with the goal of a more integrative and comprehensive analysis of the cancer biology.

PubMed Disclaimer

Figures

Figure 1
Figure 1
NGS Association System Information Architecture. The basis for integration of multiple molecular profiling modalities is illustrated. This includes processes for data transformation, reduction and association, as well as the direct interfacing to multiple custom and third party software tools and subsystems.
Figure 2
Figure 2
Samples and Molecular profiling studies. Illustrated is a diagram of the patient samples and molecular profiling studies used in the various analyses.
Figure 3
Figure 3
System Rendered Results for Key Cancer Biology Genes. Automatically rendered bar chart and table from a Cuffdiff dataset that was searched using two genes (KRAS and TP53), which have cancer biology significance in many cancers, including MM. The Cuffdiff Chart in subfigure A was quickly generated by a single point and click. All genes in the table (subfigure B) contain a significant p-value and q-value.
Figure 4
Figure 4
Triple Integration (WES, Cuffdiff Gene, Microarray). A triple integration was performed by combining Cuffdiff gene data with WES and a gene expression microarray experiment. The integrated WES data (green columns) includes: Effect, Codon Change, AA Change, Tumor DP (Depth), Tumor AF (Allelic Frequency). The corresponding RNA-seq data (light red columns), show the Sample 1 FPKM (pooled normal), Sample 2 FPKM (tumor), and p-value from Cufflinks. The microarray data (blue columns) display the corresponding Affy Probes, and Avg Probe Values. The grey columns are common across all modalities and show, Entrez ID, Gene Symbol, Gene Name, and Member List (gene list).
Figure 5
Figure 5
KRAS Automated Visualization and Exploration. Subfigure A shows an automated visualization of the FPKM Chart for KRAS and its associated isoforms. This was automatically generated by clicking the picture icon located in the rightmost region on the first line of the KRAS entry, in subfigure B. In this view, the novel isoform (i.e., CUFF.19733.1) was also fetched from the experimental data. The columns Transcript Id and P-Value show that only one transcript (ENST00000311936) is significant.
Figure 6
Figure 6
Validation of the KRAS G13D Hotspot Mutation via IGV Automation. IGV automation is utilized to validate the non-synonymous hotspot mutation involving codon 13 for KRAS, where Glycine is replaced by Aspartic Acid (G13D). By a single point and click on the web page entry for KRAS, the IGV display is rendered. During this process, all relevant information is spontaneously sent to IGV. This includes data from Variant Analysis, RNA-seq aligned reads, WES aligned reads, the reference genome and RABT assembly. For illustrative purposes, the significant isoform, ENST00000311936, is circled in green, and the location of the G13D hotspot mutation is circled in red. A potential novel isoform (CUFF.19733.1) from the normal pool is circled in blue.
Figure 7
Figure 7
KRAS Protein Plot. Automation is utilized to generate a "lolliplot", which illustrates the discovered amino acid changes for KRAS in the context of previously reported protein mutations and known domains. The G13D hotspot mutation found in the experiment is colored red. Other canonical mutations are colored blue. The RAS protein domain is colored green. Automation was achieved by a custom interface, along with R, the shiny package and Plot Protein.

References

    1. Johnsen JM, Nickerson DA, Reiner AP. Massively parallel sequencing: the new frontier of hematologic genomics. Blood. 2013;122(19):3268–3275. doi: 10.1182/blood-2013-07-460287. - DOI - PMC - PubMed
    1. Collins FS, Hamburg MA. First FDA authorization for next-generation sequencer. N Engl J Med. 2013;369(25):2369–2371. doi: 10.1056/NEJMp1314561. - DOI - PMC - PubMed
    1. Kandoth C, McLellan MD, Vandin F, Ye K, Niu B, Lu C, Xie M, Zhang Q, McMichael JF, Wyczalkowski MA. et al.Mutational landscape and significance across 12 major cancer types. Nature. 2013;502(7471):333–339. doi: 10.1038/nature12634. - DOI - PMC - PubMed
    1. Mirnezami R, Nicholson J, Darzi A. Preparing for precision medicine. N Engl J Med. 2012;366(6):489–491. doi: 10.1056/NEJMp1114866. - DOI - PubMed
    1. Chapman MA, Lawrence MS, Keats JJ, Cibulskis K, Sougnez C, Schinzel AC, Harview CL, Brunet JP, Ahmann GJ, Adli M. et al.Initial genome sequencing and analysis of multiple myeloma. Nature. 2011;471(7339):467–472. doi: 10.1038/nature09837. - DOI - PMC - PubMed

Publication types

MeSH terms