Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Jun 15:5:1384.
doi: 10.12688/f1000research.8967.2. eCollection 2016.

An end to end workflow for differential gene expression using Affymetrix microarrays

Affiliations

An end to end workflow for differential gene expression using Affymetrix microarrays

Bernd Klaus et al. F1000Res. .

Abstract

In this article, we walk through an end-to-end Affymetrix microarray differential expression workflow using Bioconductor packages. This workflow is directly applicable to current "Gene'' type arrays, e.g.the HuGene or MoGene arrays, but can easily be adapted to similar platforms. The data analyzed here is a typical clinical microarray data set that compares inflamed and non-inflamed colon tissue in two disease subtypes. For each disease, the differential gene expression between inflamed- and non-inflamed colon tissue was analyzed. We will start from the raw data CEL files, show how to import them into a Bioconductor ExpressionSet, perform quality control and normalization and finally differential gene expression (DE) analysis, followed by some enrichment analysis.

Keywords: gene expression; microarray.

PubMed Disclaimer

Conflict of interest statement

No competing interests were disclosed.

Figures

Figure 1.
Figure 1.. Structure of Bioconductor’s ExpressionSet class.
Figure 2.
Figure 2.. PCA plot of the log–transformed raw expression data.
Figure 3.
Figure 3.. Intensity boxplots of the log2–transformed raw data.
Figure 4.
Figure 4.. Visualization of the difference between "Exon" type array (left) and "Gene" type array (right).
Figure 5.
Figure 5.. Boxplot for the RLE values.
Figure 6.
Figure 6.. PCA plot of the calibrated, summarized data.
Figure 7.
Figure 7.. Heatmap of the sample-to-sample distances.
Figure 8.
Figure 8.. Histogram of the median intensities per gene.
Figure 9.
Figure 9.. Histogram of the median intensities per gene with manual intensity filtering threshold (red line).
Figure 10.
Figure 10.. Visualization of expression changes.
Figure 11.
Figure 11.. Expression changes for the CRAT gene.
Figure 12.
Figure 12.. Histogram of the p–values for Crohn’s disease.
Figure 13.
Figure 13.. Histogram of the p–values for ulcerative colitis.
Figure 14.
Figure 14.. Volcano plot of the DE-genes.
Figure 15.
Figure 15.. Selecting a background set of genes for the gene ontology analysis.
Figure 16.
Figure 16.. Significantly enriched GO nodes in the GO hierarchy.
Figure 17.
Figure 17.. Enriched Reactome pathways and their p–values as a bar chart.
Figure 18.
Figure 18.. Enriched Reactome pathways enrichment results as a graph.

References

    1. Palmieri O, Creanza TM, Bossa F, et al. : Genome-wide Pathway Analysis Using Gene Expression Data of Colonic Mucosa in Patients with Inflammatory Bowel Disease. Inflamm Bowel Dis. 2015;21(6):1260–8. 10.1097/MIB.0000000000000370 - DOI - PMC - PubMed
    1. Kolesnikov N, Hastings E, Keays M, et al. : ArrayExpress update--simplifying data submissions. Nucleic Acids Res. 2015;43(Database issue):D1113–D1116. 10.1093/nar/gku1057 - DOI - PMC - PubMed
    1. Huber W, Carey VJ, Gentleman R, et al. : Orchestrating high-throughput genomic analysis with Bioconductor. Nat Methods. 2015;12(2):115–121. 10.1038/nmeth.3252 - DOI - PMC - PubMed
    1. Carvalho BS, Irizarry RA: A framework for oligonucleotide microarray preprocessing. Bioinformatics. 2010;26(19):2363–7. 10.1093/bioinformatics/btq431 - DOI - PMC - PubMed
    1. Kauffmann A, Gentleman R, Huber W: arrayQualityMetrics--a bioconductor package for quality assessment of microarray data. Bioinformatics. 2009;25(3):415–6. 10.1093/bioinformatics/btn647 - DOI - PMC - PubMed

LinkOut - more resources