Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Comparative Study
. 2012;7(6):e38919.
doi: 10.1371/journal.pone.0038919. Epub 2012 Jun 11.

Modeling skewness in human transcriptomes

Affiliations
Comparative Study

Modeling skewness in human transcriptomes

Joaquim Casellas et al. PLoS One. 2012.

Abstract

Gene expression data are influenced by multiple biological and technological factors leading to a wide range of dispersion scenarios, although skewed patterns are not commonly addressed in microarray analyses. In this study, the distribution pattern of several human transcriptomes has been studied on free-access microarray gene expression data. Our results showed that, even in previously normalized gene expression data, probe and differential expression within probe effects suffer from substantial departures from the commonly assumed symmetric gaussian distribution. We developed a flexible mixed model for non-competitive microarray data analysis that accounted for asymmetric and heavy-tailed (Student's t distribution) dispersion processes. Random effects for gene expression data were modeled under asymmetric Student's t distributions where the asymmetry parameter (λ) took values from perfect symmetry (λ = 0) to right- (λ>0) or left-side (λ>0) over-expression patterns. This approach was applied to four free-access human data sets and revealed clearly better model performance when comparing with standard approaches accounting for traditional symmetric gaussian distribution patterns. Our analyses on human gene expression data revealed a substantial degree of right-hand asymmetry for probe effects, whereas differential gene expression addressed both symmetric and left-hand asymmetric patterns. Although these results cannot be extrapolated to all microarray experiments, they highlighted the incidence of skew dispersion patterns in human transcriptome; moreover, we provided a new analytical approach to appropriately address this biological phenomenon. The source code of the program accommodating these analytical developments and additional information about practical aspects on running the program are freely available by request to the corresponding author of this article.

PubMed Disclaimer

Conflict of interest statement

Competing Interests: The authors have declared that no competing interests exist.

Figures

Figure 1
Figure 1
Distribution of mean estimates for probe (a) and differential expression within-probe (b) effects under Model SG (grey line) and Model AT (black line) for data set 3.

Similar articles

Cited by

References

    1. Cui X, Churchill GA. Statistical tests for differential expression in cDNA microarray experiments. Genome Biol. 2003;4:210. - PMC - PubMed
    1. Hoeschele I, Li H. A note on joint versus gene-specific mixed model analysis of microarray gene expression data. Biostatistics. 2005;6:186. - PubMed
    1. Wolfinger RD, Gibson G, Wolfinger ED, Bennett L, Hamadeh H. Assessing gene significance from cDNA microarray expression data via mixed models. J Comput Biol. 2001;8:637. - PubMed
    1. Searle SR. Matrix Algebra Useful for Statistics. John Wiley & Sons, New York, NY. 1982.
    1. Casellas J, Ibáñez-Escriche N, Martínez-Giner M, Varona L. GEAMM v1.4.: a versatile program for mixed model analysis of gene expression data. Anim Genet. 2008;39:90. - PubMed

Publication types

MeSH terms