Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2018 Dec;27(12):3797-3813.
doi: 10.1177/0962280217712271. Epub 2017 May 29.

Testing for differentially expressed genetic pathways with single-subject N-of-1 data in the presence of inter-gene correlation

Affiliations

Testing for differentially expressed genetic pathways with single-subject N-of-1 data in the presence of inter-gene correlation

A Grant Schissler et al. Stat Methods Med Res. 2018 Dec.

Abstract

Modern precision medicine increasingly relies on molecular data analytics, wherein development of interpretable single-subject ("N-of-1") signals is a challenging goal. A previously developed global framework, N-of-1- pathways, employs single-subject gene expression data to identify differentially expressed gene set pathways in an individual patient. Unfortunately, the limited amount of data within the single-subject, N-of-1 setting makes construction of suitable statistical inferences for identifying differentially expressed gene set pathways difficult, especially when non-trivial inter-gene correlation is present. We propose a method that exploits external information on gene expression correlations to cluster positively co-expressed genes within pathways, then assesses differential expression across the clusters within a pathway. A simulation study illustrates that the cluster-based approach exhibits satisfactory false-positive error control and reasonable power to detect differentially expressed gene set pathways. An example with a single N-of-1 patient's triple negative breast cancer data illustrates use of the methodology.

Keywords: Gene expression data; N-of-1; RNA-seq; affinity propagation clustering; exemplar learning; gene set; inter-gene correlation; precision medicine; single-subject inference; triple negative breast cancer.

PubMed Disclaimer

Conflict of interest statement

Declaration of conflicting interests

The authors declare no conflicts of interest.

Figures

Figure 1
Figure 1
Empirical false-positive rates (dots) based on 2000 simulated N-of-1-pathways data sets for three competing testing procedures (lower horizontal axis: Clustered-T = proposed test, naïve t = standard t test, Wilcoxon = signed-rank test), cross-classified by correlation structure (top: Independent = uncorrelated mRNA expression, Block = cluster-correlated expression, All = unconstrained inter-gene correlation) and pathway size G (left). The corresponding cluster numbers, m, for each pathway are also listed; see Table 2. Nominal significance level is set to α = 0.05 (dotted horizontal lines). Results reported for ψ = 1.5 (see text). Horizontal bars are pointwise 95% Agresti-Coull confidence intervals for the underlying false-positive rate based on each set of 2000 simulated samples.
Figure 2
Figure 2
Empirical rejection probabilities (‘power’) for the AP-based cluster approach using (7), based on 2000 simulated N-of-1-pathways data sets. Results are presented as a function of DEG proportion π (lower horizontal axis). Displays are cross-classified by correlation structure (top: Independent = uncorrelated mRNA expression, Block = cluster-correlated expression, All = unconstrained inter-gene correlation) and pathway size G (left). The corresponding cluster numbers, m, for each pathway are also listed; see Table 2. Simulated fold change is indicated by line styling: ψ = 4 (solid lines), ψ = 2 (dashes), and ψ = 1.5 (dot-dashes). Nominal significance level is set to α = 0.05. Horizontal bars are pointwise 95% Agresti-Coull confidence intervals for the underlying rejection rate based on each set of 2000 simulated samples.
Figure 3
Figure 3
Comparison of −log{p} values from Spectral Clustering (SC) vs. AP clustering in the clustered-T test of (7) when applied to TNBC data from Sec. 4. Dot color indicates p-value overlap status: (i) dark gray dots for significant p-value overlaps (both below 5% cutoff), (ii) white dots for insignificant p-value overlaps (both above 5% cutoff), (iii) light gray dots for p-value discords with ITS match (high informatic similarity), and (iv) black dots for p-value discords with no ITS match. See text for details.

Similar articles

Cited by

References

    1. van't Veer LJ, et al. Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002;415:530–536. - PubMed
    1. Lang JE, et al. Expression profiling of circulating tumor cells in metastatic breast cancer. Breast Cancer Res Treat. 2014;149:121–131. - PMC - PubMed
    1. Yang X, et al. Single sample expression-anchored mechanisms predict survival in head and neck cancer. PLoS Comput Biol. 2012;8:e1002350. - PMC - PubMed
    1. Perez-Rathke A, Li H, Lussier YA. Interpreting personal transcriptomes: personalized mechanism-scale profiling of RNA-seq data. Pac Symp Biocomput. 2013;18:159–170. - PMC - PubMed
    1. Lillie EO, et al. The n-of-1 clinical trial: the ultimate strategy for individualizing medicine? Per Med. 2011;8:161–173. - PMC - PubMed

Publication types

LinkOut - more resources