. 2018 Dec;27(12):3797-3813.

doi: 10.1177/0962280217712271. Epub 2017 May 29.

Testing for differentially expressed genetic pathways with single-subject N-of-1 data in the presence of inter-gene correlation

A Grant Schissler^{1

2

3

4}, Walter W Piegorsch^{1

2

3

5}, Yves A Lussier^{1

2

3

4}

Affiliations

¹ 1 Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ, USA.
² 2 Center for Biomedical Informatics and Biostatistics (CB2), The University of Arizona, Tucson, AZ, USA.
³ 3 BIO5 Institute, The University of Arizona, Tucson, AZ, USA.
⁴ 4 Department of Medicine, The University of Arizona, Tucson, AZ, USA.
⁵ 5 Department of Mathematics, The University of Arizona, Tucson, AZ, USA.

PMID: 28552011
PMCID: PMC5554097
DOI: 10.1177/0962280217712271

Testing for differentially expressed genetic pathways with single-subject N-of-1 data in the presence of inter-gene correlation

A Grant Schissler et al. Stat Methods Med Res. 2018 Dec.

. 2018 Dec;27(12):3797-3813.

doi: 10.1177/0962280217712271. Epub 2017 May 29.

Authors

A Grant Schissler^{1

2

3

4}, Walter W Piegorsch^{1

2

3

5}, Yves A Lussier^{1

2

3

4}

Affiliations

¹ 1 Interdisciplinary Program in Statistics, The University of Arizona, Tucson, AZ, USA.
² 2 Center for Biomedical Informatics and Biostatistics (CB2), The University of Arizona, Tucson, AZ, USA.
³ 3 BIO5 Institute, The University of Arizona, Tucson, AZ, USA.
⁴ 4 Department of Medicine, The University of Arizona, Tucson, AZ, USA.
⁵ 5 Department of Mathematics, The University of Arizona, Tucson, AZ, USA.

PMID: 28552011
PMCID: PMC5554097
DOI: 10.1177/0962280217712271

Abstract

Modern precision medicine increasingly relies on molecular data analytics, wherein development of interpretable single-subject ("N-of-1") signals is a challenging goal. A previously developed global framework, N-of-1- pathways, employs single-subject gene expression data to identify differentially expressed gene set pathways in an individual patient. Unfortunately, the limited amount of data within the single-subject, N-of-1 setting makes construction of suitable statistical inferences for identifying differentially expressed gene set pathways difficult, especially when non-trivial inter-gene correlation is present. We propose a method that exploits external information on gene expression correlations to cluster positively co-expressed genes within pathways, then assesses differential expression across the clusters within a pathway. A simulation study illustrates that the cluster-based approach exhibits satisfactory false-positive error control and reasonable power to detect differentially expressed gene set pathways. An example with a single N-of-1 patient's triple negative breast cancer data illustrates use of the methodology.

Keywords: Gene expression data; N-of-1; RNA-seq; affinity propagation clustering; exemplar learning; gene set; inter-gene correlation; precision medicine; single-subject inference; triple negative breast cancer.

PubMed Disclaimer

Conflict of interest statement

Declaration of conflicting interests

The authors declare no conflicts of interest.

Figures

**Figure 1**
Empirical false-positive rates (dots) based on 2000 simulated N-of-1-*pathways* data sets for three competing testing procedures (lower horizontal axis: Clustered-T = proposed test, naïve t = standard t test, Wilcoxon = signed-rank test), cross-classified by correlation structure (top: Independent = uncorrelated mRNA expression, Block = cluster-correlated expression, All = unconstrained inter-gene correlation) and pathway size G (left). The corresponding cluster numbers, m, for each pathway are also listed; see Table 2. Nominal significance level is set to α = 0.05 (dotted horizontal lines). Results reported for ψ = 1.5 (see text). Horizontal bars are pointwise 95% Agresti-Coull confidence intervals for the underlying false-positive rate based on each set of 2000 simulated samples.

**Figure 2**
Empirical rejection probabilities (‘power’) for the AP-based cluster approach using (7), based on 2000 simulated N-of-1-*pathways* data sets. Results are presented as a function of DEG proportion π (lower horizontal axis). Displays are cross-classified by correlation structure (top: Independent = uncorrelated mRNA expression, Block = cluster-correlated expression, All = unconstrained inter-gene correlation) and pathway size G (left). The corresponding cluster numbers, m, for each pathway are also listed; see Table 2. Simulated fold change is indicated by line styling: ψ = 4 (solid lines), ψ = 2 (dashes), and ψ = 1.5 (dot-dashes). Nominal significance level is set to α = 0.05. Horizontal bars are pointwise 95% Agresti-Coull confidence intervals for the underlying rejection rate based on each set of 2000 simulated samples.

**Figure 3**
Comparison of −log{p} values from Spectral Clustering (SC) vs. AP clustering in the clustered-T test of (7) when applied to TNBC data from Sec. 4. Dot color indicates p-value overlap status: (i) dark gray dots for significant p-value overlaps (both below 5% cutoff), (ii) white dots for insignificant p-value overlaps (both above 5% cutoff), (iii) light gray dots for p-value discords with ITS match (high informatic similarity), and (iv) black dots for p-value discords with no ITS match. See text for details.

See this image and copyright information in PMC

Cited by

Accounting for extra-binomial variability with differentially expressed genetic pathway data: a collaborative bioinformatic study.
Aberasturi DT, Piegorsch WW, Bedrick EJ, Lussier YA. Aberasturi DT, et al. Stat (Int Stat Inst). 2023 Jan-Dec;12(1):e518. doi: 10.1002/sta4.518. Epub 2022 Oct 24. Stat (Int Stat Inst). 2023. PMID: 37885703 Free PMC article.
A Single-Subject Method to Detect Pathways Enriched With Alternatively Spliced Genes.
Schissler AG, Aberasturi D, Kenost C, Lussier YA. Schissler AG, et al. Front Genet. 2019 May 9;10:414. doi: 10.3389/fgene.2019.00414. eCollection 2019. Front Genet. 2019. PMID: 31143202 Free PMC article.
'Single-subject studies'-derived analyses unveil altered biomechanisms between very small cohorts: implications for rare diseases.
Aberasturi D, Pouladi N, Zaim SR, Kenost C, Berghout J, Piegorsch WW, Lussier YA. Aberasturi D, et al. Bioinformatics. 2021 Jul 12;37(Suppl_1):i67-i75. doi: 10.1093/bioinformatics/btab290. Bioinformatics. 2021. PMID: 34252934 Free PMC article.
Evaluating single-subject study methods for personal transcriptomic interpretations to advance precision medicine.
Rachid Zaim S, Kenost C, Berghout J, Vitali F, Zhang HH, Lussier YA. Rachid Zaim S, et al. BMC Med Genomics. 2019 Jul 11;12(Suppl 5):96. doi: 10.1186/s12920-019-0513-8. BMC Med Genomics. 2019. PMID: 31296218 Free PMC article.
Emergence of pathway-level composite biomarkers from converging gene set signals of heterogeneous transcriptomic responses.
Zaim SR, Li Q, Schissler AG, Lussier YA. Zaim SR, et al. Pac Symp Biocomput. 2018;23:484-495. Pac Symp Biocomput. 2018. PMID: 29218907 Free PMC article.

See all "Cited by" articles

References

1. van't Veer LJ, et al. Gene expression profiling predicts clinical outcome of breast cancer. Nature. 2002;415:530–536. - PubMed
1. Lang JE, et al. Expression profiling of circulating tumor cells in metastatic breast cancer. Breast Cancer Res Treat. 2014;149:121–131. - PMC - PubMed
1. Yang X, et al. Single sample expression-anchored mechanisms predict survival in head and neck cancer. PLoS Comput Biol. 2012;8:e1002350. - PMC - PubMed
1. Perez-Rathke A, Li H, Lussier YA. Interpreting personal transcriptomes: personalized mechanism-scale profiling of RNA-seq data. Pac Symp Biocomput. 2013;18:159–170. - PMC - PubMed
1. Lillie EO, et al. The n-of-1 clinical trial: the ultimate strategy for individualizing medicine? Per Med. 2011;8:161–173. - PMC - PubMed

Publication types

Actions
Actions

MeSH terms

Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions
Actions

Grants and funding

LinkOut - more resources

Full Text Sources
Other Literature Sources
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Testing for differentially expressed genetic pathways with single-subject N-of-1 data in the presence of inter-gene correlation

Affiliations

Testing for differentially expressed genetic pathways with single-subject N-of-1 data in the presence of inter-gene correlation

Authors

Affiliations

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources

Abstract

Conflict of interest statement

Figures

Similar articles

Cited by

References

Publication types

MeSH terms

Related information

Grants and funding

LinkOut - more resources

Full Text Sources

Other Literature Sources