Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010;11(11):R112.
doi: 10.1186/gb-2010-11-11-r112. Epub 2010 Nov 23.

Patient-oriented gene set analysis for cancer mutation data

Affiliations

Patient-oriented gene set analysis for cancer mutation data

Simina M Boca et al. Genome Biol. 2010.

Abstract

Recent research has revealed complex heterogeneous genomic landscapes in human cancers. However, mutations tend to occur within a core group of pathways and biological processes that can be grouped into gene sets. To better understand the significance of these pathways, we have developed an approach that initially scores each gene set at the patient rather than the gene level. In mutation analysis, these patient-oriented methods are more transparent, interpretable, and statistically powerful than traditional gene-oriented methods.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Observed (blue) and expected number of altered samples (Ts) across the gene sets in the dataset from [4], as a function of the size of the gene set. The expected numbers are computed using the permutation null and denoted by E(Ts|Hs0). The values within two standard deviations of the E(Ts|Hs0) are also shown.
Figure 2
Figure 2
CAT plot comparing the patient-oriented methods to the gene-oriented method for the glioblastoma data from [4]. Each graph represents a pairwise comparison of two methods: The gene sets are ranked according to the P-value, a list of top gene sets is created at each rank, then the fraction of gene sets in the list common to both methods is graphed.
Figure 3
Figure 3
Power analysis. Plot of the average number of truly positive (spiked-in) gene sets included in the top X gene sets, as X varies. The red line indicates the ideal scenario over 100 simulation runs. Simulations use the permutation null (top panel) or the passenger null (bottom panel) data-generating mechanisms. The four patient-oriented methods return more true positives than the gene-oriented method, except when one focuses on short lists, which include sets that are relatively easy to detect. (Note that the overlap of the two methods which use the passenger null looks like a star (*).)
Figure 4
Figure 4
Calibration analysis. Plot of the average true fraction of false discoveries versus the average q-value over 100 simulation runs. The identity line indicates the ideal scenario of a perfect FDR-control rate; being above the identity line indicates anti-conservative behavior, and being below the identity line indicates conservative behavior. Simulations use the permutation null (top panel) or the passenger null (bottom panel). The gene-oriented method shows very anti-conservative behavior while the patient-oriented methods are generally calibrated or conservative.

References

    1. Sjoblom T, Jones S, Wood LD, Parsons DW, Lin J, Barber TD, Mandelker D, Leary RJ, Ptak J, Silliman N, Szabo S, Buckhaults P, Farrell C, Meeh P, Markowitz SD, Willis J, Dawson D, Willson JK, Gazdar AF, Hartigan J, Wu L, Liu C, Parmigiani G, Park BH, Bachman KE, Papadopoulos N, Vogelstein B, Kinzler KW, Velculescu VE. The consensus coding sequences of human breast and colorectal cancers. Science. 2006;314:268–274. doi: 10.1126/science.1133427. - DOI - PubMed
    1. Wood LD, Parsons DW, Jones S, Lin J, Sjoblom T, Leary RJ, Shen D, Boca SM, Barber T, Ptak J, Silliman N, Szabo S, Dezso Z, Ustyanksky V, Nikolskaya T, Nikolsky Y, Karchin R, Wilson PA, Kaminker JS, Zhang Z, Croshaw R, Willis J, Dawson D, Shipitsin M, Willson JK, Sukumar S, Polyak K, Park BH, Pethiyagoda CL, Pant PV. et al.The genomic landscapes of human breast and colorectal cancers. Science. 2007;318:1108–1113. doi: 10.1126/science.1145720. - DOI - PubMed
    1. Jones S, Zhang X, Parsons DW, Lin JC, Leary RJ, Angenendt P, Mankoo P, Carter H, Kamiyama H, Jimeno A, Hong SM, Fu B, Lin MT, Calhoun ES, Kamiyama M, Walter K, Nikolskaya T, Nikolsky Y, Hartigan J, Smith DR, Hidalgo M, Leach SD, Klein AP, Jaffee EM, Goggins M, Maitra A, Iacobuzio-Donahue C, Eshleman JR, Kern SE, Hruban RH. et al.Core signaling pathways in human pancreatic cancers revealed by global genomic analyses. Science. 2008;321:1801–1806. doi: 10.1126/science.1164368. - DOI - PMC - PubMed
    1. Parsons DW, Jones S, Zhang X, Lin JC, Leary RJ, Angenendt P, Mankoo P, Carter H, Siu IM, Gallia GL, Olivi A, McLendon R, Rasheed BA, Keir S, Nikolskaya T, Nikolsky Y, Busam DA, Tekleab H, Diaz LA Jr, Hartigan J, Smith DR, Strausberg RL, Marie SK, Shinjo SM, Yan H, Riggins GJ, Bigner DD, Karchin R, Papadopoulos N, Parmigiani G. et al.An integrated genomic analysis of human glioblastoma multiforme. Science. 2008;321:1807–1812. doi: 10.1126/science.1164382. - DOI - PMC - PubMed
    1. Ley TJ, Mardis ER, Ding L, Fulton B, McLellan MD, Chen K, Dooling D, Dunford-Shore BH, McGrath S, Hickenbotham M, Cook L, Abbott R, Larson DE, Koboldt DC, Pohl C, Smith S, Hawkins A, Abbott S, Locke D, Hillier LW, Miner T, Fulton L, Magrini V, Wylie T, Glasscock J, Conyers J, Sander N, Shi X, Osborne JR, Minx P. et al.DNA sequencing of a cytogenetically normal acute myeloid leukaemia genome. Nature. 2008;456:66–72. doi: 10.1038/nature07485. - DOI - PMC - PubMed

Publication types