Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2011 Oct 17;3(9):63.
doi: 10.1186/gm279.

Classification of unknown primary tumors with a data-driven method based on a large microarray reference database

Affiliations

Classification of unknown primary tumors with a data-driven method based on a large microarray reference database

Kalle A Ojala et al. Genome Med. .

Abstract

We present a new method to analyze cancer of unknown primary origin (CUP) samples. Our method achieves good results with classification accuracy (88% leave-one-out cross validation for primary tumors from 56 categories, 78% for CUP samples), and can also be used to study CUP samples on a gene-by-gene basis. It is not tied to any a priori defined gene set as many previous methods, and is adaptable to emerging new information.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A graph of the accuracy of the method as a function of the similarity score of the best hit. The graph was formed by moving a sliding window of width 0.1 along the score axis, which ranges from -0.021 to 0.495, and calculating the achieved accuracy within that window. As can clearly be seen, the better the similarity, the higher the probability that the classification is correct.
Figure 2
Figure 2
Similarities for 83 metastatic test samples. (a) A comparison of the test samples' similarities to the healthy tissue where the metastasis was found (y-axis) and the cancer of origin for the metastasis (x-axis). The spheres are colored according the site of the metastasis ('target tissue'). The gray diagonal line indicates a boundary, above which the similarity to the target tissue is greater than the similarity to the sample's original cancer. Only ten samples display this behavior, and all but one of these are lymph node metastases. (b) A comparison of the test samples' similarities to a representative cancer of the tissue where the metastasis was found (y-axis) and the cancer of origin for the metastasis (x-axis). The triangles are colored according to the site of the metastasis ('target tissue'). The gray diagonal line indicates a boundary above which the similarity to the cancer of the target tissue is greater than the similarity to the sample's original cancer. As can be seen, most samples fall below this line.
Figure 3
Figure 3
Cancer-specific genes. (a) Tissue specificity scores, unmodified by gene weight, for genes whose weight in renal cancer is greater than 0.25 (40 out of 58 present) are shown for 10 renal cancer metastasis samples. The genes can be divided into two groups, those that lose their renal cancer-specific expression (blue) and those that do not (red). The samples are named according to where the metastasis was located, and numbered according to their (relative to each other) similarity to renal cancer. Sample 10 was the only one whose closest similarity was not renal cancer, it instead being lung squamous cell carcinoma. Samples 1 to 3 are the closest to renal cancer, and retain for most of the genes renal cancer-specific expression levels. The other samples have lost renal cancer-specific expression among the genes with a blue background. (b) Similar analysis for the 17 metastatic melanoma samples, showing 42 (out of 63) genes. (c) Similar analysis for 10 metastatic gastric cancer samples, showing 40 (out of 53) genes.

References

    1. Pavlidis N, Fizazi K. Cancer of unknown primary (CUP). Crit Rev Oncol Hematol. 2005;54:243–250. doi: 10.1016/j.critrevonc.2004.10.002. - DOI - PubMed
    1. American Cancer Society. Cancer Facts and Figures 2010. Atlanta, GA: American Cancer Society; 2010.
    1. Pavlidis N, Pentheroudakis G. Cancer of unknown primary site: 20 questions to be answered. Ann Oncol. 2010;21(Suppl 7):vii303–vii307. - PubMed
    1. Talantov D, Baden J, Jatkoe T, Hahn K, Yu J, Rajpurohit Y, Jiang Y, Choi C, Ross JS, Atkins D, Wang Y, Mazumder A. A quantitative reverse transcriptase-polymerase chain reaction assay to identify metastatic carcinoma tissue of origin. J Mol Diagn. 2006;8:320–329. doi: 10.2353/jmoldx.2006.050136. - DOI - PMC - PubMed
    1. Varadhachary GR, Talantov D, Raber MN, Meng C, Hess KR, Jatkoe T, Lenzi R, Spigel DR, Wang Y, Greco FA, Abbruzzese JL, Hainsworth JD. Molecular profiling of carcinoma of unknown primary and correlation with clinical evaluation. J Clin Oncol. 2008;26:4442–4448. doi: 10.1200/JCO.2007.14.4378. - DOI - PubMed

LinkOut - more resources