Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Apr;82(4):903-15.
doi: 10.1016/j.ajhg.2008.01.012. Epub 2008 Mar 20.

SNP arrays in heterogeneous tissue: highly accurate collection of both germline and somatic genetic information from unpaired single tumor samples

Affiliations

SNP arrays in heterogeneous tissue: highly accurate collection of both germline and somatic genetic information from unpaired single tumor samples

Guillaume Assié et al. Am J Hum Genet. 2008 Apr.

Abstract

SNP arrays provide reliable genotypes and can detect chromosomal aberrations at a high resolution. However, tissue heterogeneity is currently a major limitation for somatic tissue analysis. We have developed SOMATICs, an original program for accurate analysis of heterogeneous tissue samples. Fifty-four samples (42 tumors and 12 normal tissues) were processed through Illumina Beadarrays and then analyzed with SOMATICs. We demonstrate that tissue heterogeneity-related limitations not only can be overcome but can also be turned into an advantage. First, admixture of normal cells with tumor can be used as an internal reference, thereby enabling highly sensitive detection of somatic deletions without having corresponding normal tissue. Second, the presence of normal cells allows for discrimination of somatic from germline aberrations, and the proportion of cells in the tissue sample that are harboring the somatic events can be assessed. Third, relatively early versus late somatic events can also be distinguished, assuming that late events occur only in subsets of cancer cells. Finally, admixture by normal cells allows inference of germline genotypes from a cancer sample. All this information can be obtained from any cancer sample containing a proportion of 40-75% of cancer cells. SOMATICs is a ready-to-use open-source program that integrates all of these features into a simple format, comprehensively describing each chromosomal event.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Automatic Detection of Chromosomal Aberrations in Heterogeneous Tissue Samples In each panel, the BAF and the logR ratio are plotted along chromosomal regions. The black boxes at the bottom represent detection with SOMATICs and other currently available methods, namely, Beadstudio LOH score (y axis range from 0 to 5), dChip (default detection threshold), and CNVPartition. (A) Small germline deletion is revealed as a decreased logR ratio (−0.5), and the band of heterozygous SNPs centered on 0.5 is absent on the BAF plot. (B–E) Various types of somatic deletions revealed on the BAF plot in which the single band of heterozygous SNPs is replaced by two bands and the logR ratio is decreased, but to a lesser extent as compared to germline deletions. The various types of somatic deletions are manifested by differences in the position and the size of the two-band patterns. Note that (E) shows a somatic deletion occurring in very few cells. In this situation, detection is easier with use of the BAF (two-band pattern) than with the logR (reduced shift downward). However, the copy number call as a “deletion” relies on a significant decrease of the logR (in this situation, p < 2.2e−16). (F) Wavy fluctuations of logR ratio, which is not reflected in the BAF, are artefacts. This artifact is responsible for false-positive detection by programs focusing only on logR ratio. (G) Two small germline duplications are revealed on the BAF plot as heterozygous SNPs showing a two-band pattern with a logR ratio that is increased. (H) In somatic duplication, the two bands are closer to one another and the increase in logR is less than that of germline amplification (G). As with (E), for (H), the BAF two-band pattern is easier to detect than is the logR shift upward. However, the copy number call as an “amplification” relies on a significant increase of the logR (in this case, p < 2.2e−16). SOMATICs can detect and differentiate the various types of alterations when other programs cannot.
Figure 2
Figure 2
Proportion of Cells, “c,” in a Heterogeneous Tumor Sample Harboring a Somatic Genetic Event (A) BAF and the logR ratio plots from one chromosome reveal three somatic hemizygous deletions occurring in three different proportions of cells. The BAFdel of heterozygote SNPs is measured after “folding” the BAF plot along an axis centered around 0.5. Then BAFdel is used to determine c with the formula designed for somatic deletions (see Appendices for details). (B) Frequency distribution showing the number of SNPs included in the somatic deletions by the proportion of cells, “c,” in which these events occur. Some somatic deletions occur in over 80% of cells (rightmost bar). Assuming that only cancer cells harbor somatic deletions, the proportion of cancer cells is then estimated as 80% in this sample. (C) Schematic illustrating the relationship between the chronology of somatic events during tumorigenesis and the proportion of cancer cells with these events. Early somatic events are present in all (or a great majority of) cancer cells, whereas late somatic events are only present in subsets of cells.
Figure 3
Figure 3
Germline Genotype Inference from a Heterogeneous Cancer Sample (A) Schematic representation of the calling methods used by SOMATICs and by BeadStudio. SOMATICs is specifically designed to generate appropriate calls in genomic regions with allelic imbalance. When bands of heterozygous AB SNPs can be discriminated from the bands of homozygous AA and BB SNPs, a reliable germline genotype call can be provided. (B) illustrates enotyping call rates in regions of allelic imbalance with SOMATICs and BeadStudio, and (C) illlustrates genotyping accuracy in regions of allelic imbalance with SOMATICs and BeadStudio. These results were obtained by comparison of genotypes inferred from tumors with genotypes read in normal corresponding tissue obtained with seven pairs of matched samples. The results in (B) and (C) are expressed as a function of the distance from the heterozygote bands to the homozygote bands, on a scale of 0 to 0.5, representing the mean BAF of the lower band of heterozygote SNPs. This mean BAF is converted into a proportion of cells, “c,” with allelic imbalance by use of the formula for somatic deletions (see Appendices for details).

References

    1. Dutt A., Beroukhim R. Single nucleotide polymorphism array analysis of cancer. Curr. Opin. Oncol. 2007;19:43–49. - PubMed
    1. LaFramboise T., Harrington D., Weir B.A. PLASQ: a generalized linear model-based procedure to determine allelic dosage in cancer cells from SNP array data. Biostatistics. 2007;8:323–336. - PubMed
    1. Huang J., Wei W., Chen J., Zhang J., Liu G., Di X., Mei R., Ishikawa S., Aburatani H., Jones K.W., Shapero M.H. CARAT: a novel method for allelic detection of DNA copy number changes using high density oligonucleotide arrays. BMC Bioinformatics. 2006;7:83. - PMC - PubMed
    1. Komura D., Shen F., Ishikawa S., Fitch K.R., Chen W., Zhang J., Liu G., Ihara S., Nakamura H., Hurles M.E. Genome-wide detection of human copy number variations using high-density DNA oligonucleotide arrays. Genome Res. 2006;16:1575–1584. - PMC - PubMed
    1. Lin M., Wei L.J., Sellers W.R., Lieberfarb M., Wong W.H., Li C. dChipSNP: significance curve and clustering of SNP-array-based loss-of-heterozygosity data. Bioinformatics. 2004 in press. - PubMed

Publication types

LinkOut - more resources