Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2006 Apr 25:7:91.
doi: 10.1186/1471-2164-7-91.

CGHScan: finding variable regions using high-density microarray comparative genomic hybridization data

Affiliations

CGHScan: finding variable regions using high-density microarray comparative genomic hybridization data

Bradley D Anderson et al. BMC Genomics. .

Abstract

Background: Comparative genomic hybridization can rapidly identify chromosomal regions that vary between organisms and tissues. This technique has been applied to detecting differences between normal and cancerous tissues in eukaryotes as well as genomic variability in microbial strains and species. The density of oligonucleotide probes available on current microarray platforms is particularly well-suited for comparisons of organisms with smaller genomes like bacteria and yeast where an entire genome can be assayed on a single microarray with high resolution. Available methods for analyzing these experiments typically confine analyses to data from pre-defined annotated genome features, such as entire genes. Many of these methods are ill suited for datasets with the number of measurements typical of high-density microarrays.

Results: We present an algorithm for analyzing microarray hybridization data to aid identification of regions that vary between an unsequenced genome and a sequenced reference genome. The program, CGHScan, uses an iterative random walk approach integrating multi-layered significance testing to detect these regions from comparative genomic hybridization data. The algorithm tolerates a high level of noise in measurements of individual probe intensities and is relatively insensitive to the choice of method for normalizing probe intensity values and identifying probes that differ between samples. When applied to comparative genomic hybridization data from a published experiment, CGHScan identified eight of nine known deletions in a Brucella ovis strain as compared to Brucella melitensis. The same result was obtained using two different normalization methods and two different scores to classify data for individual probes as representing conserved or variable genomic regions. The undetected region is a small (58 base pair) deletion that is below the resolution of CGHScan given the array design employed in the study.

Conclusion: CGHScan is an effective tool for analyzing comparative genomic hybridization data from high-density microarrays. The algorithm is capable of accurately identifying known variable regions and is tolerant of high noise and varying methods of data preprocessing. Statistical analysis is used to define each variable region providing a robust and reliable method for rapid identification of genomic differences independent of annotated gene boundaries.

PubMed Disclaimer

Figures

Figure 1
Figure 1
An illustration of the random walk. A region is defined by Dini and Dmin, shown here as a shaded area spanning probes 4–11.
Figure 2
Figure 2
The second iteration detects conserved regions within first iteration regions.
Figure 3
Figure 3
Deletions and variable regions in B. melitensis chromosome II. Regions in red are from Rajashekara et al., (2004). Blue regions are predicted by CGHScan, outer ring using a higher cutoff (which results in lower stringency), and the inner ring with a lower cutoff (which results in higher stringency).

Similar articles

Cited by

References

    1. Behr MA, Wilson MA, Gill WP, Salamon H, Schoolnik GK, Rane S, Small PM. Comparative genomics of BCG vaccines by whole-genome DNA microarray. Science. 1999;284:1520–1523. doi: 10.1126/science.284.5419.1520. - DOI - PubMed
    1. Kato-Maeda M, Rhee JT, Gingeras TR, Salamon H, Drenkow J, Smittipat N, Small PM. Comparing genomes within the species Mycobacterium tuberculosis. Genome Res. 2001;11:547–554. doi: 10.1101/gr.166401. - DOI - PMC - PubMed
    1. Rajashekara G, Glasner JD, Glover DA, Splitter GA. Comparative whole-genome hybridization reveals genomic islands in Brucella species. J Bacteriol. 2004;186:5040–5051. doi: 10.1128/JB.186.15.5040-5051.2004. - DOI - PMC - PubMed
    1. Goguet de la Salmoniere YO, Kim CC, Tsolaki AG, Pym AS, Siegrist MS, Small PM. High-throughput method for detecting genomic-deletion polymorphisms. J Clin Microbiol. 2004;42:2913–2918. doi: 10.1128/JCM.42.7.2913-2918.2004. - DOI - PMC - PubMed
    1. Mostowy S, Onipede A, Gagneux S, Niemann S, Kremer K, Desmond EP, Kato-Maeda M, Behr M. Genomic analysis distinguishes Mycobacterium africanum. J Clin Microbiol. 2004;42:3594–3599. doi: 10.1128/JCM.42.8.3594-3599.2004. - DOI - PMC - PubMed

Publication types

LinkOut - more resources