Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2010 Jun 14:11:318.
doi: 10.1186/1471-2105-11-318.

CONAN: copy number variation analysis software for genome-wide association studies

Affiliations

CONAN: copy number variation analysis software for genome-wide association studies

Lukas Forer et al. BMC Bioinformatics. .

Abstract

Background: Genome-wide association studies (GWAS) based on single nucleotide polymorphisms (SNPs) revolutionized our perception of the genetic regulation of complex traits and diseases. Copy number variations (CNVs) promise to shed additional light on the genetic basis of monogenic as well as complex diseases and phenotypes. Indeed, the number of detected associations between CNVs and certain phenotypes are constantly increasing. However, while several software packages support the determination of CNVs from SNP chip data, the downstream statistical inference of CNV-phenotype associations is still subject to complicated and inefficient in-house solutions, thus strongly limiting the performance of GWAS based on CNVs.

Results: CONAN is a freely available client-server software solution which provides an intuitive graphical user interface for categorizing, analyzing and associating CNVs with phenotypes. Moreover, CONAN assists the evaluation process by visualizing detected associations via Manhattan plots in order to enable a rapid identification of genome-wide significant CNV regions. Various file formats including the information on CNVs in population samples are supported as input data.

Conclusions: CONAN facilitates the performance of GWAS based on CNVs and the visual analysis of calculated results. CONAN provides a rapid, valid and straightforward software solution to identify genetic variation underlying the 'missing' heritability for complex traits that remains unexplained by recent GWAS. The freely available software can be downloaded at http://genepi-conan.i-med.ac.at.

PubMed Disclaimer

Figures

Figure 1
Figure 1
Software Architecture. (A) In traditional approaches the application requests data, processes it locally and uploads the results. Thus, additional amount of data transfer reduce the performance. (B) The two-tier architecture of CONAN outsources all data intensive algorithms on the database server. The client invokes stored procedures to execute them on the database server itself; thus no upload of data is required and the client retrieves only the informative results.
Figure 2
Figure 2
CNVR Determination. (A) The boundaries of a sub-CNVR are determined using the start and end SNP of each CNV. (B) The basic algorithm designates a sub-CNVR as a CNVR if its frequency is greater than the threshold. (C) The extended algorithm merges consecutive sub-CNVRs and builds a single one on their basis.
Figure 3
Figure 3
Extended CNVR-Determination. The longest CNV of a subject in the CNVR with a percentage greater than the threshold (e.g. 50%) is used to define the final state.
Figure 4
Figure 4
Overview of steps. The CONAN analysis process is divided into three main steps: data import, data analysis, and data visualization.
Figure 5
Figure 5
Graphical User Interface. All imported datasets, their calculated CNV regions and associated analyses are organized in a tree structure. All CNV regions of the current selected dataset or association analysis are shown as a table: Genome-wide significant CNVRs are highlighted in red and regions with already know associations from SNP-GWAS http://www.genome.gov/gwastudies/ are highlighted in yellow. Information about a specified CNVR is listed in a separate dialog.
Figure 6
Figure 6
Data Visualization. CONAN supports the analysis process by several visualizations: (A) Visualization of the distribution of all detected CNV regions on each chromosome. (B) Visualization of associations via Manhattan plot enables a rapid identification of genome-wide significant CNVRs.

Similar articles

Cited by

References

    1. Johnson AD, O'Donnell CJ. An open access database of genome-wide association results. BMC Med Genet. 2009;10:6. doi: 10.1186/1471-2350-10-6. - DOI - PMC - PubMed
    1. McCarroll SA. Extending genome-wide association studies to copy-number variation. Hum Mol Genet. 2008;17:R135–R142. doi: 10.1093/hmg/ddn282. - DOI - PubMed
    1. Wheeler DA, Srinivasan M, Egholm M, Shen Y, Chen L, McGuire A, He W, Chen YJ, Makhijani V, Roth GT. The complete genome of an individual by massively parallel DNA sequencing. Nature. 2008;452:872–876. doi: 10.1038/nature06884. - DOI - PubMed
    1. Redon R, Ishikawa S, Fitch KR, Feuk L, Perry GH, Andrews TD, Fiegler H, Shapero MH, Carson AR, Chen W. Global variation in copy number in the human genome. Nature. 2006;444:444–454. doi: 10.1038/nature05329. - DOI - PMC - PubMed
    1. McCarroll SA, Huett A, Kuballa P, Chilewski SD, Landry A, Goyette P, Zody MC, Hall JL, Brant SR, Cho JH. Deletion polymorphism upstream of IRGM associated with altered IRGM expression and Crohn's disease. Nat Genet. 2008;40:1107–1112. doi: 10.1038/ng.215. - DOI - PMC - PubMed

Publication types

LinkOut - more resources