Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2007 Jan;175(1):321-33.
doi: 10.1534/genetics.106.065359. Epub 2006 Oct 8.

An integrated in silico gene mapping strategy in inbred mice

Affiliations

An integrated in silico gene mapping strategy in inbred mice

Alessandra C L Cervino et al. Genetics. 2007 Jan.

Abstract

In recent years in silico analysis of common laboratory mice has been introduced and subsequently applied, in slightly different ways, as a methodology for gene mapping. Previously we have demonstrated some limitation of the methodology due to sporadic genetic correlations across the genome. Here, we revisit the three main aspects that affect in silico analysis. First, we report on the use of marker maps: we compared our existing 20,000 SNP map to the newly released 140,000 SNP map. Second, we investigated the effect of varying strain numbers on power to map QTL. Third, we introduced a novel statistical approach: a cladistic analysis, which is well suited for mouse genetics and has increased flexibility over existing in silico approaches. We have found that in our examples of complex traits, in silico analysis by itself does fail to uniquely identify quantitative trait gene (QTG)-containing regions. However, when combined with additional information, it may significantly help to prioritize candidate genes. We therefore recommend using an integrated work flow that uses other genomic information such as linkage regions, regions of shared ancestry, and gene expression information to obtain a list of candidate genes from the genome.

PubMed Disclaimer

Figures

F<sc>igure</sc> 1.—
Figure 1.—
Comparison between the 20K map (top, A and B) and the 140K map (bottom, A and B). (A) Points represent the association between the albinism phenotype and individual SNPs. Colors represent different chromosomes. Tyrosinase is located on chromosome 7. (B) Points represent the association between the albinism phenotype and haplotypes of length 3 using cladistic analysis. Colors represent different chromosomes. Tyrosinase is located on chromosome 7.
F<sc>igure</sc> 1.—
Figure 1.—
Comparison between the 20K map (top, A and B) and the 140K map (bottom, A and B). (A) Points represent the association between the albinism phenotype and individual SNPs. Colors represent different chromosomes. Tyrosinase is located on chromosome 7. (B) Points represent the association between the albinism phenotype and haplotypes of length 3 using cladistic analysis. Colors represent different chromosomes. Tyrosinase is located on chromosome 7.
F<sc>igure</sc> 2.—
Figure 2.—
Comparison between the 20K map (top, A and B) and the 140K map (bottom, A and B). (A) Points represent the association between the albinism phenotype and individual SNPs along chromosome 7. The position of the tyrosinase gene is represented by a red line. (B) Points represent the association between the albinism phenotype and haplotypes of length 3 using cladistic analysis along chromosome 7. The position of the tyrosinase gene is represented by a red line.
F<sc>igure</sc> 2.—
Figure 2.—
Comparison between the 20K map (top, A and B) and the 140K map (bottom, A and B). (A) Points represent the association between the albinism phenotype and individual SNPs along chromosome 7. The position of the tyrosinase gene is represented by a red line. (B) Points represent the association between the albinism phenotype and haplotypes of length 3 using cladistic analysis along chromosome 7. The position of the tyrosinase gene is represented by a red line.
F<sc>igure</sc> 3.—
Figure 3.—
Comparison between the 20K map (top, A and B) and the 140K map (bottom, A and B). (A) Points represent the association between the blindness phenotype and individual SNPs along chromosome 5. The position of the pde6b gene is represented by a red line. (B) Points represent the association between the blindness phenotype and haplotypes of length 3 using cladistic analysis along chromosome 5. The position of the pde6b gene is represented by a red line.
F<sc>igure</sc> 3.—
Figure 3.—
Comparison between the 20K map (top, A and B) and the 140K map (bottom, A and B). (A) Points represent the association between the blindness phenotype and individual SNPs along chromosome 5. The position of the pde6b gene is represented by a red line. (B) Points represent the association between the blindness phenotype and haplotypes of length 3 using cladistic analysis along chromosome 5. The position of the pde6b gene is represented by a red line.
F<sc>igure</sc> 4.—
Figure 4.—
Cladistic analysis based on a 10-SNP haplotype. Similar haplotypes are merged into groups and a test for association is performed at each level of the tree. Missing data are allowed as illustrated here.“〈NA〉” indicates missing genotype.
F<sc>igure</sc> 5.—
Figure 5.—
Comparison of the different algorithms in estimating gene position in a Mendelian trait. (A) P-value distribution using a single-marker analysis (S140). (B) Cladistic analysis with a 3-SNP haplotype (C140.3). (C) Cladistic analysis with a 10-SNP haplotype (C140.10). (D) Cladistic analysis of length 100 kb (C140.sli). (E) C140.3 after applying a smoother. (F) C140.10 after applying a smoother. (G) C140.sli after applying a smoother. The x-axis represents the base-pair position along chromosome 7; the underlying gene for albinism, tyrosinase, is situated at 74.5 Mb.
F<sc>igure</sc> 5.—
Figure 5.—
Comparison of the different algorithms in estimating gene position in a Mendelian trait. (A) P-value distribution using a single-marker analysis (S140). (B) Cladistic analysis with a 3-SNP haplotype (C140.3). (C) Cladistic analysis with a 10-SNP haplotype (C140.10). (D) Cladistic analysis of length 100 kb (C140.sli). (E) C140.3 after applying a smoother. (F) C140.10 after applying a smoother. (G) C140.sli after applying a smoother. The x-axis represents the base-pair position along chromosome 7; the underlying gene for albinism, tyrosinase, is situated at 74.5 Mb.
F<sc>igure</sc> 6.—
Figure 6.—
Comparison of the different algorithms in estimating gene position in a complex trait. (A) P-value distribution using a single-marker analysis (S140). (B) Cladistic analysis with a 3-SNP haplotype (C140.3). (C) Cladistic analysis with a 10-SNP haplotype (C140.10). (D) Cladistic analysis of length 100 kb (C140.sli). (E) C140.3 after applying a smoother. (F) C140.10 after applying a smoother. (G) C140.sli after applying a smoother. The x-axis represents the base-pair position along chromosome 1; one contributing gene for cholesterol, ApoA2, is situated at 171.3 Mb and is represented by a vertical line.
F<sc>igure</sc> 6.—
Figure 6.—
Comparison of the different algorithms in estimating gene position in a complex trait. (A) P-value distribution using a single-marker analysis (S140). (B) Cladistic analysis with a 3-SNP haplotype (C140.3). (C) Cladistic analysis with a 10-SNP haplotype (C140.10). (D) Cladistic analysis of length 100 kb (C140.sli). (E) C140.3 after applying a smoother. (F) C140.10 after applying a smoother. (G) C140.sli after applying a smoother. The x-axis represents the base-pair position along chromosome 1; one contributing gene for cholesterol, ApoA2, is situated at 171.3 Mb and is represented by a vertical line.
F<sc>igure</sc> 7.—
Figure 7.—
An integrated strategy for narrowing down the genome to a list of candidate genes. Integrated work flow brings in information from linkage analysis, IBD blocks, in silico analysis, and gene expression to obtain a short list of candidate genes. This example is based on the cholesterol trait.

References

    1. Adams, D. J., E. T. Dermitzakis, T. Cox, J. Smith, R. Davies et al., 2005. Complex haplotypes, copy number polymorphisms and coding variation in two recently divergent mouse strains. Nat. Genet. 37: 532–536. - PubMed
    1. Cervino, A. C., G. Li, S. Edwards, J. Zhu, C. Laurie et al., 2005. Integrating QTL and high-density SNP analyses in mice to identify Insig2 as a susceptibility gene for plasma cholesterol levels. Genomics 86: 505–517. - PubMed
    1. Cervino, A. C., M. Gosink, M. Fallahi, B. Pascal, C. Mader et al., 2006. A comprehensive mouse IBD database for the efficient localization of quantitative trait loci. Mamm. Genome 17: 565–574. - PubMed
    1. Darvasi, A., 2001. In silico mapping of mouse quantitative trait loci. Science 294: 2423. - PubMed
    1. Durrant, C., and A. P. Morris, 2005. Linkage disequilibrium mapping via cladistic analysis of phase-unknown genotypes and inferred haplotypes in the Genetic Analysis Workshop 14 simulated data. BMC Genet. 6(Suppl. 1): S100. - PMC - PubMed

Publication types