Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2008 Apr;91(4):307-14.
doi: 10.1016/j.ygeno.2007.12.008. Epub 2008 Feb 12.

Natural variation in four human collagen genes across an ethnically diverse population

Affiliations

Natural variation in four human collagen genes across an ethnically diverse population

Ting-Fung Chan et al. Genomics. 2008 Apr.

Abstract

Collagens are members of one of the most important families of structural proteins in higher organisms. There are 28 types of collagens encoded by 43 genes in humans that fall into several different functional protein classes. Mutations in the major fibrillar collagen genes lead to osteogenesis imperfecta (COL1A1 and COL1A2 encoding the chains of Type I collagen), chondrodysplasias (COL2A1 encoding the chains of Type II collagen), and vascular Ehlers-Danlos syndrome (COL3A1 encoding the chains of Type III collagen). Over the past 2 decades, mutations in these collagen genes have been catalogued, in hopes of understanding the molecular etiology of diseases caused by these mutations, characterizing the genotype-phenotype relationships, and developing robust models predicting the molecular and clinical outcomes. To achieve these goals better, it is necessary to understand the natural patterns of variation in collagen genes in human populations. We screened exons, flanking intronic regions, and conserved noncoding regions for variations in COL1A1, COL1A2, COL2A1, and COL3A1 in 48 individuals from each of four ethnically diverse populations. We identified 459 single-nucleotide polymorphisms (SNPs), more than half of which were novel and not found in public databases. Of the 52 SNPs found in coding regions, 15 caused amino acid substitutions while 37 did not. Although the four collagens have similar gene and protein structures, they have different molecular evolutionary characteristics. For example, COL1A1 appears to have been under substantially stronger negative selection than the rest. Phylogenetic analysis also suggests that the four genes have very different evolutionary histories among the different ethnic groups. Our observations suggest that the study of collagen mutations and their relationships with disease phenotypes should be performed in the context of the genetic background of the subjects.

PubMed Disclaimer

Figures

Figure 1
Figure 1. A visual genotype map of all common alleles (MAF ≥ 0.05) in the four collagen genes
SNPs in each gene were arranged from the 5′ (left) to 3′ (right). Each row represents an individual in one of the four ethnic groups: African American (black), European American (blue), Mexican American (green), and Chinese American (red). Homozygous major alleles are represented in blue, heterozygous loci in green, and homozygous minor alleles in yellow. Within each ethnic group, individuals were further clustered by samples using the VG2 program available from the Nickerson group at SeattleSNP (http://pga.gs.washington.edu/VG2.html).
Figure 2
Figure 2. Phylogenetic trees for the four collagen genes
Genotype data from Figure 1 were used to determine the four-way Reynolds distances and phylogenetic trees were generated using the PHYLIP software package.

References

    1. Chu ML, Prockop DJ. Collagen: Gene Structure. In: Royce PM, Steinmann B, editors. Connective Tissue and Its Heritable Disorders: Molecular, Genetic, and Medical Aspects. Wiley-Liss; New York, NY: 2002. pp. 223–248.
    1. Kielty CM, Grant ME. The Collagen Family: Structure, Assembly, and Organization in the Extracellular Matrix. In: Royce PM, Steinmann B, editors. Connective Tissue and Its Heritable Disorders: Molecular, Genetic, and Medical Aspects. Wiley-Liss; New York, NY: 2002. pp. 159–221.
    1. Marini JC, Forlino A, Cabral WA, Barnes AM, San Antonio JD, Milgrom S, Hyland JC, Korkko J, Prockop DJ, De Paepe A, Coucke P, Symoens S, Glorieux FH, Roughley PJ, Lund AM, Kuurila-Svahn K, Hartikka H, Cohn DH, Krakow D, Mottes M, Schwarze U, Chen D, Yang K, Kuslich C, Troendle J, Dalgleish R, Byers PH. Consortium for osteogenesis imperfecta mutations in the helical domain of type I collagen: regions rich in lethal mutations align with collagen binding sites for integrins and proteoglycans. Hum Mutat. 2007;28:209–21. - PMC - PubMed
    1. Pepin M, Schwarze U, Superti-Furga A, Byers PH. Clinical and genetic features of Ehlers-Danlos syndrome type IV, the vascular type. N Engl J Med. 2000;342:673–80. - PubMed
    1. Frazer KA, Pachter L, Poliakov A, Rubin EM, Dubchak I. VISTA: computational tools for comparative genomics. Nucleic Acids Res. 2004;32:W273–9. - PMC - PubMed

Publication types

LinkOut - more resources