Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2018 Apr 9;131(7):jcs203950.
doi: 10.1242/jcs.203950.

The triple helix of collagens - an ancient protein structure that enabled animal multicellularity and tissue evolution

Affiliations
Review

The triple helix of collagens - an ancient protein structure that enabled animal multicellularity and tissue evolution

Aaron L Fidler et al. J Cell Sci. .

Abstract

The cellular microenvironment, characterized by an extracellular matrix (ECM), played an essential role in the transition from unicellularity to multicellularity in animals (metazoans), and in the subsequent evolution of diverse animal tissues and organs. A major ECM component are members of the collagen superfamily -comprising 28 types in vertebrates - that exist in diverse supramolecular assemblies ranging from networks to fibrils. Each assembly is characterized by a hallmark feature, a protein structure called a triple helix. A current gap in knowledge is understanding the mechanisms of how the triple helix encodes and utilizes information in building scaffolds on the outside of cells. Type IV collagen, recently revealed as the evolutionarily most ancient member of the collagen superfamily, serves as an archetype for a fresh view of fundamental structural features of a triple helix that underlie the diversity of biological activities of collagens. In this Opinion, we argue that the triple helix is a protein structure of fundamental importance in building the extracellular matrix, which enabled animal multicellularity and tissue evolution.

Keywords: Cell biology; Collagen; Evolution; Extracellular matrix; Multicellularity; Triple helix.

PubMed Disclaimer

Conflict of interest statement

Competing interestsThe authors declare no competing or financial interests.

Figures

Fig. 1.
Fig. 1.
Collagen IV is the evolutionarily most ancient of the vertebrate collagen superfamily. The collagen superfamily has 28 members in vertebrates, each comprising three of 46 α-chains – the basis of the diverse suprastructures distributed across different tissues – invertebrates have less members. A hallmark feature of all collagens is the triple helix, which is characterized by three intertwined polypeptide chains. A collagen IV-like gene probably first appeared in the last common ancestor (LCA) to filastereans, choanoflagellates and animals. The phylogenetic distribution suggests that collagen IV played a critical role in the transition of unicellular organisms to multicellular animals. FACITs, fibril-associated collagens with interrupted triple helices; MACITs, membrane-associated collagens with interrupted triple helices; Misc., miscellaneous.
Fig. 2.
Fig. 2.
Building a collagen triple helix and encoding information. (A) The building blocks (polypeptide chains) of the triple helix are left-handed polyproline type-II helices, i.e. non-extensible structures that have structural and mechanical roles in the ECM of plants and some animals. Replacement of every third proline (Pro) residue with a glycine Gly) residue results in increased flexibility. (B) Three left-handed superhelices wind together and pack tightly owing to these Gly residues, thereby forming a right-handed triple helix. Once associated, the non-extensibility of the structure is restored, and the combination of three chains results in three binding modes (one, two or three chains) with three levels of variation to these modes, i.e. level 1: 20 variable aa residues, level 2: post-translational modifications (PTMs) and, level 3: chain stagger. Together, these specify the numerous possible binding motifs that are positioned along the length of the triple helix of any of the 28 collagen types to bind various macromolecules.
Fig. 3.
Fig. 3.
The triple helix of collagen IV scaffolds encodes information for the tethering of macromolecules. (A) Assembly begins with collagen IV monomers (α-chains) transcribed and assembled into protomers inside of many cell types. Protomers are composed of three intertwining α-chains that form a triple helix. On the outside of cells, the NC1- and 7S-domains direct the assembly of protomers into network structures of higher order crosslinked by sulfilimine and lysyl oxidase-like protein 2 (LOXL2). The higher Cl concentration of the ECM induces NC1-domain-directed oligomerization of protomers that form ‘smart’ scaffolds. (B) Once assembled, collagen IV networks function as smart scaffolds, bestowing BMs that underlie and surround cells of several capabilities. Vast amounts of structural information is encoded in motifs located at specific sites along the surface of the triple helix to tether macromolecules. The mode of variation is based on 20 variable aa residues on a single chain, or the combination of one, two or three chains, post-translational modifications (PTMs) and chain stagger (see Fig. 2). The tethering at specific sites spatially organizes molecules along the triple helix, resulting in a populated scaffold within the BM that provides tensile strength to tissues, and influences cell behavior, adhesion and migration during tissue development and regeneration. Partly modified, with permission (Protein Science) from Brown et al., 2017.
Fig. 4.
Fig. 4.
Mutations in the triple helix can result in genetic diseases. (A) A single missense mutation of a glycine to another residue results in disruption of the triple helix's function. (B) Glycine mutations are responsible for a number of genetic diseases involving the kidney, teeth, muscle, joints, cartilage, brain, skin, vasculature, bone, inner ear and eye in humans (see Table 1). (B) Diseases and disorders due to replacement of glycine residue(s) in the triple helix of collagens. The type of collagen is indicated in bold Roman numerals. AVN, avascular necrosis of femoral head; CCDD, congenital cranial dysinnervation disorder; CSVD, cerebral small-vessel disease; HANAC, hereditary angiopathy with nephropathy, aneurysms and muscle cramps; UCMD, Ullrich congenital muscular dystrophy; LCPD, Legg–Calvé–Perthes disease; OSMED, otospondylomegaepiphyseal dysplasia; syn, syndrome.
Fig. 5.
Fig. 5.
The fundamental importance of a triple helix in enabling animal multicellularity and tissue evolution. (A) The unique structure and vast encoding properties of the collagen triple helix outside the cell (left) evokes an analogy to the DNA double helix inside the cell (right). (B) The triple helix protein structure was present in unicellular organisms and was co-opted in the form of collagen IV, enabling the transition to multicellular animals. The triple helix was also adapted as a key feature of all members of the diverse collagen superfamily in the ECM, enabling tissue evolution. LCA, last common ancestor.

References

    1. Abedin M. and King N. (2010). Diverse evolutionary paths to cell adhesion. 20, 734-742. 10.1016/j.tcb.2010.08.002 - DOI - PMC - PubMed
    1. Adzhubei A. A., Sternberg M. J. E. and Makarov A. A. (2013). Polyproline-II helix in proteins: structure and function. 425, 2100-2132. 10.1016/j.jmb.2013.03.018 - DOI - PubMed
    1. Ala-Kokko L., Baldwin C. T., Moskowitz R. W. and Prockop D. J. (1990). Single base mutation in the type II procollagen gene (COL2A1) as a cause of primary osteoarthritis associated with a mild chondrodysplasia. 87, 6565-6568. 10.1073/pnas.87.17.6565 - DOI - PMC - PubMed
    1. Aldahmesh M. A., Khan A. O., Mohamed J. Y., Levin A. V., Wuthisiri W., Lynch S., Mccreery K. and Alkuraya F. S. (2013). No evidence for locus heterogeneity in Knobloch syndrome. 50, 565-566. 10.1136/jmedgenet-2013-101755 - DOI - PubMed
    1. Añazco C., López-Jiménez A. J., Rafi M., Vega-Montoto L., Zhang M.-Z., Hudson B. G. and Vanacore R. M. (2016). Lysyl oxidase-like-2 cross-links collagen IV of glomerular basement membrane. 291, 25999-26012. 10.1074/jbc.M116.738856 - DOI - PMC - PubMed

Publication types

MeSH terms

Substances