Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2021 Aug 4;11(17):11689-11699.
doi: 10.1002/ece3.7874. eCollection 2021 Sep.

Information in morphological characters

Affiliations

Information in morphological characters

Congyu Yu et al. Ecol Evol. .

Abstract

The construction of morphological character matrices is central to paleontological systematic study, which extracts paleontological information from fossils. Although the word information has been repeatedly mentioned in a wide array of paleontological systematic studies, its meaning has rarely been clarified nor specifically defined. It is important, however, to establish a standard to measure paleontological information because fossils are hardly complete, rendering the recognition of homologous and homoplastic structures difficult. Here, based on information theory, we show the deep connections between paleontological systematic study and communication system engineering. Information is defined as the decrease of uncertainty and it is the information in morphological characters that allows distinguishing operational taxonomic units (OTUs) and reconstructing evolutionary history. We propose that concepts in communication system engineering such as source coding and channel coding, correspond to the construction of diagnostic features and the entire character matrices in paleontological studies. The two coding strategies should be distinguished following typical communication system engineering, because they serve dual purposes. With character matrices from six different vertebrate groups, we analyzed their information properties including source entropy, mutual information, and channel capacity. Estimation of channel capacity shows character saturation of all matrices in transmitting paleontological information, indicating that, due to the presence of noise, oversampling characters not only increases the burden in character scoring, but also may decrease quality of matrices. We further test the use of information entropy, which measures how informative a variable is, as a character weighting criterion in parsimony-based systematic studies. The results show high consistency with existing knowledge with both good resolution and interpretability.

Keywords: character weighting; information theory; morphology; systematics.

PubMed Disclaimer

Conflict of interest statement

All authors declare that they have no conflicts of interest.

Figures

FIGURE 1
FIGURE 1
(a) Typical communication system modified from Shannon (1948); (b) Paleontological systematic studies in abstract
FIGURE 2
FIGURE 2
(a) Information entropy distribution of a binary variable; (b) information entropy of characters in different morphological character matrices, x‐axis: number of character states, y‐axis: average information entropy; (c) joint information entropy in different morphological character matrices, only first 40 characters are shown to save space; (d) mutual information distribution heat map in different morphological character matrices, orange: high mutual information, blue: low mutual information, anatomical parts abbreviations, C: crania, F: forelimb and pectoral girdle, H: hindlimb and pelvic girdle, A: axial elements, O: others
FIGURE 3
FIGURE 3
(a) noise power distribution in taxa domain; (b) noise power distribution in character domain; (c) channel capacity and bandwidth in character matrices
FIGURE 4
FIGURE 4
Tree results of Ceratopsia. (a) Equal weighting; (b) implied weighting (k = 3); (c) implied weighting (k = 12); (d) information entropy weighting. Colored columns on the right side of trees represent OTUs and their color gradients correspond to the taxa order in the original character matrix

References

    1. Baron, M. G., Norman, D. B., & Barrett, P. M. (2017a). A new hypothesis of dinosaur relationships and early dinosaur evolution. Nature, 543(7646), 501–506. 10.1038/nature21700 - DOI - PubMed
    1. Baron, M. G., Norman, D. B., Barrett, P. (2017b). Baron et al. reply. Nature, 551(7678), E4–E5. 10.1038/nature24012 - DOI - PubMed
    1. Brocklehurst, N., & Benevento, G. L. (2020). Dental characters used in phylogenetic analyses of mammals show higher rates of evolution, but not reduced independence. PeerJ, 8, e8744. 10.7717/peerj.8744 - DOI - PMC - PubMed
    1. Catalano, S. A., Goloboff, P. A., & Giannini, N. P. (2010). Phylogenetic morphometrics (I): The use of landmark data in a phylogenetic framework. Cladistics, 26(5), 539–549. 10.1111/j.1096-0031.2010.00302.x - DOI - PubMed
    1. Congreve, C. R., & Lamsdell, J. C. (2016). Implied weighting and its utility in palaeontological datasets: A study using modelled phylogenetic matrices. Palaeontology, 59(3), 447–462. 10.1111/pala.12236 - DOI

LinkOut - more resources