Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 May 11;2(5):100131.
doi: 10.1016/j.xgen.2022.100131.

Lumping versus splitting: How to approach defining a disease to enable accurate genomic curation

Affiliations

Lumping versus splitting: How to approach defining a disease to enable accurate genomic curation

Courtney Thaxton et al. Cell Genom. .

Abstract

The dilemma of how to categorize and classify diseases has been debated for centuries. The field of medical genetics has historically approached nosology based on clinical phenotypes observed in patients and families. Advances in genomic sequencing and understanding of genetic contributions to disease often provoke a need to reassess these classifications. The Clinical Genome Resource (ClinGen) has developed frameworks to classify the strength of evidence underlying monogenic gene-disease relationships, variant pathogenicity, and clinical actionability. It is therefore necessary to define the disease entity being evaluated, which can be challenging for genes associated with multiple conditions and/or a broad phenotypic spectrum. We therefore developed criteria to guide "lumping and splitting" decisions and improve consistency in defining monogenic gene-disease relationships. Here, we outline the precuration process, the lumping and splitting guidelines with examples, and describe the implications for clinical diagnosis, informatics, and care management.

PubMed Disclaimer

Conflict of interest statement

DECLARATION OF INTERESTS H.L.R. is a member of the Cell Genomics advisory board. The other authors declare no competing interests.

Figures

None
Graphical abstract
Figure 1
Figure 1
Weighing the evidence The four criteria for lumping and splitting should be assessed and weighed as a balance. If only a single assertion has been made in the literature and/or databases of monogenic diseases (e.g., OMIM, Monarch [Mondo ontology], Orphanet), it is possible that no further steps are needed. However, some groups find it useful to precurate genes with a single disease entity to discuss disease nomenclature and review any new evidence or assertions that have not yet been formally captured in nosological and ontological resources. If multiple distinct disease entities have been asserted, then the curator will evaluate the evidence for the molecular mechanism, phenotypic expressivity, and inheritance pattern to determine whether to lump certain entities for curation as a syndrome or an organ-specific complex phenotype or to keep them separate as split disease entities. If the evidence is equally balanced between lumping or splitting, then experts should be consulted to compare the relevant weight of each piece of evidence.
Figure 2
Figure 2
Lumping and splitting conundrum: defining a disease entity When assessing the involvement of any given gene in disease, several possibilities for a disease entity may exist, including the following: (1) an isolated phenotype, in which 1 phenotype (or phenotypic feature) arises in a single organ system with no risk of other phenotypes arising in that organ system or elsewhere; (2) variable phenotypes in a single organ, in which multiple related phenotypes (or phenotypic features) arise in a single organ system; or (3) a syndromic phenotype, in which multiple, varying phenotypes occur in multiple organs. Assessing the appropriate disease entity or entities to curate can be challenging, thus requiring the use of defined criteria.

References

    1. Darwin . University of Cambridge; 1857. Darwin Project.https://www.darwinproject.ac.uk/letter/DCP-LETT-2130.xml#back-mark-Lfoot.f6
    1. McKusick V.A. On lumpers and splitters, or the nosology of genetic disease. Perspect. Biol. Med. 1969;12:298–312. doi: 10.1353/pbm.1969.0039. - DOI - PubMed
    1. Amberger J.S., Bocchini C.A., Scott A.F., Hamosh A. OMIM.org: leveraging knowledge across phenotype-gene relationships. Nucleic Acids Res. 2019;47:D1038–D1043. doi: 10.1093/nar/gky1151. - DOI - PMC - PubMed
    1. Boycott K.M., Rath A., Chong J.X., Hartley T., Alkuraya F.S., Baynam G., Brookes A.J., Brudno M., Carracedo A., den Dunnen J.T., et al. International cooperation to enable the diagnosis of all rare genetic diseases. Am. J. Hum. Genet. 2017;100:695–705. doi: 10.1016/j.ajhg.2017.04.003. - DOI - PMC - PubMed
    1. Biesecker L.G. Lumping and splitting: molecular biology in the genetics clinic. Clin. Genet. 1998;53:3–7. doi: 10.1034/j.1399-0004.1998.531530102.x. - DOI - PubMed