Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025:2886:375-400.
doi: 10.1007/978-1-0716-4310-5_19.

GEMLI: Gene Expression Memory-Based Lineage Inference from Single-Cell RNA-Sequencing Datasets

Affiliations

GEMLI: Gene Expression Memory-Based Lineage Inference from Single-Cell RNA-Sequencing Datasets

A S Eisele et al. Methods Mol Biol. 2025.

Abstract

Gene expression memory-based lineage inference (GEMLI) is a computational tool allowing to predict cell lineages solely from single-cell RNA-sequencing (scRNA-seq) datasets and is publicly available as an R package on GitHub. GEMLI is based on the occurrence of gene expression memory, i.e., the gene-specific maintenance of expression levels through cell divisions. This represents a shift away from experimental lineage tracing techniques based on genetic marks or physical cell lineage separation and greatly eases and expands lineage annotation. GEMLI allows to study cell lineages during differentiation in development, homeostasis, and regeneration, as well as disease onset and progression in various physiological and pathological contexts. This makes it possible to dissect cell type-specific gene expression memory, to discriminate symmetric and asymmetric cell fate decisions, and to reconstruct individual multicellular structures from pooled scRNA-seq datasets. GEMLI is particularly promising for its ability to identify small lineages in human samples, a context in which no other lineage tracing methods are applicable. In this chapter, we provide a detailed protocol of the GEMLI R package usage on gene expression matrices derived from standard scRNA-seq on various platforms. We cover the use of the main function to predict cell lineages and how to adjust its parameters to different tasks. We also show how lineage information is extracted, visualized, and fine-tuned. Finally, we describe the use of the package's functions for the detailed analysis of the predicted cell lineages. This includes the analysis of gene expression memory, cell type composition of individual large lineages, and identification of lineages at the transition point between two cell types.

Keywords: Cell family; Clonality; Fate decisions; Gene expression memory; Gene expression stability; Lineage inference; Lineage tracing; Memory genes; Monotonically expressed genes; Multicellular structure.

PubMed Disclaimer

Similar articles

References

    1. Eisele AS, Tarbier M, Dormann AA et al (2022) Barcode-free prediction of cell lineages from scRNA-seq datasets. bioRxiv. PMID: 38553478. https://doi.org/10.1101/2022.09.20.508646
    1. Shaffer SM, Emert BL, Reyes Hueros RA et al (2020) Memory sequencing reveals heritable single-cell gene expression programs associated with distinct cellular behaviors. Cell 182:947–959 - DOI - PubMed - PMC
    1. Meir Z, Mukamel Z, Chomsky E et al (2020) Single-cell analysis of clonal maintenance of transcriptional and epigenetic states in cancer cells. Nat Genet 52:709–718 - DOI - PubMed - PMC
    1. Kimmerling RJ, Lee Szeto G, Li JW et al (2016) A microfluidic platform enabling single-cell RNA-seq of multigenerational lineages. Nat Commun 7:10220 - DOI - PubMed - PMC
    1. Kumar RM, Cahan P, Shalek AK et al (2014) Deconstructing transcriptional heterogeneity in pluripotent stem cells. Nature 516:56–61 - DOI - PubMed - PMC

LinkOut - more resources