Fig. 3. Histone gene expression robustly identifies cycling cells.
a, Volcano plot showing differentially expressed genes between VASA-seq (right, positive values) and 10x Chromium (left, negative values). Genes that are always highly differentially expressed across timepoints and have a log2 fold change >4 and P < 0.001 are colored; purple color indicates non-polyadenylated, and orange color indicates polyadenylated genes. Many of the differentially expressed genes enriched in the VASA-seq dataset are histone genes. The statistical test used was a two-sided t-test, using uncorrected P values for multiple comparisons. b, Histogram showing the distribution of histone gene expression in VASA-seq compared to 10x Chromium. The overlayed dashed black line shows a bimodal Weibull distribution, and the dashed red line shows a single Weibull distribution. c, Histogram showing the distribution of histone gene expression in VASA-seq labeled with the estimated cell cycle phase using the expression of S and G2M genes for scoring. Detected histone expression in S-phase does not correlate with predictive cell cycle estimation. d, Cells are identified as cycling/S-phase (blue) and non-cycling (yellow) based on the total histone gene expression shown in Fig. 3b. UMAP of the VASA-seq embryonic atlas before (left panel) and after (right panel) removal of cell cycle genes. e, Cell type annotated UMAP of the aggregated VASA-seq dataset after removal of cell cycle genes. Each color and number represent a cell type, called manually based on marker gene expression for each Leiden cluster. Smaller panels (right) highlight cells sampled at each timepoint (E6.5, E7.5, E8.5 and E9.5) in black. In total, 40 different cell types were identified: 1-erythropoiesis (expansive, S-phase), 2-somites, 3-paraxial mesoderm, 4-intermediate mesoderm I, 5-caudal epiblast, 6-lateral plate mesoderm/intermediate mesoderm primordium, 7-spinal cord (differentiated neurons), 8-endothelium, 9-preplacodal/placodal region, 10-rhombomeres (hindbrain), 11-forebrain/hindbrain (isthmus), 12-epiblast (E7.5), 13-forebrain, 14-spinal cord (differentiated neurons), 15-neural crest, 16-allantois, 17-cranial mesoderm, 18-lateral plate mesoderm, 19-early caudal epiblast, 20-trophectoderm, 21-dorsal surface ectoderm, 22-anterior neural crest, 23-pharyngeal arches, 24-primitive erythroid progenitors, 25-caudal epiblast (E7.5), 26-endoderm, 27-visceral endoderm, 28-first heart field, 29-myofibroblasts, 30-epiblast (E6.5), 31-spinal cord (cycling progenitors), 32-pharyngeal arches, 33-primitive heart tube, 34-outflow tract, 35-secondary heart field, 36-intermediate mesoderm I, 37-parietal endoderm, 38-pro-nephros, 39-mesodermal unknown and 40-node. f, Percentage of cycling/S-phase cells per cell type. Average number of cycling cells is 65% (black line) ± 11% (red dashed lines) across all cell types. Late primitive erythrocytes (green) diverge from the average by having 84% of the cells in S-phase. Node cells (brown) and primitive heart tube (pink) have much fewer cells in S-phase—20% and 30%, respectively. g, Plots showing the percentage of cells in S-phase per cell type that spans over three timepoints (E6.5–E8.5, left panel; E7.5–E9.5, right panel). Trophectoderm (light brown) had an unchanged pattern, whereas endothelium (green), allantois (pink), lateral plate mesoderm (blue), endoderm (light green), visceral endoderm (light blue) and outflow tract (dark pink) all had a decreasing fraction of cycling cells as time passes. Allantois has the biggest difference, with 38% cycling in E9.5 compared to 79% in E7.5. The points are the mean and standard error of the mean obtained by bootstrapping the percentage of cells in S-phase for each equivalent cluster and biotype 1,000 times. The number of cells were: n = 140, 32 for cluster 20 and 27 at timepoint E6.5, respectively; n = 105, 340, 314, 392, 156, 171 and 69 for clusters 8, 16, 18, 20, 26, 27 and 34 at E7.5, respectively; n = 810, 552, 331, 117, 284, 339 and 121 for clusters 8, 16, 18, 20, 26, 27 and 34 at E8.5, respectively; and n = 345, 78, 30, 117 and 71 for clusters 8, 16, 18, 26 and 34 at E9.5, respectively. h, Heatmap showing differentially expressed single annotated histone genes. Rows display genes, and columns display cell types. Cell type categories/germ layers can be identified by color above the heat map. i, Example of marker histone gene expression plotted on the UMAP; red represents high expression, and blue represents low expression. H2bc15 is highly expressed in most cell types but absent in certain cell types. H2bc1 is solely expressed in the early epiblast (E6.5, cell type 30), whereas H2bu2 is specific to the ectoderm germ layer and epiblasts (cell types 12 and 30).