Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
[Preprint]. 2023 Mar 1:2023.02.28.530327.
doi: 10.1101/2023.02.28.530327.

Modeling functional cell types in spike train data

Affiliations

Modeling functional cell types in spike train data

Daniel N Zdeblick et al. bioRxiv. .

Update in

  • Modeling functional cell types in spike train data.
    Zdeblick DN, Shea-Brown ET, Witten DM, Buice MA. Zdeblick DN, et al. PLoS Comput Biol. 2023 Oct 12;19(10):e1011509. doi: 10.1371/journal.pcbi.1011509. eCollection 2023 Oct. PLoS Comput Biol. 2023. PMID: 37824442 Free PMC article.

Abstract

A major goal of computational neuroscience is to build accurate models of the activity of neurons that can be used to interpret their function in circuits. Here, we explore using functional cell types to refine single-cell models by grouping them into functionally relevant classes. Formally, we define a hierarchical generative model for cell types, single-cell parameters, and neural responses, and then derive an expectation-maximization algorithm with variational inference that maximizes the likelihood of the neural recordings. We apply this "simultaneous" method to estimate cell types and fit single-cell models from simulated data, and find that it accurately recovers the ground truth parameters. We then apply our approach to in vitro neural recordings from neurons in mouse primary visual cortex, and find that it yields improved prediction of single-cell activity. We demonstrate that the discovered cell-type clusters are well separated and generalizable, and thus amenable to interpretation. We then compare discovered cluster memberships with locational, morphological, and transcriptomic data. Our findings reveal the potential to improve models of neural responses by explicitly allowing for shared functional properties across neurons.

PubMed Disclaimer

Figures

Fig 1.
Fig 1.
A: The Poisson GLM (2) used to model the spiking response of a single neuron. B: The generative model (11) for the simultaneous method.
Fig 2.
Fig 2.. Performance of sequential and simultaneous methods on simulated data.
A: The true stimulus filter βistim for each simulated neuron. B: The true cluster means μkself used to generate simulated datasets, and those estimated by the sequential and simultaneous methods, μ^kself, fit with the correct K = 5. C-F: Mean ± SEM over 50 simulated datasets of accuracy measures, as a function of σ and shown for both K = 3 and K = 5. When K = 3, the three clusters with leftmost μkself in panel B are used. For each condition (K and σ) and for each measure of accuracy, the simultaneous method’s performance is statistically significantly better than that of the sequential method, except for ARS with K = 3 and σ = 10−5/6 (evaluated using the Wilcoxon signed rank test: uncorrected P-value<0.002).
Fig 3.
Fig 3.. Model selection of K^.
Frequencies of K^ estimated via Bayesian information criterion over 50 simulated datasets with the same μk as in Figure 2, and σ = 10−2 (A) or 10−5/6 (B), the maximum value that does not result in degenerate simulations. Black lines indicate the true value of K. The summary below each plot reports the mean ± SEM of the estimated value of K^ across the 50 datasets, for each value of K and each method.
Fig 4.
Fig 4.. IVSCC Dataset: The simultaneous method explains the data with a smaller number of less overlapping clusters.
A: BIC for the simultaneous method over a range of K; BIC determines K^=12 as optimal. Each dot reports the result of running Algorithm 2 from a different random initialization. B: Cluster centers μk of self-interaction filters fit to data using the simultaneous method; shaded region is ±diag(Σk). Only the 9 clusters with at least 20 neurons are shown from the model with BIC-selected K = 12. C: By contrast, the standard BIC of a GMM fit to individually fitted self-interaction filters (the sequential method) suggests an optimal K^ of at least 19. Each dot reports the result of performing the GMM fit, (5), from a different random initialization. D: The clusters of self-interaction filters with at least 20 neurons found by the sequential approach with K = 12 are bunched up closer to the origin, such that the clusters overlap significantly.
Fig 5.
Fig 5.. IVSCC generalization performance: the simultaneous method produces single-cell models and clusterings that generalize better, especially when fitted to more neurons.
Additional details about this figure are available in Section S3. A: ANLL (lower is better) for each held-out neuron’s single-cell model, evaluated on responses to the test stimulus (Noise 2), using the MAP β^i (20) of the simultaneous method with (hyper)parameters K, λstim, Ω^K estimated from the training neurons, versus those found by the sequential method. Color encodes number of spikes for each neuron in the evaluation data. B: Median relative difference between methods in ANLL of held-out neurons, evaluated on responses to the test stimulus, as a function of how many neurons and how much data from each were used in training (more negative values indicate that the simultaneous method is better). White asterisks indicate a significant relative difference; white bars indicate adjacent cases of training data subselection where the relative differences were significantly different. Differences pooled across all vertically (horizontally) adjacent conditions showed a significant, p = 3 × 10−4, (p = 5 × 10−26) trend, with more presentations (neurons) yielding greater improvement by the simultaneous method. C: Same analysis as A, but with EVratio (see (23); higher values indicate that the simultaneous method is better). D: Same analysis as B, but with EVratio. Pooled vertical differences showed no significant trend (p > 0.1); horizontal differences showed a significant (p = 5 × 10−3) trend, with more neurons yielding greater improvement by the simultaneous method. E: Relative differences between methods of EVratio (shown in C) versus ANLL (shown in A); color encodes number of spikes. Neurons with many (few) spikes show only improved ANLL (EVratio) in the simultaneous method. F: Same analysis as B, but for the similarity of cluster assignments k^i between model fits with different held-out neurons, measured by ARS (more positive values indicate that the simultaneous method is better). Pooled vertical differences showed a significant (p = 5 × 10−2) trend, with fewer presentations yielding greater improvement by the simultaneous method; horizontal differences showed no significant trend (p > 0.1).
Fig 6.
Fig 6.. IVSCC metadata is related to discovered cell types.
Z-scored fraction of cells with an attribute in each cluster. Cluster identities in panel A (B) are the same as in Figure 4B (D), obtained using the simultaneous (sequential) method (fitted with BIC-selected K = 12 clusters, showing only clusters with at least 20 neurons). Attributes are spiny or aspiny dendrites, location (hemisphere and cortical layer), and Cre line. Note the ARS between the cluster and metadata labels in each title. Z-scores are calculated as Zi(a)=(p^i(a)p^i)/p^i(a)(1p^i(a))/N(a)+p^i(1p^i)/N, where p^i is the empirical probability that a cell is in cluster i and p^i(a) is the empirical probability that a cell with attribute a is in cluster i, N is the number of cells, and N(a) is the number of cells with attribute a.

References

    1. Gouwens NW, et al. Classification of electrophysiological and morphological types in mouse visual cortex. Neuroscience; 2018. Available from: 10.1101/368456. - DOI - PMC - PubMed
    1. Tasic B, et al. Adult mouse cortical cell taxonomy revealed by single cell transcriptomics. Nature Neuroscience. 2016;19(2):335–346. doi:10.1038/nn.4216. - DOI - PMC - PubMed
    1. Tripathy SJ, Toker L, Li B, Crichlow CL, Tebaykin D, Mancarci BO, et al. Transcriptomic correlates of neuron electrophysiological diversity. PLOS Computational Biology. 2017;13(10):e1005814. doi:10.1371/journal.pcbi.1005814. - DOI - PMC - PubMed
    1. Tasic B, et al. Shared and distinct transcriptomic cell types across neocortical areas. Nature. 2018;563(7729):72–78. doi:10.1038/s41586-018-0654-5. - DOI - PMC - PubMed
    1. Contreras D. Electrophysiological classes of neocortical neurons. Neural Networks. 2004;17(5):633–646. doi:10.1016/j.neunet.2004.04.003. - DOI - PubMed

Publication types

LinkOut - more resources