Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2026 Feb;23(2):338-349.
doi: 10.1038/s41592-025-02919-5. Epub 2025 Dec 31.

cellSTAAR: incorporating single-cell-sequencing-based functional data to boost power in rare variant association testing of noncoding regions

Eric Van Buren  1 Yi Zhang  2   3   4 Xihao Li  5   6 Margaret Sunitha Selvaraj  7   8   9 Zilin Li  10 Hufeng Zhou  1 Nicholette D Palmer  11 Donna K Arnett  12 John Blangero  13 Eric Boerwinkle  14   15 Brian E Cade  8   16   17 Jenna C Carlson  18 April P Carson  19 Yii-Der Ida Chen  20 Joanne Curran  13 Ravindranath Duggirala  13 Myriam Fornage  14   21 Nora Franceschini  6   22 Misa Graff  22 Charles Gu  23 Xiuqing Guo  20 Jiang He  24 Nancy Heard-Cosa  25   26 Lifang Hou  27 Yi-Jen Hung  28 Rita R Kalyani  29 Sharon L R Kardia  30 Eimear Kenny  31   32   33   34 Charles Kooperberg  35 Brian G Kral  29 Leslie Lange  36 Dan Levy  26 Changwei Li  24   37 Simin Liu  38 Donald Lloyd-Jones  39 Ruth J F Loos  40   41 Ani W Manichaikul  42 Lisa Warsinger Martin  43 Rasika Mathias  44 Ryan L Minster  45 Braxton D Mitchell  46 Josyf C Mychaleckyj  42 Take Naseri  47   48 Kari North  22   49   50   51 Jeff O'Connell  46 James A Perry  46 Patricia A Peyser  30 Bruce M Psaty  52   53   54 Laura M Raffield  6 Ramachandran S Vasan  26   55 Susan Redline  16   17 Alex P Reiner  35   53 Stephen S Rich  42 Jennifer A Smith  30   56 Brian Spitzer  57 Hua Tang  58 Kent D Taylor  20 Russell Tracy  59 Satupa'itea Viali  60   61 Lisa Yanek  29 Wei Zhao  30   56 NHLBI Trans-Omics for Precision Medicine (TOPMed) ConsortiumJerome I Rotter  20 Gina M Peloso  62 Pradeep Natarajan  7   8   9 Xihong Lin  63   64
Collaborators, Affiliations

cellSTAAR: incorporating single-cell-sequencing-based functional data to boost power in rare variant association testing of noncoding regions

Eric Van Buren et al. Nat Methods. 2026 Feb.

Abstract

Understanding how rare genetic variants influence complex traits remains a major challenge, particularly when these variants lie in noncoding regions of the genome. The effects of variants within candidate cis-regulatory elements (cCREs) often depend on the cell type, making interpretation difficult. Here we introduce cellSTAAR, which integrates whole-genome sequencing data with single-cell assay for transposase-accessible chromatin using sequencing data to capture variability in chromatin accessibility across cell types via the construction of cell-type-specific functional annotations and regulatory elements. To reflect the uncertainty in cCRE-gene linking, cellSTAAR uses a comprehensive strategy to link cCREs to their target genes. We applied cellSTAAR to data from the Trans-Omics for Precision Medicine consortium (n ≈ 60,000) and replicated our findings using the UK Biobank (n ≈ 190,000). Across four lipid traits, cellSTAAR improved the detection of biologically meaningful associations and enhanced biological interpretability. These results demonstrate the potential of cell-type-aware approaches to boost discovery in rare variant whole-genome sequencing association studies.

PubMed Disclaimer

Conflict of interest statement

Competing interests: E.K. has received personal fees from Regeneron Pharmaceuticals, 23&Me, Allelica and Illumina; has received research funding from Allelica; and serves on the advisory boards for Encompass Biosciences, Overtone and Galateo Bio. P.N. reports research grants from Allelica, Amgen, Apple, Boston Scientific, Cleerly, Genentech/Roche, Ionis, Novartis and Silence Therapeutics, personal fees from AIRNA, Allelica, Apple, AstraZeneca, Bain Capital, Blackstone Life Sciences, Bristol Myers Squibb, Creative Education Concepts, CRISPR Therapeutics, Eli Lilly & Co, Esperion Therapeutics, Foresite Capital, Foresite Labs, Genentech/Roche, GV, HeartFlow, Magnet Biomedicine, Merck, Novartis, Novo Nordisk, TenSixteen Bio and Tourmaline Bio; equity in Bolt, Candela, Mercury, MyOme, Parameter Health, Preciseli and TenSixteen Bio; royalties from Recora for intensive cardiac rehabilitation; and spousal employment at Vertex Pharmaceuticals, all unrelated to the present work. B.M.P. serves on the Steering Committee of the Yale Open Data Access Project funded by Johnson & Johnson. L.M.R. is a consultant for the TOPMed Administrative Coordinating Center (ACC) through Westat. X. Lin is a consultant of AbbVie Pharmaceuticals and Verily Life Sciences. The other authors declare no competing interests.

References

    1. Taliun, D. et al. Sequencing of 53,831 diverse genomes from the NHLBI TOPMed Program. Nature 590, 290–299 (2021). - PubMed - PMC - DOI
    1. Bycroft, C. et al. The UK Biobank resource with deep phenotyping and genomic data. Nature 562, 203–209 (2018). - PubMed - PMC - DOI
    1. All of Us Research Program Investigators; Denny, J. C. et al. The “All of Us” Research Program. N. Engl. J. Med. 381, 668–676, (2019).
    1. Li, Z. et al. A framework for detecting noncoding rare-variant associations of large-scale whole-genome sequencing studies. Nat. Methods 19, 1599–1611 (2022). - PubMed - PMC - DOI
    1. Bansal, V., Libiger, O., Torkamani, A. & Schork, N. J. Statistical analysis strategies for association studies involving rare variants. Nat. Rev. Genet. 11, 773–785 (2010). - PubMed - PMC - DOI

MeSH terms

Grants and funding

LinkOut - more resources