Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2025 Sep 18;188(19):5343-5362.e29.
doi: 10.1016/j.cell.2025.06.020. Epub 2025 Jul 8.

Modeling the vertebrate regulatory sequence landscape by UUATAC-seq and deep learning

Affiliations
Free article

Modeling the vertebrate regulatory sequence landscape by UUATAC-seq and deep learning

Xiaoping Han et al. Cell. .
Free article

Abstract

The regulatory sequences of vertebrate genomes remain incompletely understood. To address this, we developed an ultra-throughput, ultra-sensitive single-nucleus assay for transposase-accessible chromatin using sequencing (UUATAC-seq) protocol that enables the construction of chromatin accessibility landscapes for one species in a 1-day experiment. Using UUATAC-seq, we mapped candidate cis-regulatory elements (cCREs) across five representative vertebrate species. Our analysis revealed that genome size differences across species influence the number but not the size of cCREs. We introduced Nvwa cis-regulatory element (NvwaCE), a mega-task deep-learning model designed to interpret cis-regulatory grammar and predict cCRE landscapes directly from genomic sequences with high precision. NvwaCE demonstrated that regulatory grammar is more conserved than nucleotide sequences and that this grammar organizes cCREs into distinct functional modules. Moreover, NvwaCE accurately predicted the effects of synthetic mutations on lineage-specific cCRE function, aligning with causal quantitative trait loci (QTLs) and genome editing results. Together, our study provides a valuable resource for decoding the vertebrate regulatory language.

Keywords: NvwaCE; UUATAC-seq; cCRE; chromatin accessibility landscape; deep learning; genome editing; genomics; mutation effect; regulatory sequence; snATAC-seq.

PubMed Disclaimer

Conflict of interest statement

Declaration of interests The authors have filed two patents regarding UUATAC-seq technology and the NvwaCE pipeline.

LinkOut - more resources