Optimizing sequence design strategies for perturbation MPRAs: a computational evaluation framework
- PMID: 38296821
- PMCID: PMC10939410
- DOI: 10.1093/nar/gkae012
Optimizing sequence design strategies for perturbation MPRAs: a computational evaluation framework
Abstract
The advent of perturbation-based massively parallel reporter assays (MPRAs) technique has facilitated the delineation of the roles of non-coding regulatory elements in orchestrating gene expression. However, computational efforts remain scant to evaluate and establish guidelines for sequence design strategies for perturbation MPRAs. In this study, we propose a framework for evaluating and comparing various perturbation strategies for MPRA experiments. Within this framework, we benchmark three different perturbation approaches from the perspectives of alteration in motif-based profiles, consistency of MPRA outputs, and robustness of models that predict the activities of putative regulatory motifs. While our analyses show very similar results across multiple benchmarking metrics, the predictive modeling for the approach involving random nucleotide shuffling shows significant robustness compared with the other two approaches. Thus, we recommend designing sequences by randomly shuffling the nucleotides of the perturbed site in perturbation-MPRA, followed by a coherence check to prevent the introduction of other variations of the target motifs. In summary, our evaluation framework and the benchmarking findings create a resource of computational pipelines and highlight the potential of perturbation-MPRA in predicting non-coding regulatory activities.
© The Author(s) 2024. Published by Oxford University Press on behalf of Nucleic Acids Research.
Figures
Update of
-
Best practices for perturbation MPRA-a computational evaluation framework of sequence design strategies.bioRxiv [Preprint]. 2023 Sep 29:2023.09.27.559768. doi: 10.1101/2023.09.27.559768. bioRxiv. 2023. Update in: Nucleic Acids Res. 2024 Feb 28;52(4):1613-1627. doi: 10.1093/nar/gkae012. PMID: 37808807 Free PMC article. Updated. Preprint.
References
-
- Agarwal V., Inoue F., Schubach M., Martin B.K., Dash P.M., Zhang Z., Sohota A., Noble W.S., Yardimci G.G., Kircher M. et al. . Massively parallel characterization of transcriptional regulatory elements in three diverse human cell types. 2023; bioRxiv doi:06 March 2023, preprint: not peer reviewed10.1101/2023.03.05.531189. - DOI
-
- Deng C., Whalen S., Steyert M., Ziffra R., Przytycki P.F., Inoue F., Pereira D.A., Capauto D., Norton S., Vaccarino F.M. et al. . Massively parallel characterization of psychiatric disorder-associated and cell-type-specific regulatory elements in the developing human cortex. 2023; bioRxiv doi:16 February 2023, preprint: not peer reviewed10.1101/2023.02.15.528663. - DOI - PubMed
MeSH terms
Substances
Grants and funding
LinkOut - more resources
Full Text Sources
Molecular Biology Databases
