Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 Aug;29(4):3825-3856.
doi: 10.1007/s11030-025-11211-9. Epub 2025 May 20.

Machine learning approaches for predicting the small molecule-miRNA associations: a comprehensive review

Affiliations
Review

Machine learning approaches for predicting the small molecule-miRNA associations: a comprehensive review

Ashish Panghalia et al. Mol Divers. 2025 Aug.

Abstract

MicroRNAs (miRNAs) are evolutionarily conserved small regulatory elements that are ubiquitous in cells and are found to be abnormally expressed during the onset and progression of several human diseases. miRNAs are increasingly recognized as potential diagnostic and therapeutic targets that could be inhibited by small molecules (SMs). The knowledge of SM-miRNA associations (SMAs) is sparse, mainly because of the dynamic and less predictable 3D structures of miRNAs that restrict the high-throughput screening of SMs. Toward augmenting the costly and laborious experiments determining the SM-miRNA interactions, machine learning (ML) has emerged as a cost-effective and efficient platform. In this article, various aspects associated with the ML-guided predictions of SMAs are thoroughly reviewed. Firstly, a detailed account of the SMA data resources useful for algorithms training is provided, followed by an elaboration of various feature extraction methods and similarity measures utilized on SMs and miRNAs. Subsequent to a summary of the ML algorithms basics and a brief description of the performance measures, an exhaustive census of all the 32 ML-based SMA prediction methods developed so far is outlined. Distinctive features of these methods have been described by classifying them into six broad categories, namely, classical ML, deep learning, matrix factorization, network propagation, graph learning, and ensemble learning methods. Trend analyses are performed to investigate the patterns in ML algorithms usage and performance achievement in SMA prediction. Outlining key principles behind the up-to-date methodologies and comparing their accomplishments, this review offers valuable insights into critical areas for future research in ML-based SMA prediction.

Keywords: Machine learning in medicine; SM–miRNA association; Small molecule drugs; miRNA therapeutics; miRNA–disease association.

PubMed Disclaimer

Conflict of interest statement

Declarations. Competing interests: The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper. Ethical approval: In this work, all the data have been accessed from the public databases, and the analyses were performed using computational methods. No part of this work involves human participants, human data, or human tissue. Therefore, no ethical approval is needed.

Similar articles

References

    1. Cech TR, Steitz JA (2014) The noncoding RNA revolution-trashing old rules to forge new ones. Cell 157:77–94. https://doi.org/10.1016/J.CELL.2014.03.008 - DOI - PubMed
    1. Esteller M (2011) Non-coding RNAs in human disease. Nat Rev Genet 12:861–874. https://doi.org/10.1038/nrg3074 - DOI - PubMed
    1. Nemeth K, Bayraktar R, Ferracin M, Calin GA (2023) Non-coding RNAs in disease: from mechanisms to therapeutics. Nat Rev Genet 25:211–232. https://doi.org/10.1038/s41576-023-00662-1 - DOI - PubMed
    1. Winkle M, El-Daly SM, Fabbri M, Calin GA (2021) Noncoding RNA therapeutics—challenges and potential solutions. Nat Rev Drug Discov 20:629–651. https://doi.org/10.1038/s41573-021-00219-z - DOI - PubMed - PMC
    1. Lee RC, Feinbaum RL, Ambros V (1993) The C. elegans heterochronic gene lin-4 encodes small RNAs with antisense complementarity to lin-14. Cell 75:843–854. https://doi.org/10.1016/0092-8674(93)90529-Y - DOI - PubMed

LinkOut - more resources