Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
Review
. 2025 Jul 24;23(1):823.
doi: 10.1186/s12967-025-06802-x.

Medical laboratory data-based models: opportunities, obstacles, and solutions

Affiliations
Review

Medical laboratory data-based models: opportunities, obstacles, and solutions

Jiaojiao Meng et al. J Transl Med. .

Abstract

Medical Laboratory Data (MLD) models, which combine artificial intelligence with big medical data, have great potential in disease screening, diagnosis, personalized medicine, and health management. This study thoroughly examines the opportunities, challenges, and solutions in this field. The use of large-scale MLD improves diagnostic accuracy and allows for real-time disease monitoring. Additionally, integrating social and environmental data enables the analysis of disease mechanisms and trends. Despite these benefits, challenges such as data quality, model optimization, computational requirements, and limited interpretability remain, along with concerns about data privacy, fairness, and security. Proposed solutions include establishing standardized data formats, utilizing deep learning frameworks, employing distributed computing, improving interpretability, and implementing techniques like federated learning and algorithm optimization to address bias and safeguard privacy. Future directions will focus on enhancing performance in specific scenarios, expanding applications across different domains, increasing transparency, enabling real-time processing, and building a supportive ecosystem. It is essential to strengthen policy oversight and promote collaboration among governments, medical institutions, and academia to ensure that technological advancements align with societal progress.

Keywords: Large models; Medical laboratory data; Obstacles; Opportunities.

PubMed Disclaimer

Conflict of interest statement

Declarations. Ethics approval and consent to participate: Not applicable. Consent for publication: Not applicable. Competing interests: The authors declare that they have no competing interests.

Figures

Fig. 1
Fig. 1
Sources, Features, and Applications of MLD. This figure outlines the different sources and characteristics of MLD, emphasizing their multidimensionality, diverse formats, complexity, dynamics, and time series features. It classifies MLD into clinical tests, laboratory biomolecular omics data and physiological monitoring from portable devices, demonstrating how these data types are used in risk prediction, screening, diagnosis, and treatment strategies in various medical fields.
Fig. 2
Fig. 2
Obstacles and solutions to the Medical Laboratory Data Model. This figure outlines the main challenges faced in developing MLD-based models and the corresponding solutions. The obstacles include data quality issues, computational demands, lack of interpretability, and concerns regarding data privacy and security. Solutions include utilizing distributed computing technologies, high-performance hardware, deep learning frameworks, and techniques like federated learning, differential privacy, and homomorphic encryption. Additionally, strategies for improving model optimization, interpretability, and fairness are suggested, such as incorporating regularization layers, residual connections, LASSO regression, and SHAP for interpretability, as well as addressing data and algorithmic biases to enhance model fairness and security.

Similar articles

References

    1. You J et al. Plasma proteomic profiles predict individual future health risk, (in eng), Nat Commun, vol. 14, no. 1, p. 7817, Nov 28 2023, 10.1038/s41467-023-43575-7 - PMC - PubMed
    1. You J et al. Development of a novel dementia risk prediction model in the general population: A large, longitudinal, population-based machine-learning study, (in eng), EClinicalMedicine, vol. 53, p. 101665, Nov 2022, 10.1016/j.eclinm.2022.101665 - PMC - PubMed
    1. Goeminne LJE et al. Plasma protein-based organ-specific aging and mortality models unveil diseases as accelerated aging of organismal systems, (in eng), Cell Metab, vol. 37, no. 1, pp. 205–222.e6, Jan 7 2025, 10.1016/j.cmet.2024.10.005 - PubMed
    1. Nurmohamed NS et al. Proteomics and lipidomics in atherosclerotic cardiovascular disease risk prediction, (in eng). Eur Heart J, 44, 18, pp. 1594–607, May 7 2023, 10.1093/eurheartj/ehad161 - PMC - PubMed
    1. Jee J et al. Automated real-world data integration improves cancer outcome prediction, (in eng), Nature, vol. 636, no. 8043, pp. 728–736, Dec 2024, 10.1038/s41586-024-08167-5 - PMC - PubMed

LinkOut - more resources