Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2022 Jul 14;8(8):e09935.
doi: 10.1016/j.heliyon.2022.e09935. eCollection 2022 Aug.

A study on quality control using delta data with machine learning technique

Affiliations

A study on quality control using delta data with machine learning technique

Yufang Liang et al. Heliyon. .

Abstract

Background: In the big data era, patient-based real-time quality control (PBRTQC), as an emerging quality control (QC) method, is expanding within the clinical laboratory industry. However, the main issue of current PBRTQC methodology is data stability. Our study is aimed to explore a novel protocol for data stability by combining delta data with machine learning (ML) technique to improve the capacity of QC event detection.

Methods: A data set of 423,290 laboratory results from Beijing Chao-yang Hospital 2019 patient results were used as a training set (n = 380960, 90%) and internal validation set (n = 42330, 10%). A further 22,460 results from Beijing Long-fu Hospital 2019 patient results were used as a test set. Three-type data (1) Single-type data processed by truncation limits; (2) delta-type data processed by truncation limits and (3)delta-type data processed by Isolated Forest (IF) algorithm were evaluated with accuracy, sensitivity, NPed, etc., and compared with previously published statistical methods.

Results: The optimal model was based on Random Forest (RF) algorithm by using delta-type data processed by IF algorithm. The model had a better accuracy (0.99), sensitivity (0.99) specificity (0.99) and AUC (0.99) with the dependent test set, surpassing the critical bias of PBRTQC by over 50%. For the LYMPH#, HGB, and PLT, the cumulative MNPed of MLQC were reduced by 95.43%, 97.39%, and 97.97% respectively when compared to the best of the PBRTQC.

Conclusion: Final results indicate that by integrating an innovative ML algorithm with the overall data processing protocol the detection of QC events is improved.

Keywords: Data processing; Delta data; Machine learning; Patient-based real-time quality control; Random forest.

PubMed Disclaimer

Conflict of interest statement

The authors declare no conflict of interest.

Figures

Figure 1
Figure 1
Integrated experimental process diagram.
Figure 2
Figure 2
Data separability between critical biased and unbiased data for three-type data by PCA. The 3 rows from top to bottom represented LYMPH #, HGB and PLT of Cell Blood Count, the 3 columns from left to right represented single-type data, delta-type delta, and delta-type data processed by IF. Every point in each diagram represented a ML sample with the same block size consisting of 10 patient raw data.
Figure 3
Figure 3
The visualization of data distribution feature for the training and the test sets and the performance parameters of five experiments at critical bias for LYMPH #, HGB and PLT. A-C take examples of LYMPH #, HGB and PLT ordered from left to right, represented principal component analysis (PCA) plots of the training set and internal validation set. D represented the TPR, TNR, FPR, FNR and ACC of the five algorithms (TPR - true positive rate; TNR - true negative rate; FPR - false positive rate; FNR - false negative rate; ACC - accuracy). E represented ANPed, MNPed, 95NPed of them (ANped - average of Nped; MNped - median of Nped; 95Nped - 95 quantile of Nped).
Figure 4
Figure 4
The curves for the comparison performance of 5 experiments. A,B,C corresponded to LYMPH #, HGB, and PLT, respectively. Colored lines represented MNPed for each bias, colored area represented the associated 95NPed. Parameters were displayed in the top corner (BS: block size; T: truncation limit, BC: with Box–Cox transformation).

Similar articles

Cited by

References

    1. Miller W.G., Erek A., Cunningham T.D., Oladipo O., Scott M.G., Johnson R.E. Commutability limitations influence quality control results with different reagent lots. Clin. Chem. 2011;57(1):76–83. - PubMed
    1. Thaler M.A., Iakoubov R., Bietenbeck A., Luppa P.B. Clinically relevant lot-to-lot reagent difference in a commercial immunoturbidimetric assay for glycated hemoglobin A1c. Clin. Biochem. 2015;48(16-17):1167–1170. - PubMed
    1. Duan X., Wang B., Zhu J., Zhang C., Jiang W., Zhou J., Shao W., Zhao Y., Yu Q., Lei L., Yiu K.L., Chin K.T., Pan B., Guo W. Regression-adjusted real-time quality control. Clin. Chem. 2021;67(10):1342–1350. - PubMed
    1. Ng D., Polito F.A., Cervinski M.A. Optimization of a moving averages program using a simulated annealing algorithm: the goal is to monitor the process not the patients. Clin. Chem. 2016;62(10):1361–1371. - PubMed
    1. Yamashita T., Ichihara K., Miyamoto A. A novel weighted cumulative delta-check method for highly sensitive detection of specimen mix-up in the clinical laboratory. Clin. Chem. Lab. Med. 2013;51(4):781–789. - PubMed

LinkOut - more resources