Attacking the out-of-domain problem of a parasite egg detection in-the-wild
- PMID: 39670064
- PMCID: PMC11636797
- DOI: 10.1016/j.heliyon.2024.e26153
Attacking the out-of-domain problem of a parasite egg detection in-the-wild
Abstract
The out-of-domain (OO-Do) problem has hindered machine learning models especially when the models are deployed in the real world. The OO-Do problem occurs during machine learning testing phase when a learned machine learning model must predict on data belonging to a class that is different from that of the data used for training. We tackle the OO-Do problem in an object-detection task: a parasite-egg detection model used in real-world situations. First, we introduce the In-the-wild parasite-egg dataset to evaluate the OO-Do-aware model. The dataset contains 1,552 images, 1,049 parasite-egg, and 503 OO-Do images, uploaded through chatbot. It was constructed by conducting a chatbot test session with 222 medical technology students. Thereafter, we propose a data-driven framework to construct a parasite-egg recognition model for in-the-wild applications to address the OO-Do issue. In the framework, we use publicly available datasets to train the parasite-egg recognition models about in-domain and out-of-domain concepts. Finally, we compare the integration strategies for our proposed two-step parasite-egg detection approach on two test sets: standard and In-the-wild datasets. We also investigate different thresholding strategies for model robustness to OO-Do data. Experiments on two test datasets showed that concatenating an OO-Do-aware classification model after an object-detection model achieved outstanding performance in detecting parasite eggs. The framework gained 7.37% and 4.09% F1-score improvement from the baselines on Chula +Wild dataset and the In-the-wild parasite-egg dataset, respectively.
Keywords: Chatbot; Computer vision in-the-wild; Data driven framework; Out-of-domain; Parasite egg detection.
© 2024 The Author(s).
Conflict of interest statement
The authors declare the following financial interests/personal relationships which may be considered as potential competing interests: Thanapong Intharah reports financial support was provided by The National Science, Research and Innovation Fund (NSRF), Thailand.
Figures
References
-
- B. Zhao, S. Yu, W. Ma, M. Yu, S. Mei, A. Wang, J. He, A. Yuille, A. Kortylewski, Ood-cv: A benchmark for robustness to out-of-distribution shifts of individual nuisances in natural images, in: Proceedings of the European Conference on Computer Vision (ECCV).
-
- Olber B., Radlak K., Popowicz A., Szczepankiewicz M., Chachula K. 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) 2023. Detection of out-of-distribution samples using binary neuron activation patterns.
-
- Lai C.-H., Yu S.-S., Tseng H.-Y., Tsai M.-H. A protozoan parasite extraction scheme for digital microscopic images. Comput. Med. Imaging Graph. 2010;34(2):122–130. - PubMed
-
- Osaku D., Cuba C.F., Suzuki C.T., Gomes J.F., Falcão A.X. Automated diagnosis of intestinal parasites: a new hybrid approach and its benefits. Comput. Biol. Med. 2020;123 - PubMed
LinkOut - more resources
Full Text Sources
