EMDS-5: Environmental Microorganism image dataset Fifth Version for multiple image analysis tasks
- PMID: 33979356
- PMCID: PMC8116046
- DOI: 10.1371/journal.pone.0250631
EMDS-5: Environmental Microorganism image dataset Fifth Version for multiple image analysis tasks
Abstract
Environmental Microorganism Data Set Fifth Version (EMDS-5) is a microscopic image dataset including original Environmental Microorganism (EM) images and two sets of Ground Truth (GT) images. The GT image sets include a single-object GT image set and a multi-object GT image set. EMDS-5 has 21 types of EMs, each of which contains 20 original EM images, 20 single-object GT images and 20 multi-object GT images. EMDS-5 can realize to evaluate image preprocessing, image segmentation, feature extraction, image classification and image retrieval functions. In order to prove the effectiveness of EMDS-5, for each function, we select the most representative algorithms and price indicators for testing and evaluation. The image preprocessing functions contain two parts: image denoising and image edge detection. Image denoising uses nine kinds of filters to denoise 13 kinds of noises, respectively. In the aspect of edge detection, six edge detection operators are used to detect the edges of the images, and two evaluation indicators, peak-signal to noise ratio and mean structural similarity, are used for evaluation. Image segmentation includes single-object image segmentation and multi-object image segmentation. Six methods are used for single-object image segmentation, while k-means and U-net are used for multi-object segmentation. We extract nine features from the images in EMDS-5 and use the Support Vector Machine (SVM) classifier for testing. In terms of image classification, we select the VGG16 feature to test SVM, k-Nearest Neighbors, Random Forests. We test two types of retrieval approaches: texture feature retrieval and deep learning feature retrieval. We select the last layer of features of VGG16 network and ResNet50 network as feature vectors. We use mean average precision as the evaluation index for retrieval. EMDS-5 is available at the URL:https://github.com/NEUZihan/EMDS-5.git.
Conflict of interest statement
The authors have declared that no competing interests exist.
Figures












References
-
- Chen Li, Kimiaki Shirahama, Marcin Grzegorzek. Environmental microbiology aided by content-based image analysis. Pattern Analysis and Applications. 2016;19(2):531–547. 10.1007/s10044-015-0498-7 - DOI
-
- Gonzalez Rafael C, Woods Richard E. Digital Image Processing (3rd Edition). Prentice-Hall. 2007;336.
-
- Pal Nikhil R, Pal Sankar K. A review on image segmentation techniques. Pattern recognition. 1993;26(9):1277–1294. 10.1016/0031-3203(93)90135-J - DOI
-
- Isabelle Guyon, Elisseeff André, Norbert Jankowski, Krzysztof Grabczewski, Dreyfus Gérard, Wlodzislaw Duch, et al.. Feature Extraction. Of Studies in Fuzziness & Soft Computing. 2006;31(7):1737–1744.
-
- Sergey Kosov, Kimiaki Shirahama, Chen Li, Marcin Grzegorzek. Environmental Microorganism Classification Using Conditional Random Fields and Deep Convolutional Neural Networks. Pattern recognition. 2017;p.S0031320317305174.
Publication types
MeSH terms
LinkOut - more resources
Full Text Sources
Other Literature Sources
Miscellaneous