Dataset for image classification with knowledge

Franck Anaël Mbiaya^{1

2}, Christel Vrain¹, Frédéric Ros², Thi-Bich-Hanh Dao¹, Yves Lucas²

Affiliations

¹ University Orleans, INSA Centre Val de Loire, LIFO, EA 4022, France.
² University Orleans, INSA Centre Val de Loire, PRISME, EA 4229, France.

PMID: 39328969
PMCID: PMC11424803
DOI: 10.1016/j.dib.2024.110893

Dataset for image classification with knowledge

Franck Anaël Mbiaya et al. Data Brief. 2024.

. 2024 Sep 5:57:110893.

doi: 10.1016/j.dib.2024.110893. eCollection 2024 Dec.

Authors

Franck Anaël Mbiaya^{1

2}, Christel Vrain¹, Frédéric Ros², Thi-Bich-Hanh Dao¹, Yves Lucas²

Affiliations

¹ University Orleans, INSA Centre Val de Loire, LIFO, EA 4022, France.
² University Orleans, INSA Centre Val de Loire, PRISME, EA 4229, France.

PMID: 39328969
PMCID: PMC11424803
DOI: 10.1016/j.dib.2024.110893

Abstract

Deep learning applied to raw data has demonstrated outstanding image classification performance, mainly when abundant data is available. However, performance significantly degrades when a substantial volume of data is unavailable. Furthermore, deep architectures struggle to achieve satisfactory performance levels when distinguishing between distinct classes, such as fine-grained image classification, is challenging. Utilizing a priori knowledge alongside raw data can enhance image classification in demanding scenarios. Nevertheless, only a limited number of image classification datasets given with a priori knowledge are currently available, thereby restricting research efforts in this field. This paper introduces innovative datasets for the classification problem that integrate a priori knowledge. These datasets are built from existing data typically employed for multilabel multiclass classification or object detection. Frequent closed itemset mining is used to create classes and their corresponding attributes (e.g. the presence of an object in an image) and then to extract a priori knowledge expressed by rules on these attributes. The algorithm for generating rules is described.

Keywords: Computer vision; Deep learning; Image classification; Knowledge; Rules.

PubMed Disclaimer

Figures

Fig 1 — **Fig. 1**
Three classes randomly selected from the COCO@11 dataset.

Fig 2 — **Fig. 2**
Illustration with three classes randomly selected from the COCO@24 dataset.

Fig 3 — **Fig. 3**
Illustration with three classes randomly selected from the COCO@48 dataset.

Fig 4 — **Fig. 4**
Steps to create each dataset.

Fig 5 — **Fig. 5**
Enumeration tree for generating rules for the class CD.

See this image and copyright information in PMC

References

1. Lin Tsung Y., et al. Computer Vision – ECCV 2014. Springer International Publishing; Cham: 2014. Microsoft COCO: common objects in context; pp. 740–755. T. and S. B. and T. T. Fleet David and Pajdla, Ed.
1. Durand Nicolas B., Crémilleux . Research and Development in Intelligent Systems XIX. Springer London; London: 2003. ECCLAT: a new approach of clusters discovery in categorical data; pp. 177–190. A. and C. F. Bramer Max and Preece, Ed.
1. von Rueden L., et al. Proceedings of the IEEE Transactions on Knowledge & Data Engineering. Vol. 35. 2023. Informed machine learning – a taxonomy and survey of integrating prior knowledge into learning systems; pp. 614–633. - DOI
1. E. Poeta, G. Ciravegna, E. Pastor, T. Cerquitelli, and E. Baralis, “Concept-based explainable artificial intelligence: a survey,” 2023. [Online]. Available: https://arxiv.org/abs/2312.12936.
1. K. Simonyan and A. Zisserman, “Very deep convolutional networks for large-scale image recognition,” 2015. [Online]. Available: https://arxiv.org/abs/1409.1556.

LinkOut - more resources

Full Text Sources
- Elsevier Science
- PubMed Central

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Dataset for image classification with knowledge

Affiliations

Dataset for image classification with knowledge

Authors

Affiliations

Abstract

Figures

References

LinkOut - more resources

Full Text Sources