CBD: Coffee Beans Dataset
- PMID: 40201542
- PMCID: PMC11978365
- DOI: 10.1016/j.dib.2025.111434
CBD: Coffee Beans Dataset
Abstract
The development of advanced coffee bean classification techniques depends on the availability of high quality datasets. Coffee bean quality is influenced by various factors, including bean size, shape, colour, and defects such as fungal damage, full black, full sour, broken beans, and insect damage. Constructing an accurate and reliable ground truth dataset for coffee bean classification is a challenging and labour intensive process. To address this need, we introduce the Coffee Beans Dataset (CBD) which contains 450 high-resolution images sampled across 9 distinct coffee bean grades A, AA, AAA, AB, C, PB-I, PB-II, BITS and BULK with 50 images per class. These samples were sourced from Wayanad, Kerala, reflecting the region's diverse coffee bean quality .This dataset is specifically designed to support machine learning and deep learning models for coffee bean classification and grading. By providing a comprehensive and diverse dataset, we aim to address key challenges in coffee quality assessment and improvement in classification accuracy. When tested using the EfficientNet-B0 model, the model achieved a high accuracy of 100%, demonstrating its potential to enhance automated coffee bean grading systems. The CBD serves as a valuable resource for researchers and industry professionals, promot-ing innovation in coffee quality monitoring and classification algorithms.
Keywords: Brightness; Coffee bean; Contrast; Grayscale.
© 2025 The Author(s).
Figures
References
-
- Jayakumari B.N.B., Mambilamthoda A.N.K., Stephen S.A., Venkitesan P., Raghavendra V. Coffee bean graded based on deep net models. Int. J. Electr. Comput. Eng. 2024;14(3):3084–3093. doi: 10.11591/ijece.v14i3. 2088-8708. - DOI
LinkOut - more resources
Full Text Sources
