. 2021 Jan 12;16(1):e0245230.

doi: 10.1371/journal.pone.0245230. eCollection 2021.

Multi-view classification with convolutional neural networks

Marco Seeland¹, Patrick Mäder¹

Affiliations

PMID: 33434208
PMCID: PMC7802953
DOI: 10.1371/journal.pone.0245230

Multi-view classification with convolutional neural networks

Marco Seeland et al. PLoS One. 2021.

. 2021 Jan 12;16(1):e0245230.

doi: 10.1371/journal.pone.0245230. eCollection 2021.

Authors

Marco Seeland¹, Patrick Mäder¹

Affiliation

¹ Institute for Computer and Systems Engineering, Technische Universität Ilmenau, Ilmenau, Germany.

PMID: 33434208
PMCID: PMC7802953
DOI: 10.1371/journal.pone.0245230

Erratum in

Correction: Multi-view classification with convolutional neural networks.
PLOS ONE Staff. PLOS ONE Staff. PLoS One. 2021 Apr 8;16(4):e0250190. doi: 10.1371/journal.pone.0250190. eCollection 2021. PLoS One. 2021. PMID: 33831129 Free PMC article.

Abstract

Humans' decision making process often relies on utilizing visual information from different views or perspectives. However, in machine-learning-based image classification we typically infer an object's class from just a single image showing an object. Especially for challenging classification problems, the visual information conveyed by a single image may be insufficient for an accurate decision. We propose a classification scheme that relies on fusing visual information captured through images depicting the same object from multiple perspectives. Convolutional neural networks are used to extract and encode visual features from the multiple views and we propose strategies for fusing these information. More specifically, we investigate the following three strategies: (1) fusing convolutional feature maps at differing network depths; (2) fusion of bottleneck latent representations prior to classification; and (3) score fusion. We systematically evaluate these strategies on three datasets from different domains. Our findings emphasize the benefit of integrating information fusion into the network rather than performing it by post-processing of classification scores. Furthermore, we demonstrate through a case study that already trained networks can be easily extended by the best fusion strategy, outperforming other approaches by large margin.

PubMed Disclaimer

Conflict of interest statement

The authors have declared that no competing interests exist.

Figures

**Fig 1. A collection of images is composed of multiple views depicting the same object instance from different perspectives.**

**Fig 2. Considered multi-view fusion strategies: (a) general architecture of a deep multi-view CNN; (b) investigated fusion strategies; and (c) fusion strategies mapped onto the ResNet-50 architecture.**
Vertical lines mark the insertion of a view-fusion layer.

**Fig 3. Example collections of the three multi-view datasets: (a) CompCars, (b) PlantCLEF, and (c) AntWeb.**
Photographs of the ant specimen CASENT0281563 by Estella Ortega retrieved from www.AntWeb.org [32].

**Fig 4. Distance matrices for the three datasets.**
Matrix diagonal elements refer to intra-class distance, off-diagonal elements to inter-class distances. Elements are sorted from well-separable classes to less-separable classes as computed from the class-wise silhouette scores.

**Fig 5. Distribution of class-averaged top-1 classification accuracy for the single-view baseline and the multi-view classification strategies.**
White dots indicate median accuracy whereas black bars display interquartile ranges. Thin black lines indicate lower and upper adjacent values at 1.5× the interquartile range.

See this image and copyright information in PMC

References

1. LeCun Y, Bengio Y, Hinton G. Deep Learning. Nature. 2015;521(7553):436–444. 10.1038/nature14539 - DOI - PubMed
1. Russakovsky O, Deng J, Su H, Krause J, Satheesh S, Ma S, et al.. ImageNet Large Scale Visual Recognition Challenge. International Journal of Computer Vision. 2015;115(3):211–252. 10.1007/s11263-015-0816-y - DOI
1. Seeland M, Rzanny M, Boho D, Wäldchen J, Mäder P. Image-based classification of plant genus and family for trained and untrained plant species. BMC Bioinformatics. 2019;20(1):4. 10.1186/s12859-018-2474-x - DOI - PMC - PubMed
1. Wäldchen J, Rzanny M, Seeland M, Mäder P. Automated plant species identification—Trends and future directions. PLOS Computational Biology. 2018;14(4):1–19. 10.1371/journal.pcbi.1005993 - DOI - PMC - PubMed
1. Marques ACR, Raimundo MM, Cavalheiro EMB, Salles LFP, Lyra C, Von Zuben FJ. Ant genera identification using an ensemble of convolutional neural networks. PLOS ONE. 2018;13(1):1–13. 10.1371/journal.pone.0192011 - DOI - PMC - PubMed

Publication types

Actions

MeSH terms

Actions
Actions
Actions

LinkOut - more resources

Full Text Sources
Other Literature Sources
- The Lens - Patent Citations Database
- scite Smart Citations

Save citation to file

Email citation

Add to Collections

Add to My Bibliography

Your saved search

Create a file for external citation management software

Your RSS Feed

Multi-view classification with convolutional neural networks

Affiliation

Multi-view classification with convolutional neural networks

Authors

Affiliation

Erratum in

Abstract

Conflict of interest statement

Figures

References

Publication types

MeSH terms

LinkOut - more resources

Full Text Sources

Other Literature Sources