Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2014 May 20:5:417.
doi: 10.3389/fpsyg.2014.00417. eCollection 2014.

A Bayesian generative model for learning semantic hierarchies

Affiliations

A Bayesian generative model for learning semantic hierarchies

Roni Mittelman et al. Front Psychol. .

Abstract

Building fine-grained visual recognition systems that are capable of recognizing tens of thousands of categories, has received much attention in recent years. The well known semantic hierarchical structure of categories and concepts, has been shown to provide a key prior which allows for optimal predictions. The hierarchical organization of various domains and concepts has been subject to extensive research, and led to the development of the WordNet domains hierarchy (Fellbaum, 1998), which was also used to organize the images in the ImageNet (Deng et al., 2009) dataset, in which the category count approaches the human capacity. Still, for the human visual system, the form of the hierarchy must be discovered with minimal use of supervision or innate knowledge. In this work, we propose a new Bayesian generative model for learning such domain hierarchies, based on semantic input. Our model is motivated by the super-subordinate organization of domain labels and concepts that characterizes WordNet, and accounts for several important challenges: maintaining context information when progressing deeper into the hierarchy, learning a coherent semantic concept for each node, and modeling uncertainty in the perception process.

Keywords: Bayesian inference; Bayesian models of cognition; hierarchical clustering; non-parametric Bayes; semantics.

PubMed Disclaimer

Figures

Figure 1
Figure 1
An illustrative example of representing the (A) domain hierarchy using (B) binary vectors, and (C) using a relaxed probabilistic interpretation.
Figure 2
Figure 2
Two fragments of the hierarchy learned using our generative model, using the annotated attributes available for the training set of the PASCAL dataset. The left panel corresponds to the “living things” domain, whereas the right panel corresponds to the “transportation” domain. The sign (−) denotes an internal node.
Figure 3
Figure 3
Two fragments of the hierarchy learned using our generative model, using the attribute scores obtained for the testing set of the PASCAL dataset, when training the attribute detectors using the training set. The top panel corresponds to the “living things” domain, whereas the bottom panel corresponds to the “transportation” domain. The sign (−) denotes an internal node.
Figure 4
Figure 4
The taxonomy for the 20 categories in the PASCAL dataset.
Figure 5
Figure 5
The average edge error (Equation 4) vs. the agglomerative hierarchical clustering threshold parameter, for the hierarchical clustering obtained using the (A) training set's attribute annotations, and (B) attribute detectors applied to the testing set image instances. Smaller values indicate better performance. It can be seen that our attribute tree process (ATP) algorithm outperforms the factored Bernoulli likelihood model (FBLM), and unlike the agglomerative hierarchical clustering (AHC), it is not as sensitive to the choice of the hyper-parameters.

References

    1. Adams R., Ghahramani Z., Jordan M. I. (2010). Tree-structured stick breaking for hierarchical data, in Advances in Neural Information Processing Systems (Vancouver, BC: ), 19–27
    1. Bart E., Welling M., Perona P. (2011). Unsupervised organization of image collections: taxonomies and beyond. IEEE Trans. Pattern Anal. Mach. Intell. 33, 2302–2315 10.1109/TPAMI.2011.79 - DOI - PubMed
    1. Bentivogli L., Forner P., Magnini B., Pianta E. (2004). Revising wordnet domains hierarchy: semantics, coverage, and balancing, in Proceedings of COLING 2004 Workshop on Multilingual Linguistic Resources (Switzerland: ), 101–108
    1. Berg T. L., Berg A. C., Shih J. (2010). Automatic attribute discovery and characterization from noisy web data, in European Conference on Computer Vision, 663–676
    1. Binder A., Muller K. R., Kawanabe M. (2012). On taxonomies for multi-class image categorization. Int. J. Comput. Vis. 99, 281–301 10.1007/s11263-010-0417-8 - DOI

LinkOut - more resources