Skip to main page content
U.S. flag

An official website of the United States government

Dot gov

The .gov means it’s official.
Federal government websites often end in .gov or .mil. Before sharing sensitive information, make sure you’re on a federal government site.

Https

The site is secure.
The https:// ensures that you are connecting to the official website and that any information you provide is encrypted and transmitted securely.

Access keys NCBI Homepage MyNCBI Homepage Main Content Main Navigation
. 2016 Nov:2016:795-804.
doi: 10.18653/v1/d16-1076.

Rationale-Augmented Convolutional Neural Networks for Text Classification

Affiliations

Rationale-Augmented Convolutional Neural Networks for Text Classification

Ye Zhang et al. Proc Conf Empir Methods Nat Lang Process. 2016 Nov.

Abstract

We present a new Convolutional Neural Network (CNN) model for text classification that jointly exploits labels on documents and their constituent sentences. Specifically, we consider scenarios in which annotators explicitly mark sentences (or snippets) that support their overall document categorization, i.e., they provide rationales. Our model exploits such supervision via a hierarchical approach in which each document is represented by a linear combination of the vector representations of its component sentences. We propose a sentence-level convolutional model that estimates the probability that a given sentence is a rationale, and we then scale the contribution of each sentence to the aggregate document representation in proportion to these estimates. Experiments on five classification datasets that have document labels and associated rationales demonstrate that our approach consistently outperforms strong baselines. Moreover, our model naturally provides explanations for its predictions.

PubMed Disclaimer

Figures

Figure 1
Figure 1
A toy example of a CNN for sentence classification. Here there are four filters, two with heights 2 and two with heights 3, resulting in feature maps with lengths 6 and 5 respectively.
Figure 2
Figure 2
A schematic of our proposed Rationale-Augmented Convolution Neural Network (RA-CNN). The sentences comprising a text are passed through a sentence model that outputs probabilities encoding the likelihood that sentences are neutral or a (positive or negative) rationale. Sentences likely to be rationales are given higher weights in the global document vector, which is the input to the document model.

References

    1. Bahdanau Dzmitry, Cho Kyunghyun, Bengio Yoshua. Neural machine translation by jointly learning to align and translate. arXiv preprint arXiv:1409.0473. 2014
    1. Collobert Ronan, Weston Jason. A unified architecture for natural language processing: Deep neural networks with multitask learning. Proceedings of the 25th international conference on Machine learning; ACM; 2008. pp. 160–167.
    1. Druck Gregory, Mann Gideon, McCallum Andrew. Learning from labeled features using generalized expectation criteria. Proceedings of the 31st annual international ACM SIGIR conference on Research and development in information retrieval; ACM; 2008. pp. 595–602.
    1. Goldberg Yoav. A primer on neural network models for natural language processing. arXiv preprint arXiv:1510.00726. 2015
    1. Higgins Julian PT, Altman Douglas G, Gøtzsche Peter C, Jüni Peter, Moher David, Oxman Andrew D, Savović Jelena, Schulz Kenneth F, Weeks Laura, Sterne Jonathan AC. The cochrane collaborations tool for assessing risk of bias in randomised trials. Bmj. 2011;343:d5928. - PMC - PubMed

LinkOut - more resources