Rationale-Augmented Convolutional Neural Networks for Text Classification

Ye Zhang¹, Iain Marshall², Byron C Wallace³

Affiliations

¹ Department of Computer Science, University of Texas at Austin.
² Department of Primary Care and Public Health Sciences, King's College London.
³ College of Computer and Information Science, Northeastern University.

PMID: 28191551
PMCID: PMC5300751
DOI: 10.18653/v1/d16-1076

Ye Zhang et al. Proc Conf Empir Methods Nat Lang Process. 2016 Nov;2016:795-804.

Abstract

We present a new Convolutional Neural Network (CNN) model for text classification that jointly exploits labels on documents and their constituent sentences. Specifically, we consider scenarios in which annotators explicitly mark sentences (or snippets) that support their overall document categorization, i.e., they provide rationales. Our model exploits such supervision via a hierarchical approach in which each document is represented by a linear combination of the vector representations of its component sentences. We propose a sentence-level convolutional model that estimates the probability that a given sentence is a rationale, and we then scale the contribution of each sentence to the aggregate document representation in proportion to these estimates. Experiments on five classification datasets that have document labels and associated rationales demonstrate that our approach consistently outperforms strong baselines. Moreover, our model naturally provides explanations for its predictions.
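
The weighting scheme described above can be sketched in a few lines. This is a minimal illustration under stated assumptions, not the authors' implementation: `document_vector` is a hypothetical helper name, and the sentence vectors and rationale probabilities are made-up values that, in the model, would come from the sentence-level CNN.

```python
import numpy as np

def document_vector(sentence_vectors, rationale_probs):
    """Combine sentence vectors into one document vector.

    Each sentence vector is scaled by its estimated probability of being
    a rationale, and the scaled vectors are summed.
    """
    sentence_vectors = np.asarray(sentence_vectors, dtype=float)  # shape (n_sentences, dim)
    rationale_probs = np.asarray(rationale_probs, dtype=float)    # shape (n_sentences,)
    if sentence_vectors.shape[0] != rationale_probs.shape[0]:
        raise ValueError("need exactly one probability per sentence")
    # A weighted sum over sentences is a vector-matrix product.
    return rationale_probs @ sentence_vectors

# Illustrative values only (assumed, not taken from the paper).
sentences = np.array([[1.0, 0.0],
                      [0.0, 1.0],
                      [1.0, 1.0]])
probs = np.array([0.9, 0.1, 0.5])

doc = document_vector(sentences, probs)
print(doc)
```

Sentences with a high estimated rationale probability dominate the resulting vector, while likely-neutral sentences contribute little. The same per-sentence weights can be read off as an explanation of which sentences drove the prediction.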

Figures

**Figure 1**
A toy example of a CNN for sentence classification. Here there are four filters, two of height 2 and two of height 3, resulting in feature maps of lengths 6 and 5, respectively.
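
The feature-map lengths in this caption follow from sliding a filter of height h over a sentence of n tokens without padding, which gives n - h + 1 positions. Lengths of 6 and 5 for heights 2 and 3 imply a 7-token sentence. The sketch below checks that arithmetic; the 7-token length is inferred from the caption, while the embedding dimension of 5, the random values, and the helper name `feature_map` are assumptions for illustration.

```python
import numpy as np

def feature_map(sentence_matrix, conv_filter):
    """Slide a filter over the token axis of a sentence matrix (no padding, stride 1)."""
    n_tokens, dim = sentence_matrix.shape
    height, filter_dim = conv_filter.shape
    if dim != filter_dim:
        raise ValueError("filter width must equal the embedding dimension")
    # One output value per window of `height` consecutive tokens.
    return np.array([
        np.sum(sentence_matrix[i:i + height] * conv_filter)
        for i in range(n_tokens - height + 1)
    ])

rng = np.random.default_rng(0)
sentence = rng.normal(size=(7, 5))  # 7 tokens, embedding dimension 5 (assumed)

lengths = {h: len(feature_map(sentence, rng.normal(size=(h, 5)))) for h in (2, 3)}
print(lengths)  # {2: 6, 3: 5}
```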

**Figure 2**
A schematic of our proposed Rationale-Augmented Convolutional Neural Network (RA-CNN). The sentences comprising a text are passed through a sentence model that outputs probabilities encoding the likelihood that sentences are neutral or a (positive or negative) rationale. Sentences likely to be rationales are given higher weights in the global document vector, which is the input to the document model.
