Article

Unsupervised Commonsense Knowledge Enrichment for Domain-Specific Sentiment Analysis

Details

Citation

Ofek N, Poria S, Rokach L, Cambria E, Hussain A & Shabtai A (2016) Unsupervised Commonsense Knowledge Enrichment for Domain-Specific Sentiment Analysis. Cognitive Computation, 8 (3), pp. 467-477. http://link.springer.com/article/10.1007/s12559-015-9375-3; https://doi.org/10.1007/s12559-015-9375-3

Abstract
Sentiment analysis in natural language text is a challenging task involving a deep understanding of both syntax and semantics. Leveraging the polarity of multiword expressions—or concepts—rather than single words can mitigate the difficulty of such a task as these expressions carry more contextual information than isolated words. Such contextual information is the key to understanding both the syntactic and semantic structure of natural language text and hence is useful in tasks such as sentiment analysis. In this work, we propose a new method to enrich SenticNet (a publicly available knowledge base for concept-level sentiment analysis) with domain-level concepts composed of aspects and sentiment word pairs, along with a measure of their polarity. We process a set of unlabeled texts and, by considering the statistical co-occurrence information, generate a direct acyclic graph (DAG) of concepts. The polarity score of known concepts is propagated and used to compute polarity scores of new concepts. By designing and implementing our exhaustive algorithm, we are able to use a seed set containing only two sentiment words (goodandbad). In our evaluation conducted on a dataset of hotel reviews, SenticNet was enriched by a factor of three (from 30,000 to nearly 90,000 concepts). The experiments demonstrate the merit of the concepts discovered by our method at improving sentence-level and aspect-level sentiment analysis tasks. Results of the two-factor ANOVA statistical test showed a confidence level of 95%, verifying that the improvements are statistically significant.

Keywords
Sentiment analysis; Sentiment lexicon; SenticNet; Sentic patterns

Journal
Cognitive Computation: Volume 8, Issue 3

StatusPublished
FundersThe Royal Society of Edinburgh
Publication date30/06/2016
Publication date online12/02/2016
Date accepted by journal07/12/2015
URLhttp://hdl.handle.net/1893/24030
PublisherSpringer
Publisher URLhttp://link.springer.com/article/10.1007/s12559-015-9375-3
ISSN1866-9956
eISSN1866-9964