Domain Adaptation for Sentiment Classification in Light of Multiple Sources

Fang Fang, Kaushik Dutta, Anindya Datta
Department of Information Systems, National University of Singapore, Singapore 117418

Published Online: 5 May 2014
https://doi.org/10.1287/ijoc.2013.0585

Abstract

Sentiment classification is one of the most extensively studied problems in sentiment analysis, and supervised learning methods, which require labeled data for training, have proven quite effective. However, supervised methods assume that the training domain and the testing domain share the same distribution; when they do not, accuracy drops dramatically. Although this poses no problem when training data are readily available, in some circumstances labeled data are quite expensive to acquire. For instance, if we want to detect sentiment in Tweets or Facebook comments, the only way to acquire labeled data is to label them manually, which is prohibitively burdensome and time-consuming. In this paper, we propose a hybrid approach that integrates the sentiment information from source-domain labeled data and a set of preselected sentiment words to solve this problem. The experimental results suggest that our method statistically outperforms the state of the art and even, in some cases, surpasses the in-domain gold standard.

INFORMS Journal on Computing

Volume 26, Issue 3

Summer 2014

Pages 415-643

Article Information

Received: October 01, 2012
Accepted: October 01, 2013
Published Online: May 05, 2014

Cite as

Fang Fang, Kaushik Dutta, Anindya Datta (2014) Domain Adaptation for Sentiment Classification in Light of Multiple Sources. INFORMS Journal on Computing 26(3):586-598.

https://doi.org/10.1287/ijoc.2013.0585
