Integrating Topic, Sentiment, and Syntax for Modeling Online Reviews: A Topic Model ApproachSource: Journal of Computing and Information Science in Engineering:;2019:;volume( 019 ):;issue: 001::page 11001DOI: 10.1115/1.4041475Publisher: The American Society of Mechanical Engineers (ASME)
Abstract: Analyzing product online reviews has drawn much interest in the academic field. In this research, a new probabilistic topic model, called tag sentiment aspect models (TSA), is proposed on the basis of Latent Dirichlet allocation (LDA), which aims to reveal latent aspects and corresponding sentiment in a review simultaneously. Unlike other topic models which consider words in online reviews only, syntax tags are taken as visual information and, in this research, as a kind of widely used syntax information, part-of-speech (POS) tags are first reckoned. Specifically, POS tags are integrated into three versions of implementation in consideration of the fact that words with different POS tags might be utilized to express consumers' opinions. Also, the proposed TSA is one unsupervised approach and only a small number of positive and negative words are required to confine different priors for training. Finally, two big datasets regarding digital SLR and laptop are utilized to evaluate the performance of the proposed model in terms of sentiment classification and aspect extraction. Comparative experiments show that the new model can not only achieve promising results on sentiment classification but also leverage the performance on aspect extraction.
|
Show full item record
| contributor author | Tang, Min | |
| contributor author | Jin, Jian | |
| contributor author | Liu, Ying | |
| contributor author | Li, Chunping | |
| contributor author | Zhang, Weiwen | |
| date accessioned | 2019-03-17T10:32:17Z | |
| date available | 2019-03-17T10:32:17Z | |
| date copyright | 10/18/2018 12:00:00 AM | |
| date issued | 2019 | |
| identifier issn | 1530-9827 | |
| identifier other | jcise_019_01_011001.pdf | |
| identifier uri | http://yetl.yabesh.ir/yetl1/handle/yetl/4256192 | |
| description abstract | Analyzing product online reviews has drawn much interest in the academic field. In this research, a new probabilistic topic model, called tag sentiment aspect models (TSA), is proposed on the basis of Latent Dirichlet allocation (LDA), which aims to reveal latent aspects and corresponding sentiment in a review simultaneously. Unlike other topic models which consider words in online reviews only, syntax tags are taken as visual information and, in this research, as a kind of widely used syntax information, part-of-speech (POS) tags are first reckoned. Specifically, POS tags are integrated into three versions of implementation in consideration of the fact that words with different POS tags might be utilized to express consumers' opinions. Also, the proposed TSA is one unsupervised approach and only a small number of positive and negative words are required to confine different priors for training. Finally, two big datasets regarding digital SLR and laptop are utilized to evaluate the performance of the proposed model in terms of sentiment classification and aspect extraction. Comparative experiments show that the new model can not only achieve promising results on sentiment classification but also leverage the performance on aspect extraction. | |
| publisher | The American Society of Mechanical Engineers (ASME) | |
| title | Integrating Topic, Sentiment, and Syntax for Modeling Online Reviews: A Topic Model Approach | |
| type | Journal Paper | |
| journal volume | 19 | |
| journal issue | 1 | |
| journal title | Journal of Computing and Information Science in Engineering | |
| identifier doi | 10.1115/1.4041475 | |
| journal fristpage | 11001 | |
| journal lastpage | 011001-12 | |
| tree | Journal of Computing and Information Science in Engineering:;2019:;volume( 019 ):;issue: 001 | |
| contenttype | Fulltext |