UCL Discovery Stage
UCL home » Library Services » Electronic resources » UCL Discovery Stage

An empirical comparison of machine learning methods for text-based sentiment analysis of online consumer reviews

Alantari, HJ; Currim, IS; Deng, Y; Singh, S; (2022) An empirical comparison of machine learning methods for text-based sentiment analysis of online consumer reviews. International Journal of Research in Marketing , 39 (1) pp. 1-19. 10.1016/j.ijresmar.2021.10.011. Green open access

[thumbnail of Deng_1-s2.0-S0167811621000926-main.pdf]
Preview
Text
Deng_1-s2.0-S0167811621000926-main.pdf - Accepted Version

Download (948kB) | Preview

Abstract

The amount of digital text-based consumer review data has increased dramatically and there exist many machine learning approaches for automated text-based sentiment analysis. Marketing researchers have employed various methods for analyzing text reviews but lack a comprehensive comparison of their performance to guide method selection in future applications. We focus on the fundamental relationship between a consumer’s overall empirical evaluation, and the text-based explanation of their evaluation. We study the empirical tradeoff between predictive and diagnostic abilities, in applying various methods to estimate this fundamental relationship. We incorporate methods previously employed in the marketing literature, and methods that are so far less common in the marketing literature. For generalizability, we analyze 25,119 products in nine product categories, and 260,489 reviews across five review platforms. We find that neural network-based machine learning methods, in particular pre-trained versions, offer the most accurate predictions, while topic models such as Latent Dirichlet Allocation offer deeper diagnostics. However, neural network models are not suited for diagnostic purposes and topic models are ill equipped for making predictions. Consequently, future selection of methods to process text reviews is likely to be based on analysts’ goals of prediction versus diagnostics.

Type: Article
Title: An empirical comparison of machine learning methods for text-based sentiment analysis of online consumer reviews
Open access status: An open access version is available from UCL Discovery
DOI: 10.1016/j.ijresmar.2021.10.011
Publisher version: https://doi.org/10.1016/j.ijresmar.2021.10.011
Language: English
Additional information: This version is the author accepted manuscript. For information on re-use, please refer to the publisher’s terms and conditions.
Keywords: Automated Text Analysis, Sentiment Analysis, Online Reviews, User Generated Content, Machine Learning, Natural Language Processing
UCL classification: UCL
UCL > Provost and Vice Provost Offices > UCL BEAMS
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science
UCL > Provost and Vice Provost Offices > UCL BEAMS > Faculty of Engineering Science > UCL School of Management
URI: https://discovery-pp.ucl.ac.uk/id/eprint/10138086
Downloads since deposit
23,716Downloads
Download activity - last month
Download activity - last 12 months
Downloads by country - last 12 months

Archive Staff Only

View Item View Item