Integrating a Lexicon Based Approach and K Nearest Neighbour for Malay Sentiment Analysis
- 1 FTSM University Kebangsaan Malaysia, Malaysia
Abstract
Sentiment analysis or opinion mining refers to the automatic extraction of sentiments from a natural language text. Although many studies focusing on sentiment analysis have been conducted, there remains a limited amount of studies that focus on sentiment analysis in the Malay language. In this article, a new approach for automatic sentiment analysis of Malay movie reviews is proposed, implemented and evaluated. In contrast to most studies that focus on supervised or unsupervised machine learning approaches, this research aims to propose a new model for Malay sentiment analysis based on a combination of both approaches. We used sentiment lexicons in the new model to generate a new set of features to train a k-Nearest Neighbour (k-NN) classifier. We further illustrated that our hybrid method outperforms the state of-the-art unigram baseline.
DOI: https://doi.org/10.3844/jcssp.2015.639.644
Copyright: © 2015 Ahmed Alsaffar and Nazlia Omar. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
- 3,802 Views
- 2,609 Downloads
- 12 Citations
Download
Keywords
- Malay Sentiment Analysis
- Feature Extraction
- Machine Learning
- Combinations Techniques