Word Embedding Feature for Improvement Machine Learning Performance in Sentiment Analysis Disney Plus Hotstar Comments

Authors

  • Jasmir Jasmir Universitas Dinamika Bangsa
  • Nurhadi Nurhadi
  • Eni Rohaini
  • M Riza Pahlevi B
  • Daniel Sintong Pardamean Simanjuntak

DOI:

https://doi.org/10.26555/jiteki.v10i2.28799

Keywords:

Text Classification, Machine Learning Evaluation, Word Embedding, Sentiment Analysis, Social Media Analysis

Abstract

In this research we apply several machine learning methods and word embedding features to process social media data, specifically comments on the Disney Plus Hotstar application. The word embedding features used include Word2Vec, GloVe, and FastText. Our aim is to evaluate the impact of these features on the classification performance of machine learning methods such as Naive Bayes (NB), K-Nearest Neighbor (KNN), and Random Forest (RF). NB is very simple and efficient and very sensitive to feature selection. Meanwhile, KNN is known for its weaknesses such as biased k values, overly complex computations, memory limitations, and ignoring irrelevant attributes. Then RF has a weakness, namely that the evaluation value can change significantly with just a slight change in the data. Feature selection in text classification is crucial for enhancing scalability, efficiency, and accuracy. Our testing results indicate that KNN achieved the highest accuracy both before and after feature selection. The FastText feature led to the highest performance for KNN, yielding balanced accuracy, precision, recall, and F1-score values.

Downloads

Published

2024-06-28

How to Cite

[1]
J. Jasmir, N. Nurhadi, E. Rohaini, M. R. Pahlevi B, and D. S. Pardamean Simanjuntak, “Word Embedding Feature for Improvement Machine Learning Performance in Sentiment Analysis Disney Plus Hotstar Comments”, J. Ilm. Tek. Elektro Komput. Dan Inform, vol. 10, no. 2, pp. 290–301, Jun. 2024.

Issue

Section

Articles

Similar Articles

1 2 > >> 

You may also start an advanced similarity search for this article.