Implementasi Text Mining Pada Analisis Sentimen Pengguna Twitter Terhadap Marketplace di Indonesia Menggunakan Algoritma Support Vector Machine

Dyah Auliya Agustina, Sri Subanti, Etik Zukhronah

Abstract

In this digital era, technology development has changed the behavior of society from buy offline to online. One of this behavioral changes is marked by the growth of global marketplace including in Indonesia. The big marketplaces in Indonesia that have received a lot of public response on social media are Tokopedia, Shopee, and Bukalapak. This research determines the public sentiment toward both the service and issues surrounding these three marketplaces on media social especially Twitter. Public opinion is classified into a positive or negative sentiment. The data used in this study is obtained from Twitter API (Application Programming Interface) using keyword Shopee, Tokopedia, and Bukalapak. Preprocessing texts are divided into five steps: cleansing, case folding, stemming, stopwords, and tokenizing. Training and testing data are divided using k-fold cross validation method, while visualization the characteristic of text is using word cloud. Research shows that public are posting tweet more positive sentiment than negative one. The perfomance of classification shows that the best G-mean and AUC value for Bukalapak testing data are 0.85 and 0.86 in the first fold. While the best G-mean and AUC value for Shopee testing data are 0.76 and 0.77 in the seventh fold and the best G-mean and AUC value for Tokopedia testing data are 0.82 and 0.83 in the sixth fold.

Keywords : sentiment analysis, marketplace, support vector machine, twitter

Full Text:

PDF

References

Statista, databooks.katadata.co.id, diakses pada 20 Januari 2020.

Imelda dan Affandes, M. Penerapan Metode Support Vector Machine (SVM) Menggunakan Kernel Radial Basis Function (RBF) pada Klasifikasi Tweet. Jurnal Sains, Teknologi, dan Industri. Vol. 12, No. 2. 2015.

Maulana, A. and Pratiwi, H. Sentiment Analysis of Public Towards Infrastructure Development in Indonesia on Twitter Media Using Boosting Support Vector Machine Method. International Conference on Science (ICSAS). AIP Conf. Proc. 2202, 020082-1-020082-13. 2019.

Joachims, T. Text Categorization with Support Vector Machine: Learning with Many Relevant Features. Proceedings of The 10th European Conference on Machine Learning. Pages 137-142. 1998.

Feldman, R. and Sanger, J. The Text Mining Handbook: Advance Approaches in Analyzing Unstructured Data. Cambridge University Press, New York. 2007.

Liu, B. Handbook of Natural Language Processing 2nd Edition. CRC Press, Boca Raton. 2010.

Twitter Help, http://help.twitter.com/, diakses pada 21 Januari 2020.

Sofiani, I. dan Nurhidayat, A. I. Rancang Bangun Aplikasi E-Marketplace Hasil Pertanian Berbasis Website dengan Menggunakan Framework Codeigniter. Jurnal Managemen Informatika, Vol. 10, No. 01. 2019.

Robertson, S. Understanding Inverse Document Frequency: On Theoritical Arguments for IDF. Journal of Documentation, Vol. 60, No. 5, 510. 2004.

Christianini, N. and Taylor, J. An Introduction to Support Vector Machine and Other Kernel-based Learning Methods. Cambridge University Press, Cambridge.

Suyanto. 2018. Data Mining untuk Klasifikasi dan Klasterisasi Data Edisi Revisi. Informatika, Jakarta. 2000.

Gunn, S. R. Support Vector Machine for Classification and Regression. University of Southampton, Southampton. 1998.

Bekkar, M., Djemaa, H. K., and Alitouch, T. A. Evaluation Measure for Models Assesment over Imbalanced Data Sets. Journal of Information Engineering and Aplications, 3, 2738. 2013.

Castella, Q. and Sutton, C. Word Storm: Multiples of Word Clouds for Visual Comparison of Documents. International World Wide Web Conference Committee (IW3C2). ACM 978-1-4503-2744-2/14/04. 2014.

Refbacks

  • There are currently no refbacks.