Application of TF-IDF and Xgboost Methods for Public Sentiment Analysis Towards Ozzaskin Skincare Brand on Social Media

Authors

  • Mesra Betty Yel Sekolah Tinggi Ilmu Komputer Cipta Karya Informatika (STIKOMCKI) Jakarta
  • Elviwani Elviwani Sekolah Tinggi Ilmu Komputer Cipta Karya Informatika (STIKOMCKI) Jakarta
  • Nova Dahliyanti Sekolah Tinggi Ilmu Komputer Cipta Karya Informatika (STIKOMCKI) Jakarta
  • Ahmad Syahran Zidane Sekolah Tinggi Ilmu Komputer Cipta Karya Informatika (STIKOMCKI) Jakarta

DOI:

https://doi.org/10.55606/jeei.v5i1.3676

Keywords:

Sentiment Analysis, Natural Language Processing (NLP), Social Media, Ozzaskin Skincare, Public Opinion

Abstract

Ozzaskin is a local skincare brand founded by Ustadzah Oki Setiana Dewi that targets Muslim women and focuses on reducing dark spots and acne scars. Over time, this domestic brand has attracted considerable public attention on social media—particularly among mothers—garnering both praise for its product efficacy and criticism regarding price and texture. This study aims to analyze public sentiment toward the Ozzaskin brand by performing web scraping on Instagram and TikTok data, employing TF-IDF for textual feature extraction and XGBoost as the classification algorithm. The findings are expected to provide a comprehensive overview of consumer perceptions of Ozzaskin and to assist the marketing team and product developers in formulating communication strategies and improving product formulas that more effectively address user needs. The novelty of this research lies in the comprehensive application of the TF-IDF + XGBoost framework for brand-related sentiment analysis on Indonesian-language social media.

References

Alfian, F., dkk. (2024). Klasifikasi sentimen review health-care di Indonesia: Perbandingan XGBoost dan Naïve Bayes. Jurnal Teknologi Informasi, 15(1), 23–38.

Chen, T., & Guestrin, C. (2016). XGBoost: A scalable tree boosting system. Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, 785–794.

Devlin, J., Chang, M. W., Lee, K., & Toutanova, K. (2019). BERT: Pre-training of deep bidirectional transformers for language understanding. *Proceedings of NAACL-HLT 2019*, 4171–4186.

Euromonitor International. (2024). Beauty and personal care in Indonesia.

Fathonah, F., & Herliana, A. (2024). Analisis sentimen e-commerce Indonesia. Jurnal Sistem & Informatika, 10(2), 45–61.

Hochreiter, S., & Schmidhuber, J. (1997). Long short-term memory. Neural Computation, 9(8), 1735–1780.

Jurafsky, D., & Martin, J. H. (2020). Speech and language processing (3rd ed.). Prentice Hall.

Kaplan, A. M., & Haenlein, M. (2010). Users of the world, unite! The challenges and opportunities of social media. Business Horizons, 53(1), 59–68.

Kim, Y. (2014). Convolutional neural networks for sentence classification. Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), 1746–1751.

Liu, B. (2012). Sentiment analysis and opinion mining. Morgan & Claypool.

Macdonald, A., & O’Brien, L. (2014). Measuring public sentiment in social media. Journal of Information Science, 40(5), 606–617.

Manning, C. D., Raghavan, P., & Schütze, H. (2008). Introduction to information retrieval. Cambridge University Press.

Mitchell, R. (2015). Web scraping with Python: Collecting data from the modern web. O’Reilly Media.

Ozzaskin. (2025, Juni 15). About Us. https://www.ozzaskin.co.id/about

Pappas, N., & Popović, M. (2021). IndoBERT: A pre-trained BERT for Bahasa Indonesia. ACL Anthology.

Downloads

Published

2026-06-02

How to Cite

Mesra Betty Yel, Elviwani Elviwani, Nova Dahliyanti, & Ahmad Syahran Zidane. (2026). Application of TF-IDF and Xgboost Methods for Public Sentiment Analysis Towards Ozzaskin Skincare Brand on Social Media. Journal of Engineering, Electrical and Informatics, 5(1), 18–25. https://doi.org/10.55606/jeei.v5i1.3676