ENHANCING SENTIMENT ANALYSIS ACCURACY WITH BERT AND SILHOUETTE METHOD OPTIMIZATION

KELVIN and FRANS MIKAEL SINAGA and WULAN SRI LESTARI and SUNARYO WINARDI and RONSEN PURBA (2025) ENHANCING SENTIMENT ANALYSIS ACCURACY WITH BERT AND SILHOUETTE METHOD OPTIMIZATION. JITK (JURNAL ILMU PENGETAHUAN DAN TEKNOLOGI KOMPUTER), 11 (1). pp. 76-86. ISSN 2527-4864

Full text not available from this repository.

Abstract

This research is based on the emergence of ChatGPT technology, which has significant implications in various fields. This research aims to design a model that improves sentiment analysis classification accuracy. The methods applied include the use of the Silhouette Coefficient to determine the best cluster parameters before performing data grouping with the Self-Organizing Map (SOM) method. Additionally, the Bidirectional Encoder Representations from Transformers (BERT) model is utilized to perform precise and convergent sentiment classification. The research methodology encompasses several phases, including data preprocessing through natural language processing techniques. Textual data is converted into vector representations, which are then processed using the Silhouette Coefficient to identify the optimal cluster parameters. These parameters are subsequently applied in the Self-Organizing Map method to cluster data, while the Bidirectional Encoder Representations from Transformers model determines public sentiment, categorized as positive, negative, or neutral. The findings of this study indicate that the best cluster parameter is 9, using a batch size of 64 and a maximum sequence length of 128. The highest accuracy achieved using the confusion matrix is 92.06%. Further tests with varying parameters confirm that the Silhouette Coefficient method significantly enhances the convergence and accuracy of classification outcomes. The conclusion of this research is that integrating the Silhouette Coefficient and Bidirectional Encoder Representations from Transformers is effective in optimizing sentiment analysis on large datasets, achieving both accurate and reliable results.

Item Type: Article
Divisions: Lembaga Penelitian dan Pengabdian kepada Masyarakat (LPPM) > Artikel > Fakultas Informatika
Depositing User: Anwar Fauzi Ritonga
Date Deposited: 10 Sep 2025 02:48
Last Modified: 10 Sep 2025 03:20
URI: https://repository.mikroskil.ac.id/id/eprint/4220

Actions (login required)

View Item
View Item