ANALISIS SENTIMEN BERBASIS LEXICON BASED DENGAN ALGORITMA NAIVE BAYES TERHADAP KOMENTAR NETIZEN PADA VIDEO YOUTUBE DEBAT CAPRES/CAWAPRES DALAM PEMILU 2024 | ELECTRONIC THESES AND DISSERTATION

Electronic Theses and Dissertation

Universitas Syiah Kuala

    SKRIPSI

ANALISIS SENTIMEN BERBASIS LEXICON BASED DENGAN ALGORITMA NAIVE BAYES TERHADAP KOMENTAR NETIZEN PADA VIDEO YOUTUBE DEBAT CAPRES/CAWAPRES DALAM PEMILU 2024


Pengarang

Reza Fahrevi - Personal Name;

Dosen Pembimbing

Ardiansyah - 197212261992011001 - Dosen Pembimbing I
Yudha Nurdin - 197910012010121002 - Dosen Pembimbing II



Nomor Pokok Mahasiswa

1904111010058

Fakultas & Prodi

Fakultas Teknik / Teknik Komputer (S1) / PDDIKTI : 56202

Subject
-
Kata Kunci
-
Penerbit

Banda Aceh : .,

Bahasa

No Classification

-

Literature Searching Service

Hard copy atau foto copy dari buku ini dapat diberikan dengan syarat ketentuan berlaku, jika berminat, silahkan hubungi via telegram (Chat Services LSS)

Abstrak - Penelitian ini bertujuan untuk menganalisis sentimen netizen
Indonesia terhadap kandidat Presiden dan Wakil Presiden berdasarkan komentar
pada video debat di channel YouTube KPU. Ketidakpastian mengenai bagaimana
publik secara luas memandang para calon presiden (Capres) dan calon wakil
presiden (Cawapres) menjadi alasan utama dilakukannya penelitian ini.
Banyaknya komentar pro dan kontra pada video debat tersebut menunjukkan
tingginya ketertarikan publik yang berpotensi mempengaruhi pilihan mereka
terhadap Capres dan Cawapres. Data dikumpulkan dari lima debat yang
berlangsung antara 12 Desember 2023 hingga 10 Februari 2024, dengan total
15.027 komentar setelah preprocessing. Teknik analisis sentimen ini
menggunakan metode lexicon-based dan algoritma Naive Bayes menunjukkan
bahwa distribusi sentimen Anies Baswedan memiliki rata-rata sentimen positif
tertinggi di antara Capres (50,9%), diikuti oleh Prabowo Subianto (31,8%) dan
Ganjar Pranowo (17,4%). Sentimen negatif tertinggi ditemukan pada Anies
(45,1%) disusul Prabowo (42,6%), dan Ganjar (12,3%). Sentimen netral paling
banyak ada pada Prabowo (70,4%), dibandingkan Anies (24,7%) dan Ganjar
(4,9%). Sedangkan Cawapres, Gibran Rakabuming memiliki rata-rata sentimen
positif tertinggi (51,6%), diikuti Muhaimin Iskandar (25,9%) dan Mahfud MD
(22,6%). Gibran juga mendominasi sentimen negatif (56,3%), sementara
Muhaimin dan Mahfud mencatat masing-masing 22,6% dan 21,1%. Sentimen
netral tertinggi ada pada Gibran (81,2%), dibandingkan Muhaimin (11,9%) dan
Mahfud (6,8%). Berdasarkan model Naive Bayes dengan fitur TF-IDF
menunjukkan hasil performa yang stabil dengan akurasi 70,96% pada data
validasi dan 70,36% pada data uji. Presisi mencapai 75,60% pada data validasi
dan 72,33% pada data uji. Nilai F1 Score berada di kisaran 67,50% hingga
67,73%, menunjukkan keseimbangan yang baik antara presisi dan recall.
Keyword : Analisis Sentimen, Debat, Crawling, Preprocessing, Lexicon Based,
Naive Bayes, TF-IDF, Evaluasi Model, Visualisasi Data.

Abstract - This research aims to analyze the sentiment of Indonesian netizens toward Presidential and Vice-Presidential candidates based on comments on the debate videos on the KPU YouTube channel. Uncertainty regarding how the public perceives the presidential (Capres) and vice-presidential (Cawapres) candidates is the primary reason for conducting this research. The numerous pro and con comments on the debate videos indicate high public interest, which has the potential to influence their choices regarding the candidates. Data were collected from five debates held between December 12, 2023, and February 10, 2024, totaling 15,027 comments after preprocessing. This sentiment analysis technique uses a lexicon-based approach and the Naive Bayes algorithm, showing that the sentiment distribution for Anies Baswedan has the highest average positive sentiment among the presidential candidates (50.9%), followed by Prabowo Subianto (31.8%) and Ganjar Pranowo (17.4%). The highest negative sentiment was found for Anies (45.1%), followed by Prabowo (42.6%) and Ganjar (12.3%). The most neutral sentiment was seen for Prabowo (70.4%), compared to Anies (24.7%) and Ganjar (4.9%). Among the vice-presidential candidates, Gibran Rakabuming had the highest average positive sentiment (51.6%), followed by Muhaimin Iskandar (25.9%) and Mahfud MD (22.6%). Gibran also dominated negative sentiment (56.3%), while Muhaimin and Mahfud recorded 22.6% and 21.1%, respectively. The highest neutral sentiment was for Gibran (81.2%), compared to Muhaimin (11.9%) and Mahfud (6.8%). Based on the Naive Bayes model with TF-IDF features, the results show stable performance with an accuracy of 70.96% on validation data and 70.36% on test data. Precision reached 75.60% on validation data and 72.33% on test data. The F1 Score ranged from 67.50% to 67.73%, indicating a good balance between precision and recall. Keywords: Sentiment Analysis, Debate, Crawling, Preprocessing, Lexicon-Based, Naive Bayes, TF-IDF, Model Evaluation, Data Visualization.

Citation



    SERVICES DESK