RAMDANI, MIFTAH WISNU (2024) KLASTERISASI TOPIK KONTEN CHANNEL YOUTUBE OTOMOTIF INDONESIA 2024 MENGGUNAKAN LATENT DIRICHLET ALLOCATION. S1 thesis, UNIVERSITAS MUHAMMADIYAH PURWOKERTO.

Text
MIFTAH WISNU RAMDANI_COVER.pdf

Download (3MB)
Text
MIFTAH WISNU RAMDANI_BAB I.pdf

Download (914kB)
Text
MIFTAH WISNU RAMDANI_BAB II.pdf

Download (1MB)
Text
MIFTAH WISNU RAMDANI_BAB III.pdf
Restricted to Registered users only

Download (1MB)
Text
MIFTAH WISNU RAMDANI_BAB IV.pdf
Restricted to Registered users only

Download (1MB)
Text
MIFTAH WISNU RAMDANI_BAB V.pdf
Restricted to Registered users only

Download (908kB)
Text
MIFTAH WISNU RAMDANI_DAFTAR PUSTAKA.pdf

Download (1MB)
Text
MIFTAH WISNU RAMDANI_LAMPIRAN.pdf
Restricted to Registered users only

Download (1MB)

Abstract

The current advancement of information technology has led to changes in various
aspects of human life, including the automotive sector. Many people use the
internet and social media to share opinions and information, with YouTube being
a popular platform for uploading videos and watching a wide range of content.
Based on this context, this study offers a solution to analyze the topic modeling of
content created by automotive YouTubers. The research employs the Latent
Dirichlet Allocation (LDA) method, one of the topic modeling techniques, to help
identify the topics that emerge on automotive YouTuber accounts. The LDA
method involves summarizing, clustering, linking, and processing data to
generate a weighted list of topics for each document. The data collection in this
study was conducted through observation and web scraping on YouTube using
the YouTube API V3. The data obtained in the initial stage were raw, so the next
step involved preprocessing to clean the data, making it easier for subsequent
processing. The topic modeling process using Latent Dirichlet Allocation aimed
to form the best possible topic model. The results showed a perplexity score of -
8.1616 and a coherence score of 0.4732 from six topics, with the most dominant
topic being the first topic, which had an accuracy rate of 27.7%, containing
keywords such as Honda, Yamaha, review, Kawasaki, engine, Ninja, Nmax, and
Vario. This study acknowledges certain limitations and weaknesses. Therefore,
the researchers recommend the following for future studies: 1. Adding a word
cloud to visualize the data, and 2. Incorporating additional machine learning
algorithms to compare and determine which algorithm performs better in future
research. Keywords: Clustering, YouTube, Automotive, Topic Modeling, Latent
Dirichlet Allocation.

Dosen Pembimbing: BADHARUDIN, ABID YANUAR | nidn0603018603
Item Type: Thesis (S1)
Uncontrolled Keywords: Clustering, YouTube, Automotive, Topic Modeling, Latent Dirichlet Allocation
Subjects: T Technology > T Technology (General)
Z Bibliography. Library Science. Information Resources > ZA Information resources
Divisions: Fakultas Tekniik Dan Sains > Teknik Informatika S1
Depositing User: Nur Hardiansyah
Date Deposited: 13 Nov 2024 08:32
Last Modified: 13 Nov 2024 08:32
URI: http://repository.ump.ac.id/id/eprint/17560

Actions (login required)

View Item
View Item