Library Automation and Digital Archive
LONTAR
Fakultas Ilmu Komputer
Universitas Indonesia

Pencarian Sederhana

Find Similar Add to Favorite

Call Number SEM-372
Collection Type Indeks Artikel prosiding/Sem
Title Hate speech detection in the indonesian languange dataset and preliminary study. Hal 233-237
Author Ika Alfina, Rio Mulia, Mohamd Ivan Fanany,and Yudo Ekanata;
Publisher ICACSIS 2017 International Con ference on advanced computer science and information system.
Subject building dataset; classification; hate speech detection; machine learning
Location
Lokasi : Perpustakaan Fakultas Ilmu Komputer
Nomor Panggil ID Koleksi Status
SEM-372 TERSEDIA
Tidak ada review pada koleksi ini: 47288
Abstract- The objective of our work is to detect hate speech in the indonesian language. As far as we know, the research on this subject is still very rare. The only research we found has created a dataset is inadequate. Our research aimed to create a new dataset that covers hate speecch in general, including hatred for religion, race, ethnicity,, and gender. In addition, we also conducted a preliminary study using machine learning approach. Machnie learning so far is the most frequently used approach in classifying text. We compered the performance of several featurs and machine learning algorith for hate speech detection. Features that extracted were word n-gram n=1 and n=2, character n-gram with n=3 and n=4 and negative sentiment. The classification was performed using naive bayes, support vector machine, bayesian logistic regression, and random forest decision tree. An F-measure of 93.5% was achieved when using word n-gram feature with random forest decision tree algorithm. Result also show that word n-gram feature outperformed character n-gram.