Library Automation and Digital Archive
LONTAR
Fakultas Ilmu Komputer
Universitas Indonesia

Call Number SEM-372
Collection Type Article Index, Proceedings/Seminar
Title Modified DBpedia Entities Expansion for Tagging Automatically NER Dataset, pp. 216-227
Author Ika Alfina, Septiviana Savitri, and Mohamad Ivan Fanamy
Publisher ICACSIS 2017, International Conference on Advanced Computer Science and Information Systems
Subject NER; building dataset; noise reduction; DBpedia
Location
Location: Library of the Fakultas Ilmu Komputer
Call Number Collection ID Status
SEM-372 AVAILABLE
No reviews for this collection item: 47285
Abstract: Developing an NER system with a machine learning approach requires a large dataset, which is costly if the dataset is labeled manually. Previous works proposed methods for automatically tagging an Indonesian NER dataset, using Wikipedia articles as the source of the dataset and DBpedia as the reference for entity types. However, the quality of the resulting dataset was still inadequate. A method named DBpedia entities expansion (DEE) introduced several rules to expand the named entities in DBpedia in order to improve recall, but it did not manage to remove invalid names from the list of person names in the expanded DBpedia. We call the modification that addresses this problem modified DEE (M-DEE). The evaluation shows that M-DEE improves precision for person names by around 3% compared to the original DEE. By adding gazetteers for place and organization names to the expanded DBpedia created by M-DEE, a margin of about 10% in the overall F1-score for all types was achieved.