In this paper, we proposed a co-reference resolution method for texts in the indonesian language. The objective of co-reference resolution is to identify equivalence between entities as well as between pronouns and entities that were recognized in a named entity recognition phase. We propose a method that uses association rules. The method combines several features, such as pronoun and name classes, string similarity and position in the text, into a vector of attributes. Applied to a corpus of newspaper articles in the indonesian language, the method yields an F-Measure of 84.12%. We compare the result to one of state-of-the-art matchine learning method for co-reference resolution, decision tree, and the result is comparable.
Keywords: Co-reference resolution, association rules, pronoun, entity equivalence.
|
|