Morpheme Recovery Based on Naive Bayes Model

Jae Hoon Kim; Kul Ho Jeon

Morpheme Recovery Based on Naive Bayes Model

Jae Hoon Kim

Kul Ho Jeon

The KIPS Transactions:PartB , Vol. 19, No. 3, pp. 195-200, Jun. 2012

10.3745/KIPSTB.2012.19.3.195, PDF Download:

Abstract

In Korean, spelling change in various forms must be recovered into base forms in morphological analysis as well as part-of-speech (POS) tagging is difficult without morphological analysis because Korean is agglutinative. This is one of notorious problems in Korean morphological analysis and has been solved by morpheme recovery rules, which generate morphological ambiguity resolved by POS tagging. In this paper, we propose a morpheme recovery scheme based on machine learning methods like Naive Bayes models. Input features of the models are the surrounding context of the syllable which the spelling change is occurred and categories of the models are the recovered syllables. The POS tagging system with the proposed model has demonstrated the -score of 97.5% for the ETRI tree-tagged corpus. Thus it can be decided that the proposed model is very useful to handle morpheme recovery in Korean.

Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.

Cite this article

[IEEE Style]

J. H. Kim and K. H. Jeon, "Morpheme Recovery Based on Naive Bayes Model," The KIPS Transactions:PartB , vol. 19, no. 3, pp. 195-200, 2012. DOI: 10.3745/KIPSTB.2012.19.3.195.

[ACM Style]

Jae Hoon Kim and Kul Ho Jeon. 2012. Morpheme Recovery Based on Naive Bayes Model. The KIPS Transactions:PartB , 19, 3, (2012), 195-200. DOI: 10.3745/KIPSTB.2012.19.3.195.