Automatic Construction of Korean Two-Level Lexicon using Lexical and Morphological Information


KIPS Transactions on Software and Data Engineering, Vol. 2, No. 12, pp. 865-872, Dec. 2013
DOI: 10.3745/KTSDE.2013.2.12.865

Abstract

The two-level morphological analysis method is one of the rule-based approaches to morphological analysis. It handles morphological transformations with rules and analyzes words using the morpheme connection information stored in a lexicon. The method is language-independent, and a Korean two-level system has also been developed. However, that system was of limited practical use because it relied on a very small, manually built lexicon, and it also suffered from an over-generation problem. In this paper, we propose a method for automatically constructing a Korean two-level lexicon for PC-KIMMO from a morpheme-tagged corpus. We also propose a method that addresses the over-generation problem using lexical information and sub-tags. In our experiments, the proposed method reduced over-generation by 68% compared with the previous method, and the F-measure increased from 39% to 65%.


Cite this article
[IEEE Style]
B. G. Kim and J. S. Lee, "Automatic Construction of Korean Two-Level Lexicon using Lexical and Morphological Information," KIPS Transactions on Software and Data Engineering, vol. 2, no. 12, pp. 865-872, 2013. DOI: 10.3745/KTSDE.2013.2.12.865.

[ACM Style]
Bo Gyum Kim and Jae Sung Lee. 2013. Automatic Construction of Korean Two-Level Lexicon using Lexical and Morphological Information. KIPS Transactions on Software and Data Engineering, 2, 12, (2013), 865-872. DOI: 10.3745/KTSDE.2013.2.12.865.