A Korean Language Stemmer based on Unsupervised Learning


The KIPS Transactions:PartB , Vol. 8, No. 6, pp. 675-684, Dec. 2001
10.3745/KIPSTB.2001.8.6.675,   PDF Download:

Abstract

This paper describes a method for stemming of Korean language by using unsupervised learning from raw corpus. This technique does not require a lexicon or any language-specific knowledge. Since we use unsupervised learning, the time and effort required for learning is negligible. Unlike heuristic approaches that are theoretically ungrounded, this method is based on widely accepted statistical methods, and therefore can be easily extended. The method is currently applied only to Korean language, but it can easily be adapted to other agglutinative languages, since it is not language-dependent.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
S. H. Cho, "A Korean Language Stemmer based on Unsupervised Learning," The KIPS Transactions:PartB , vol. 8, no. 6, pp. 675-684, 2001. DOI: 10.3745/KIPSTB.2001.8.6.675.

[ACM Style]
Se Hyeong Cho. 2001. A Korean Language Stemmer based on Unsupervised Learning. The KIPS Transactions:PartB , 8, 6, (2001), 675-684. DOI: 10.3745/KIPSTB.2001.8.6.675.