Korean Probabilistic Dependency Grammar Induction by morpheme


The KIPS Transactions:PartB , Vol. 9, No. 6, pp. 791-798, Dec. 2002
10.3745/KIPSTB.2002.9.6.791,   PDF Download:

Abstract

In this thesis, we present a new method for inducing a probabilistic dependency grammar (PDG) from text corpus. As words in Korean are composed of a set of more basic morphemes, there exist various dependency relations in a word. So, if the induction process does not take into account of these in-word dependency relations, the accuracy of the resulting grammar may be poor. In comparison with previous PDG induction methods, the main difference of the proposed method lies in the fact that the method takes into account in-word dependency relations as well as inter-word dependency relations. To access the performance of the proposed method, we conducted an experiment using a manually-tagged corpus of 25,000 sentences which is complied by Korean Advanced Institute of Science and Technology (KAIST). The grammar induction produced 2,349 dependency rules. The parser with these dependency rules showed 69.77% accuracy in terms of the number of correct dependency relations relative to the total number dependency relations for best-1 parse trees of sample sentences. The result shows that taking into account in-word dependency relations in the course of grammar induction results in a more accurate dependency grammar.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
S. H. Choi and H. R. Park, "Korean Probabilistic Dependency Grammar Induction by morpheme," The KIPS Transactions:PartB , vol. 9, no. 6, pp. 791-798, 2002. DOI: 10.3745/KIPSTB.2002.9.6.791.

[ACM Style]
Seon Hwa Choi and Hyuk Ro Park. 2002. Korean Probabilistic Dependency Grammar Induction by morpheme. The KIPS Transactions:PartB , 9, 6, (2002), 791-798. DOI: 10.3745/KIPSTB.2002.9.6.791.