Automatic Extraction of Collocations based on Corpus using mutual information


The Transactions of the Korea Information Processing Society (1994 ~ 2000), Vol. 1, No. 4, pp. 461-468, Nov. 1994
10.3745/KIPSTE.1994.1.4.461,   PDF Download:

Abstract

This paper describes the automatic extraction of collocations based on corpus. The collocations are extracted from corpus using cooccurrence frequency and mutual information between words. In English, 5 types of collocations are defined. These collocations are transitive verb and object, intransitive verb and subject, adjective and noun, verb and adverb, and adverb and adjective. In this paper another type of collocation is recognized and extracted, which consists of verb and preposition. So 6 types of collocations are extracted based on corpus.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
L. H. Suk, "Automatic Extraction of Collocations based on Corpus using mutual information," The Transactions of the Korea Information Processing Society (1994 ~ 2000), vol. 1, no. 4, pp. 461-468, 1994. DOI: 10.3745/KIPSTE.1994.1.4.461.

[ACM Style]
Lee Ho Suk. 1994. Automatic Extraction of Collocations based on Corpus using mutual information. The Transactions of the Korea Information Processing Society (1994 ~ 2000), 1, 4, (1994), 461-468. DOI: 10.3745/KIPSTE.1994.1.4.461.