Named Entity Recognition and Dictionary Construction for Korean Title: Books, Movies, Music and TV Programs


KIPS Transactions on Software and Data Engineering, Vol. 3, No. 7, pp. 285-292, Jul. 2014
10.3745/KTSDE.2014.3.7.285,   PDF Download:

Abstract

A named entity recognition method is used to improve the performance of information retrieval systems, question answering systems, machine translation systems and so on. The targets of the named entity recognition are usually PLOs (persons, locations and organizations). They are usually proper nouns or unregistered words, and traditional named entity recognizers use these characteristics to find out named entity candidates. The titles of books, movies and TV programs have different characteristics than PLO entities. They are sometimes multiple phrases, one sentence, or special characters. This makes it difficult to find the named entity candidates. In this paper we propose a method to quickly extract title named entities from news articles and automatically build a named entity dictionary for the titles. For the candidates identification, the word phrases enclosed with special symbols in a sentence are firstly extracted, and then verified by the SVM with using feature words and their distances. For the classification of the extracted title candidates, SVM is used with the mutual information of word contexts.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
Y. M. Park and J. S. Lee, "Named Entity Recognition and Dictionary Construction for Korean Title: Books, Movies, Music and TV Programs," KIPS Transactions on Software and Data Engineering, vol. 3, no. 7, pp. 285-292, 2014. DOI: 10.3745/KTSDE.2014.3.7.285.

[ACM Style]
Yong Min Park and Jae Sung Lee. 2014. Named Entity Recognition and Dictionary Construction for Korean Title: Books, Movies, Music and TV Programs. KIPS Transactions on Software and Data Engineering, 3, 7, (2014), 285-292. DOI: 10.3745/KTSDE.2014.3.7.285.