The Design of Keyword Spotting System based on Auditory Phonetical Knowledge-Based Phonetic Value Classification

Hack Jin Kim; Soon Hyub Kim

The Design of Keyword Spotting System based on Auditory Phonetical Knowledge-Based Phonetic Value Classification

Hack Jin Kim

Soon Hyub Kim

The KIPS Transactions:PartB , Vol. 10, No. 2, pp. 169-178, Apr. 2003

10.3745/KIPSTB.2003.10.2.169, PDF Download:

Abstract

This study outlines two viewpoints the classification of phone likely unit (PLU) which is the foundation of korean large vocabulary speech recognition, and the effectiveness of Chiljongseong (7 Final Consonants) and Paljongseong (8 Final Consonants) of the korean language. The phone likely classifies the phoneme phonetically according to the location of and method of articulation, and about 50 phone-likely units are utilized in korean speech recognition. In this study auditory phonetical knowledge was applied to the classification of phone likely unit to present 45 phone likely unit . The vowels ´ㅔ, ㅐ´ were classified as phone-likely of [ee];´ㅒ, ㅖ´ as [ye]; and ´ㅚ, ㅙ, ㅞ´ as [we]. Secondly, the Chiljongseong System of the draft for unified spelling system which is currently in use and the Paljongseonggajokyong of Korean script haerye were illustrated. The question on whether the phonetic value on ´ㄷ´ and ´ㅅ´ among the phonemes used in the final consonant of the korean lan guage is the same has been argued in the academic world for a long time. In this study, the transition stages of Korean consonants were investigated, and Chiljongseong and Paljongseonggajokyong were utilized in speech recognition, and its effectiveness was verified. The experiment was divided into isolated word recognition and speech recognition, and in order to conduct the experiment PBW452 was used to test the isolated word recognition. The experiment was conducted on about 50 men and women-divided into 5 groups-and they vocalized 50 words each. As for the continuous speech recognition experiment to be utilized in the materialized stock exchange system, the sentence corpus of 71 stock exchange sentences and speech corpus vocalizing the sentences were collected and used 5 men and women each vocalized a sentence twice. As the result of the experiment, when the Paljongseonggajokyong was used as the consonant, the recognition performance elevated by an average of about 1.45% ; and when phone likely unit with Paljongseonggajokyong and auditory phonetic applied simultaneously, was applied, the rate of recognition increased by an average of 1.5% to 2.02%. In the continuous speech recognition experiment, the recognition performance elevated by an average of about 1% to 2% than when the existing 49 or 56 phone likely units were utilized.

Statistics

Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.

Cite this article

[IEEE Style]

H. J. Kim and S. H. Kim, "The Design of Keyword Spotting System based on Auditory Phonetical Knowledge-Based Phonetic Value Classification," The KIPS Transactions:PartB , vol. 10, no. 2, pp. 169-178, 2003. DOI: 10.3745/KIPSTB.2003.10.2.169.

[ACM Style]

Hack Jin Kim and Soon Hyub Kim. 2003. The Design of Keyword Spotting System based on Auditory Phonetical Knowledge-Based Phonetic Value Classification. The KIPS Transactions:PartB , 10, 2, (2003), 169-178. DOI: 10.3745/KIPSTB.2003.10.2.169.