Analyzing Korean Math Word Problem Data Classification Difficulty Level Using the KoEPT Model


KIPS Transactions on Software and Data Engineering, Vol. 11, No. 8, pp. 315-324, Aug. 2022
https://doi.org/10.3745/KTSDE.2022.11.8.315,   PDF Download:
Keywords: Math Word Problems, Generation Model, Transformer, Pointer Network, Classification Difficulty
Abstract

In this paper, we propose KoEPT, a Transformer-based generative model for automatic math word problems solving. A math word problem written in human language which describes everyday situations in a mathematical form. Math word problem solving requires an artificial intelligence model to understand the implied logic within the problem. Therefore, it is being studied variously across the world to improve the language understanding ability of artificial intelligence. In the case of the Korean language, studies so far have mainly attempted to solve problems by classifying them into templates, but there is a limitation in that these techniques are difficult to apply to datasets with high classification difficulty. To solve this problem, this paper used the KoEPT model which uses ‘expression’ tokens and pointer networks. To measure the performance of this model, the classification difficulty scores of IL, CC, and ALG514, which are existing Korean mathematical sentence problem datasets, were measured, and then the performance of KoEPT was evaluated using 5-fold cross-validation. For the Korean datasets used for evaluation, KoEPT obtained the state-of-the-art(SOTA) performance with 99.1% in CC, which is comparable to the existing SOTA performance, and 89.3% and 80.5% in IL and ALG514, respectively. In addition, as a result of evaluation, KoEPT showed a relatively improved performance for datasets with high classification difficulty. Through an ablation study, we uncovered that the use of the ‘expression’ tokens and pointer networks contributed to KoEPT’s state of being less affected by classification difficulty while obtaining good performance.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
R. Sangkyu, K. K. Seo, K. Bugeun, G. Gahgene, "Analyzing Korean Math Word Problem Data Classification Difficulty Level Using the KoEPT Model," KIPS Transactions on Software and Data Engineering, vol. 11, no. 8, pp. 315-324, 2022. DOI: https://doi.org/10.3745/KTSDE.2022.11.8.315.

[ACM Style]
Rhim Sangkyu, Ki Kyung Seo, Kim Bugeun, and Gweon Gahgene. 2022. Analyzing Korean Math Word Problem Data Classification Difficulty Level Using the KoEPT Model. KIPS Transactions on Software and Data Engineering, 11, 8, (2022), 315-324. DOI: https://doi.org/10.3745/KTSDE.2022.11.8.315.