Comparison of Deep Learning Models Using Protein Sequence Data


KIPS Transactions on Software and Data Engineering, Vol. 11, No. 6, pp. 245-254, Jun. 2022
https://doi.org/10.3745/KTSDE.2022.11.6.245,   PDF Download:
Keywords: CNN, LSTM, GRU, Combined Model, Protein Sequence
Abstract

Proteins are the basic unit of all life activities, and understanding them is essential for studying life phenomena. Since the emergence of the machine learning methodology using artificial neural networks, many researchers have tried to predict the function of proteins using only protein sequences. Many combinations of deep learning models have been reported to academia, but the methods are different and there is no formal methodology, and they are tailored to different data, so there has never been a direct comparative analysis of which algorithms are more suitable for handling protein data. In this paper, the single model performance of each algorithm was compared and evaluated based on accuracy and speed by applying the same data to CNN, LSTM, and GRU models, which are the most frequently used representative algorithms in the convergence research field of predicting protein functions, and the final evaluation scale is presented as Micro Precision, Recall, and F1-score. The combined models CNN-LSTM and CNN-GRU models also were evaluated in the same way. Through this study, it was confirmed that the performance of LSTM as a single model is good in simple classification problems, overlapping CNN was suitable as a single model in complex classification problems, and the CNN-LSTM was relatively better as a combination model.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
J. M. Lee and H. Lee, "Comparison of Deep Learning Models Using Protein Sequence Data," KIPS Transactions on Software and Data Engineering, vol. 11, no. 6, pp. 245-254, 2022. DOI: https://doi.org/10.3745/KTSDE.2022.11.6.245.

[ACM Style]
Jeung Min Lee and Hyun Lee. 2022. Comparison of Deep Learning Models Using Protein Sequence Data. KIPS Transactions on Software and Data Engineering, 11, 6, (2022), 245-254. DOI: https://doi.org/10.3745/KTSDE.2022.11.6.245.