Evaluation of Human Demonstration Augmented Deep Reinforcement Learning Policies via Object Manipulation with an Anthropomorphic Robot Hand


KIPS Transactions on Software and Data Engineering, Vol. 10, No. 5, pp. 179-186, May. 2021
https://doi.org/10.3745/KTSDE.2021.10.5.179,   PDF Download:
Keywords: Anthropomorphic Robot Hand, Deep Reinforcement Learning, Human Demonstration, Policy Optimization
Abstract

Manipulation of complex objects with an anthropomorphic robot hand like a human hand is a challenge in the human-centric environment. In order to train the anthropomorphic robot hand which has a high degree of freedom (DoF), human demonstration augmented deep reinforcement learning policy optimization methods have been proposed. In this work, we first demonstrate augmentation of human demonstration in deep reinforcement learning (DRL) is effective for object manipulation by comparing the performance of the augmentation-free Natural Policy Gradient (NPG) and Demonstration Augmented NPG (DA-NPG). Then three DRL policy optimization methods, namely NPG, Trust Region Policy Optimization (TRPO), and Proximal Policy Optimization (PPO), have been evaluated with DA (i.e., DA-NPG, DA-TRPO, and DA-PPO) and without DA by manipulating six objects such as apple, banana, bottle, light bulb, camera, and hammer. The results show that DA-NPG achieved the average success rate of 99.33% whereas NPG only achieved 60%. In addition, DA-NPG succeeded grasping all six objects while DA-TRPO and DA-PPO failed to grasp some objects and showed unstable performances.


Statistics
Show / Hide Statistics

Statistics (Cumulative Counts from September 1st, 2017)
Multiple requests among the same browser session are counted as one view.
If you mouse over a chart, the values of data points will be shown.


Cite this article
[IEEE Style]
N. H. Park, J. H. Oh, G. H. Ryu, P. R. Lopez, E. V. Añazco, T. S. Kim, "Evaluation of Human Demonstration Augmented Deep Reinforcement Learning Policies via Object Manipulation with an Anthropomorphic Robot Hand," KIPS Transactions on Software and Data Engineering, vol. 10, no. 5, pp. 179-186, 2021. DOI: https://doi.org/10.3745/KTSDE.2021.10.5.179.

[ACM Style]
Na Hyeon Park, Ji Heon Oh, Ga Hyun Ryu, Patricio Rivera Lopez, Edwin Valarezo Añazco, and Tae Seong Kim. 2021. Evaluation of Human Demonstration Augmented Deep Reinforcement Learning Policies via Object Manipulation with an Anthropomorphic Robot Hand. KIPS Transactions on Software and Data Engineering, 10, 5, (2021), 179-186. DOI: https://doi.org/10.3745/KTSDE.2021.10.5.179.