Average success rate for five real-world trials. It can be seen that our method, residual recurrent TD3 with impedance controller, learns the task faster than others.