26 Oct 2020

We present a general model for actor-critic methods that allows combining value function estimates as a means to further reduce the variance of the policy gradient and improve the learning result. We demonstrate the potential of this architecture by implementing an example case that learns several of the PyBullet continuous control robotic tasks through OpenAI Gym. By experimenting with a special case, we show the effect of the external parameters on the overall performance of the policy optimization algorithm.
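
To make the core idea concrete, the sketch below is a minimal, hypothetical PyTorch illustration of using a combination of two value estimates as a variance-reducing baseline in a policy-gradient update. The two-critic design, the convex weight alpha, and all names here are illustrative assumptions, not the paper's exact architecture or hyperparameters.

```python
# Hypothetical sketch: policy gradient with a combined value baseline.
# The convex combination of two critics via `alpha` is an illustrative
# assumption, not the exact method presented in the paper.
import torch
import torch.nn as nn

class Actor(nn.Module):
    """Gaussian policy for continuous control."""
    def __init__(self, obs_dim, act_dim):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(obs_dim, 64), nn.Tanh(),
                                 nn.Linear(64, act_dim))
        self.log_std = nn.Parameter(torch.zeros(act_dim))

    def dist(self, obs):
        mean = self.net(obs)
        return torch.distributions.Normal(mean, self.log_std.exp())

class Critic(nn.Module):
    """State-value estimator V(s)."""
    def __init__(self, obs_dim):
        super().__init__()
        self.net = nn.Sequential(nn.Linear(obs_dim, 64), nn.Tanh(),
                                 nn.Linear(64, 1))

    def forward(self, obs):
        return self.net(obs).squeeze(-1)

def policy_gradient_loss(actor, critics, alpha, obs, act, returns):
    """Policy-gradient loss whose baseline is a convex combination of
    two critic estimates; a lower-variance baseline tightens the
    advantage estimate without biasing the gradient."""
    v1, v2 = critics[0](obs), critics[1](obs)
    baseline = alpha * v1 + (1.0 - alpha) * v2   # combined value estimate
    advantage = (returns - baseline).detach()    # baseline is not trained here
    logp = actor.dist(obs).log_prob(act).sum(-1)
    return -(logp * advantage).mean()

# Example usage with random tensors; in practice obs/act/returns come
# from rollouts in a PyBullet Gym environment such as "HopperBulletEnv-v0".
obs, act, returns = torch.randn(32, 8), torch.randn(32, 2), torch.randn(32)
actor, critics = Actor(8, 2), [Critic(8), Critic(8)]
loss = policy_gradient_loss(actor, critics, 0.5, obs, act, returns)
loss.backward()
```

In such a setup, the mixing weight alpha (or a learned equivalent) would be one of the external parameters whose effect on overall performance the experiments examine.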
