Advancements in Deep Reinforcement Learning: A Comparative Analysis of Policy Optimization Techniques

Balaraman Ravindran

doi:10.63282/3050-9246.IJETCSIT-V1I1P103

Authors

Balaraman Ravindran Assistant Professor, Department of Computer Science & Engineering, Loyola Institute of Technology, Chennai, India Author

DOI:

https://doi.org/10.63282/3050-9246.IJETCSIT-V1I1P103

Keywords:

Reinforcement Learning, Policy Optimization, Actor-Critic, Proximal Policy Optimization, Model-Based Learning, Deep Neural Networks, Sample Efficiency, Training Time, Advantage Estimation, Trust Region Methods

Abstract

Deep Reinforcement Learning (DRL) has emerged as a powerful framework for solving complex decision-making problems. This paper provides a comprehensive review and comparative analysis of various policy optimization techniques in DRL, including Policy Gradient Methods, Actor-Critic Algorithms, and Model-Based Approaches. We discuss the theoretical foundations, practical implementations, and recent advancements in each category. The paper also evaluates these techniques on a variety of benchmark tasks to highlight their strengths and limitations. Our analysis aims to provide insights into the current state of DRL and guide future research directions