From State Compression to State Emphasis: Continuous Aggregation for Policy Gradient Optimization

Published: 9 May 2026| Version 1 | DOI: 10.17632/rs2pt2r6rm.1
Contributor:
Shuo Zhao

Description

This repository implements Hierarchical Proximal Policy Optimization (HPPO) and standard Proximal Policy Optimization (PPO) algorithms using Ray RLlib framework for reinforcement learning experiments. Requirements The project requires the following dependencies: gymnasium==1.1.1 ray==2.10.0 torch==2.0.1 Installation Install the required packages using pip: pip install gymnasium==1.1.1 ray==2.10.0 torch==2.0.1 For ALE/Atari environments, you may also need: pip install ale-py Project Structure HPPO-master/ ├── main_HPPO.py # HPPO algorithm implementation ├── main_PPO.py # PPO algorithm implementation ├── ray_result_HPPO_all.zip # HPPO training results ├── ray_result_PPO_all.zip # PPO training results └── README.md # This file

Files

Institutions

Categories

Reinforcement Learning

Licence