RLPD-PyTorch

Introduction

This is a reproduction of an excellent ICML 2023 work that proposes combining an off-policy method with offline data during online learning. The idea is meaningful and can be applied to many scenarios.

We therefore modified and refactored the original code into a simple framework and ran a quick test on one environment, where it worked. Many parts are not yet polished, since we intend to apply the framework to specific tasks in the future, so some places are not very flexible and only provide simple examples. For instance, you can customize the environment, add different wrappers, or modify the networks to adapt the framework to different tasks (see the sketch below for the core idea).
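The core idea of RLPD is that every gradient step trains on a batch drawn half from the online replay buffer and half from the offline dataset (symmetric sampling), combined with a standard SAC-style off-policy update at a high update-to-data ratio. Below is a minimal sketch of that sampling step; the buffer and agent interfaces are illustrative and not this repository's actual API.

```python
# Minimal sketch (not this repository's actual code) of RLPD's symmetric sampling:
# each batch is 50% online data and 50% offline data, then used for a standard
# off-policy (SAC-style) update.
import numpy as np
import torch


def sample_symmetric_batch(online_buffer, offline_buffer, batch_size):
    """Draw half the batch from the online buffer and half from the offline data."""
    half = batch_size // 2
    online = online_buffer.sample(half)                 # dict of arrays: obs, action, reward, next_obs, done
    offline = offline_buffer.sample(batch_size - half)
    batch = {k: np.concatenate([online[k], offline[k]], axis=0) for k in online}
    return {k: torch.as_tensor(v, dtype=torch.float32) for k, v in batch.items()}


# Training loop outline (illustrative names only):
# for step in range(max_steps):
#     action = agent.act(obs)
#     next_obs, reward, done, _ = env.step(action)
#     online_buffer.add(obs, action, reward, next_obs, done)
#     for _ in range(utd_ratio):   # RLPD uses a high update-to-data ratio
#         agent.update(sample_symmetric_batch(online_buffer, offline_buffer, batch_size))
```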

For a more detailed description, you can read the original paper and source code.

If this is helpful to you, please consider giving it a star ⭐.

We trained and validated the framework on HalfCheetah-v2 and confirmed that it is effective (training curve figure in the repository).

Quick start

You can use the default configuration by running train.py directly, or change the configuration manually (for example, to train on a different environment; see the sketch below).

python train.py
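To train on another task, you would point the configuration at a different environment and its offline dataset. A minimal, illustrative sketch of loading an offline dataset with d4rl follows; the dataset name and how the data is fed into the framework's offline buffer are assumptions, not this repository's actual code.

```python
# Illustrative only: loading a D4RL offline dataset for the offline buffer.
# The dataset name below is an example; pick one matching your environment.
import gym
import d4rl  # importing d4rl registers its environments

env = gym.make("halfcheetah-medium-v2")
dataset = d4rl.qlearning_dataset(env)  # dict: observations, actions, rewards, next_observations, terminals
print(dataset["observations"].shape, dataset["actions"].shape)
```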

An evaluation script is also provided:

python eval.py --weight-path='your path'
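For reference, an evaluation run typically loads the saved policy weights and rolls out the policy for a few episodes. The sketch below is illustrative; the checkpoint path and actor interface are assumptions, not eval.py's actual logic.

```python
# Illustrative evaluation loop, not eval.py itself. Assumes the checkpoint
# stores an actor module that maps observations directly to actions.
import gym
import torch

env = gym.make("HalfCheetah-v2")
actor = torch.load("runs/example/actor.pt", map_location="cpu")  # hypothetical checkpoint path
actor.eval()

for episode in range(5):
    obs, done, episode_return = env.reset(), False, 0.0
    while not done:
        with torch.no_grad():
            action = actor(torch.as_tensor(obs, dtype=torch.float32)).numpy()
        obs, reward, done, _ = env.step(action)  # gym 0.17-style 4-tuple step
        episode_return += reward
    print(f"episode {episode}: return {episode_return:.1f}")
```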

Dependencies

numpy == 1.26.4
gym == 0.17.0
d4rl == 1.1
python == 3.10
tensorboardx == 2.6.2.2
torch == 2.2.2+cu118
mujoco-py == 1.50.1.0
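If it helps, the pins above can be collected into a requirements file, as in the sketch below. Note that the CUDA build of torch needs the PyTorch cu118 wheel index, and d4rl and mujoco-py usually require extra setup (d4rl is often installed from its GitHub repository, and mujoco-py needs a local MuJoCo installation).

```
# requirements.txt (sketch assembled from the list above)
numpy==1.26.4
gym==0.17.0
d4rl==1.1            # often installed from the D4RL GitHub repository instead
tensorboardx==2.6.2.2
torch==2.2.2+cu118   # requires the PyTorch cu118 wheel index
mujoco-py==1.50.1.0  # requires a local MuJoCo installation
# Python itself: 3.10
```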

Reference

Original paper: Efficient Online Reinforcement Learning with Offline Data (Ball et al., ICML 2023)

Source code: RLPD (the official implementation)
