ppo_pytorch :

A simple implementation of Clipped Proximal Policy Optimization in pytorch that runs in gym envs. This library also contains some weird additions, shortcuts and experimental stuff like truncated distributions, fixed std on the policy network (suprisingly works quite well) and full episode rollouts so it may not always marry up precisely with openai baselines.

This guy has been training for 50409 16 episode rollouts in the episode shown he scored 284.6

Installation:

ideally make yourself a virtualenv so i don't fuck up your torch install or whatever and then do:

 git clone https://github.com/leaprovenzano/ppo_pytorch.git
 pip install -e ppo_pytorch

Super basic example :

COMING SOON ... a notebook or something, soz!

Name		Name	Last commit message	Last commit date
Latest commit History 92 Commits
gifs		gifs
ppo_pytorch		ppo_pytorch
tests		tests
.gitignore		.gitignore
README.md		README.md
requirements.txt		requirements.txt
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

gifs

gifs

ppo_pytorch

ppo_pytorch

tests

tests

.gitignore

.gitignore

README.md

README.md

requirements.txt

requirements.txt

setup.py

setup.py

Repository files navigation

ppo_pytorch :

Installation:

Super basic example :

About

Releases

Packages

Languages

leaprovenzano/ppo_pytorch

Folders and files

Latest commit

History

Repository files navigation

ppo_pytorch :

Installation:

Super basic example :

About

Resources

Stars

Watchers

Forks

Languages