Skip to content

Implementation of Proximal Policy Optimization in Pytorch.

Notifications You must be signed in to change notification settings

leaprovenzano/ppo_pytorch

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

92 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

ppo_pytorch :

A simple implementation of Clipped Proximal Policy Optimization in pytorch that runs in gym envs. This library also contains some weird additions, shortcuts and experimental stuff like truncated distributions, fixed std on the policy network (suprisingly works quite well) and full episode rollouts so it may not always marry up precisely with openai baselines.

gif

This guy has been training for 50409 16 episode rollouts in the episode shown he scored 284.6

Installation:

  • ideally make yourself a virtualenv so i don't fuck up your torch install or whatever and then do:
 git clone https://github.com/leaprovenzano/ppo_pytorch.git
 pip install -e ppo_pytorch

Super basic example :

COMING SOON ... a notebook or something, soz!

About

Implementation of Proximal Policy Optimization in Pytorch.

Resources

Stars

Watchers

Forks

Packages

No packages published

Languages