add complex observation atari ppo #359

ttumiel · 2023-02-15T23:24:56Z

Description

Added handling of complex observations to atari_ppo.py. Closes #353
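As a rough illustration of the idea (a minimal sketch, not the code in this PR): a complex observation is a nested structure of arrays, and batching it means applying np.stack leaf by leaf. The helper `map_structure` below is a hypothetical pure-Python stand-in for `tree.map_structure` that only handles dicts, lists, and tuples, and the keys `image` and `state` are made up for the example.

```python
import numpy as np


def map_structure(fn, *structures):
    """Hypothetical stand-in for tree.map_structure (dicts, lists, tuples only)."""
    first = structures[0]
    if isinstance(first, dict):
        return {k: map_structure(fn, *(s[k] for s in structures)) for k in first}
    if isinstance(first, (list, tuple)):
        return type(first)(map_structure(fn, *items) for items in zip(*structures))
    # Leaf: apply fn to the corresponding leaves of every structure.
    return fn(*structures)


def stack_observations(observations):
    """Stack a list of identically structured observations leaf by leaf."""
    return map_structure(lambda *leaves: np.stack(leaves), *observations)


# Made-up dict observations, one per environment.
obs_list = [
    {"image": np.zeros((64, 64), dtype=np.uint8), "state": np.ones(3, dtype=np.float32)}
    for _ in range(4)
]
batch = stack_observations(obs_list)
print({k: v.shape for k, v in batch.items()})
```

Plain array observations pass through the same function unchanged, since a bare array is treated as a single leaf and stacked directly.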

I also wrote a JAX version for the #338 branch, which I can put in a separate PR once #338 is ready. It only needs two changes, both of which use JAX's tree_map.
https://gist.github.com/ttumiel/ee746d6292cecb47d390fb97c3ccfa5e

Tests

I wrote some tests for different observation types. I wasn't sure whether these belong in the test folder, since they mostly just demonstrate the functionality.

I also wrote a dummy complex observation wrapper to demonstrate handling a dict space in Atari: https://gist.github.com/ttumiel/c2132b424c49b76a62bafe7efef9923d
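To make the dict-space case concrete, here is a hedged sketch of how rollout storage could look when observations are a flat dict of fixed-shape arrays. It is an assumption about one possible layout, not the implementation in this PR; `obs_spec`, `store`, `flatten_and_index`, and the keys `image` and `state` are all hypothetical names.

```python
import numpy as np

num_steps, num_envs = 8, 4

# Hypothetical per-key shapes and dtypes of a Dict observation space.
obs_spec = {"image": ((64, 64), np.uint8), "state": ((3,), np.float32)}

# One (num_steps, num_envs, ...) buffer per key instead of a single array.
obs_buf = {
    key: np.zeros((num_steps, num_envs) + shape, dtype=dtype)
    for key, (shape, dtype) in obs_spec.items()
}


def store(step, batched_obs):
    """Write one step of batched dict observations into the buffers."""
    for key, value in batched_obs.items():
        obs_buf[key][step] = value


def flatten_and_index(indices):
    """Merge the step and env axes, then select a minibatch for every key."""
    return {
        key: buf.reshape((-1,) + buf.shape[2:])[indices]
        for key, buf in obs_buf.items()
    }


# Fill the buffers with dummy data whose value equals the step index.
for step in range(num_steps):
    store(
        step,
        {
            "image": np.full((num_envs, 64, 64), step, dtype=np.uint8),
            "state": np.full((num_envs, 3), step, dtype=np.float32),
        },
    )

minibatch = flatten_and_index(np.array([0, 5, 31]))
print({key: value.shape for key, value in minibatch.items()})
```

Every key is indexed with the same flat indices, so the entries of a minibatch stay aligned across the different parts of the observation.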

Speed

The tree.map_structure function adds about 10 µs of overhead per call compared with calling np.stack directly.

import gym, tree, numpy as np

o = gym.spaces.Box(0, 255, (64, 64))
x = [o.sample() for _ in range(10)]

%%timeit
o=np.stack(x)
# 15.8 µs ± 491 ns per loop (mean ± std. dev. of 7 runs, 100,000 loops each)

%%timeit
o=tree.map_structure(lambda *x: np.stack(x), *x)
# 26.5 µs ± 670 ns per loop (mean ± std. dev. of 7 runs, 10,000 loops each)

I (surprisingly?) got a slight speed increase when running BreakoutNoFrameskip. Locally I got about 470 SPS with complex tree_map vs 460 SPS on the original.

Questions

Maybe I should put the complex obs in the ppo.py file directly, instead of a new file?
Should I add a note in the docs?

Types of changes

Bug fix
New feature
New algorithm
Documentation

Checklist:

I've read the CONTRIBUTION guide (required).
I have ensured pre-commit run --all-files passes (required).
I have updated the documentation and previewed the changes via mkdocs serve.
I have updated the tests accordingly (if applicable).

If you are adding new algorithm variants or your change could result in performance difference, you may need to (re-)run tracked experiments. See #137 as an example PR.

vercel · 2023-02-15T23:24:59Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Comments	Updated
cleanrl	✅ Ready (Inspect)	Visit Preview	💬 Add your feedback	Feb 15, 2023 at 11:25PM (UTC)

ttumiel added 4 commits February 15, 2023 22:07

add complex observation atari ppo

27d528f

remove dummy wrapper

6849f41

lint

5960cf2

Merge branch 'master' into ppo-complex-obs

90cb2c8

vercel bot deployed to Preview February 15, 2023 23:25 View deployment
