[rllib]State input's shape about custom env based on gym's env #10024

lsylusiyao · 2020-08-10T12:59:58Z

What is your question?

I've created a custom env, which basically looks like this.

import gym
import numpy as np

class CustomEnv(gym.Env):  # class name is a placeholder
    def __init__(self):
        self.action_space = gym.spaces.Discrete(9)
        self.observation_space = gym.spaces.Tuple((
            gym.spaces.MultiBinary((5, 4)),
            gym.spaces.Box(
                low=np.zeros(2, dtype=np.float32),
                high=np.array([100, 100], dtype=np.float32),
            ),
        ))

    def reset(self):
        a = np.zeros((5, 4), dtype=np.int32)
        b = np.array([100, 100], dtype=np.float32)
        return a, b

    def step(self, action):
        # I've debugged and it seems that this function is never entered,
        # so I believe the two functions above should be enough.
        # ... (body omitted)
        # `states` has the same structure as (a, b) in `reset`.
        return states, reward, done, {}

The problem is that when I wrote reset like this, line 375 in modelv2.py, _unpack_obs(obs, obs_space.original_space, tensorlib=tensorlib), raised: reshape(): argument 'shape' must be tuple of ints, but found element of type tuple at pos 2. I later found that I needed to add a new axis to each ndarray, so I added a = a[np.newaxis, :] and b = b[np.newaxis, :] in reset.

However, after doing this, it fails again on line 60 in preprocessors.py, if not self._obs_space.contains(observation), with the error "Observation outside expected value range" (I'm sure it's not caused by the actual values in the Box). I stepped into self._obs_space.contains(observation) in tuple.py and found that the problem occurs on line 28, space.contains(part) for (space, part) in zip(self.spaces, x), which evaluates to [True, False].
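A possible reading of the [True, False] result is that the Box check compares the array's shape against the space's shape, so an observation with an extra leading axis is rejected even when every value is in range. The sketch below illustrates that idea with NumPy only; `box_contains` is a hypothetical stand-in written for this example, not gym's actual implementation.

```python
import numpy as np


def box_contains(x, low, high):
    """Hypothetical shape-strict membership check (stand-in, not gym's code)."""
    x = np.asarray(x)
    # The shape must match exactly, and every element must lie within bounds.
    return bool(x.shape == low.shape and np.all(x >= low) and np.all(x <= high))


low = np.zeros(2, dtype=np.float32)
high = np.array([100, 100], dtype=np.float32)

b = np.array([100, 100], dtype=np.float32)   # shape (2,), as in the original reset
b_batched = b[np.newaxis, :]                 # shape (1, 2), after adding a new axis

result_plain = box_contains(b, low, high)
result_batched = box_contains(b_batched, low, high)

print(b.shape, result_plain)
print(b_batched.shape, result_batched)
```

Under this assumption, the values are fine in both cases and only the added axis causes the rejection, which would match a failure that is unrelated to the Box bounds.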

I'm new to gym and rllib, and I'm a little confused about the expected input. Any suggestions? Thanks.

Ray version and other system information (Python version, TensorFlow version, OS):
Ray: 0.8.6; Python 3.8.5; PyTorch 1.6.0; Windows 10 2004


sven1977 · 2020-08-10T16:27:36Z

Hey @lsylusiyao thanks for filing this. Yeah, looks like a bug or at least something we don't support yet (MultiBinary space). We should have a Bernoulli distribution or MultiBernoulli for these cases. ...

lsylusiyao · 2020-08-11T02:08:37Z

However, MultiBinary seems to be sufficient for my problem, and the output of sample is what I need. The problem could still be in the dimensions or in the Box.

lsylusiyao · 2020-08-11T11:41:12Z

Oh, and today I tried switching from Tuple to Dict, and the same problem occurred in the same way at the same place.

sven1977 · 2020-08-11T14:47:24Z

Yes, that could be. I'm currently adding a better test case for complex action spaces (including MultiBinary components). Some special combinations do fail currently (even w/o MultiBinary). ...

lsylusiyao · 2020-08-12T14:38:15Z

Thanks to @panda361, I finally solved this problem. It seems that the bug is on the Gym side rather than in Ray, so I made a pull request for it:
#2023 for Gym
Also, thanks to @sven1977 for helping me.
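For readers wondering how a tuple argument to MultiBinary could lead to the earlier reshape error, here is a minimal sketch. It assumes (this is not confirmed in the thread) that the space stored its shape by wrapping the argument as `(n,)`, which turns `(5, 4)` into the nested shape `((5, 4),)`. Both `naive_shape` and `normalized_shape` are hypothetical helpers written for this illustration, and NumPy's reshape stands in for the tensor library's.

```python
import numpy as np


def naive_shape(n):
    # Hypothetical: wrap n in a 1-tuple, as an int-only constructor might do.
    return (n,)


def normalized_shape(n):
    # Hypothetical fix: accept either an int or a tuple/list of ints.
    return tuple(n) if isinstance(n, (tuple, list)) else (n,)


nested = naive_shape((5, 4))       # a tuple nested inside the shape
flat = normalized_shape((5, 4))    # a flat tuple of ints

batch = np.zeros(20, dtype=np.int8)  # one flattened 5x4 observation

try:
    batch.reshape([-1] + list(nested))
    reshape_failed = False
except (TypeError, ValueError):
    # reshape needs ints for every dimension; a nested tuple is rejected.
    reshape_failed = True

reshaped = batch.reshape([-1] + list(flat))
print(nested, flat, reshape_failed, reshaped.shape)
```

With a flat shape, the batch axis is added by the reshape itself, so the observation returned from reset would not need a manual new axis.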

lsylusiyao added the question Just a question :) label Aug 10, 2020

lsylusiyao changed the title ~~[rllib]State input about custom env based on gym's env~~ [rllib]State input's shape about custom env based on gym's env Aug 10, 2020

sven1977 self-assigned this Aug 10, 2020

sven1977 added enhancement Request for new feature and/or capability P3 Issue moderate in impact or severity rllib labels Aug 10, 2020

lsylusiyao mentioned this issue Aug 12, 2020

Add support for tuple input on MultiBinary space openai/gym#2023

Merged

lsylusiyao closed this as completed Aug 12, 2020
