Type hints #293

timoklein · 2022-10-14T07:25:11Z

Description

As discussed on Discord, I've done basic type hints for PPO and DQN. Everything checks out with mypy 0.982 (mypy cleanrl/ppo.py --show-error-codes --ignore-missing-imports). Of course we can have a discussion about whether that's the checker that will be used if we go further down the road of implementing this. I'll put comments in noteworthy places.

The tests fail because tuple[int] or list[int] only works from Python 3.9 on.

Types of changes

Bug fix
New feature
New algorithm
Documentation

Checklist:

I've read the CONTRIBUTION guide (required).
I have ensured pre-commit run --all-files passes (required).
I have explained note-worthy implementation details.
Update poetry dependencies
Type checker config.

vercel · 2022-10-14T07:25:16Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Updated
cleanrl	✅ Ready (Inspect)	Visit Preview	Oct 14, 2022 at 7:25AM (UTC)

timoklein · 2022-10-14T07:29:26Z

cleanrl/ppo.py

@@ -159,15 +166,24 @@ def get_action_and_value(self, x, action=None):
    # env setup
    envs = gym.vector.SyncVectorEnv(
        [make_env(args.env_id, args.seed + i, i, args.capture_video, run_name) for i in range(args.num_envs)]
-    )
+    )  # type:ignore[abstract]


SyncVectorEnv inherits from VectorEnv which inherits from Env. For older gym versions (I'm currently on 0.23.1), Env is an ABC with abstract method render that is not overriden by any of the vector envs. Since it's fixed in the newest gym release and not our issue, I'm ignoring here.

timoklein · 2022-10-14T07:37:26Z

cleanrl/ppo.py

+    # Handling gym shapes being Optionals (variant 1)
+    # Personally i'd prefer the asserts
+    assert isinstance(envs.single_observation_space.shape, tuple), "shape of observation space must be defined"
+    assert isinstance(envs.single_action_space.shape, tuple), "shape of action space must be defined"
+
+    # Handling gym shapes being Optionals (variant 2)
+    # Once could also cast inside each call but in my eyes that's not conducive to readability
+    obs_space_shape = cast(tuple[int, ...], envs.single_observation_space.shape)
+    action_space_shape = cast(tuple[int, ...], envs.single_action_space.shape)


Gym spaces can in theory return None shapes. Mypy will complain about this when concatenating the shape tuples later.

Option 1 is to use a cast either once here or every time the spaces are accessed. I don't think that's very readable.

Option 2 is to assert that the space shapes are tuples. Doing it once here fixes all errors for the rest of the code. Since there's an assert in this place already anyway I think this is the better option.

Option 1 is more preferrable

timoklein · 2022-10-14T07:39:29Z

cleanrl/dqn.py

@@ -92,11 +96,11 @@ def __init__(self, env):
            nn.Linear(84, env.single_action_space.n),
        )

-    def forward(self, x):
+    def forward(self, x: torch.FloatTensor) -> torch.FloatTensor:


I've used FloatTensor here but we should just use torch.Tensor if type hints are pursued further.

This is fine

vwxyzjn

Thank you! The PR looks good. I have left some comments

vwxyzjn · 2022-10-18T14:37:47Z

cleanrl/dqn.py

@@ -82,7 +83,10 @@ def thunk():

 # ALGO LOGIC: initialize agent here:
 class QNetwork(nn.Module):
-    def __init__(self, env):
+


Could you remove this space?

vwxyzjn · 2022-10-18T14:37:55Z

cleanrl/dqn.py

@@ -92,11 +96,11 @@ def __init__(self, env):
            nn.Linear(84, env.single_action_space.n),
        )

-    def forward(self, x):
+    def forward(self, x: torch.FloatTensor) -> torch.FloatTensor:


This is fine

vwxyzjn · 2022-10-18T14:40:27Z

cleanrl/ppo.py

+    # Handling gym shapes being Optionals (variant 1)
+    # Personally i'd prefer the asserts
+    assert isinstance(envs.single_observation_space.shape, tuple), "shape of observation space must be defined"
+    assert isinstance(envs.single_action_space.shape, tuple), "shape of action space must be defined"
+
+    # Handling gym shapes being Optionals (variant 2)
+    # Once could also cast inside each call but in my eyes that's not conducive to readability
+    obs_space_shape = cast(tuple[int, ...], envs.single_observation_space.shape)
+    action_space_shape = cast(tuple[int, ...], envs.single_action_space.shape)


Option 1 is more preferrable

add ppo and dqn type hints

3a9b148

vercel bot deployed to Preview October 14, 2022 07:25 View deployment

timoklein commented Oct 14, 2022

View reviewed changes

timoklein marked this pull request as draft October 14, 2022 07:41

vwxyzjn reviewed Oct 20, 2022

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Type hints #293

Type hints #293

timoklein commented Oct 14, 2022 •

edited

vercel bot commented Oct 14, 2022 •

edited

timoklein Oct 14, 2022 •

edited

timoklein Oct 14, 2022

vwxyzjn Oct 18, 2022

timoklein Oct 14, 2022

vwxyzjn Oct 18, 2022

vwxyzjn left a comment

vwxyzjn Oct 18, 2022

vwxyzjn Oct 18, 2022

vwxyzjn Oct 18, 2022

Type hints #293

Are you sure you want to change the base?

Type hints #293

Conversation

timoklein commented Oct 14, 2022 • edited

Description

Types of changes

Checklist:

vercel bot commented Oct 14, 2022 • edited

timoklein Oct 14, 2022 • edited

Choose a reason for hiding this comment

timoklein Oct 14, 2022

Choose a reason for hiding this comment

vwxyzjn Oct 18, 2022

Choose a reason for hiding this comment

timoklein Oct 14, 2022

Choose a reason for hiding this comment

vwxyzjn Oct 18, 2022

Choose a reason for hiding this comment

vwxyzjn left a comment

Choose a reason for hiding this comment

vwxyzjn Oct 18, 2022

Choose a reason for hiding this comment

vwxyzjn Oct 18, 2022

Choose a reason for hiding this comment

vwxyzjn Oct 18, 2022

Choose a reason for hiding this comment

timoklein commented Oct 14, 2022 •

edited

vercel bot commented Oct 14, 2022 •

edited

timoklein Oct 14, 2022 •

edited