Huggingface Integration #292

vwxyzjn · 2022-10-13T21:53:36Z

Description

This PR closes #110. https://huggingface.co/cleanrl/CartPole-v1-dqn-seed1 is an example model page.

Types of changes

Bug fix
New feature
New algorithm
Documentation

Checklist:

I've read the CONTRIBUTION guide (required).
I have ensured pre-commit run --all-files passes (required).
I have updated the documentation and previewed the changes via mkdocs serve.
I have updated the tests accordingly (if applicable).

If you are adding new algorithms or your change could result in performance difference, you may need to (re-)run tracked experiments. See #137 as an example PR.

vercel · 2022-10-13T21:53:39Z

The latest updates on your projects. Learn more about Vercel for Git ↗︎

Name	Status	Preview	Updated
cleanrl	✅ Ready (Inspect)	Visit Preview	Jan 4, 2023 at 8:20PM (UTC)

vwxyzjn · 2022-10-13T22:02:13Z

The integration also makes it easier to just run models, such as

cleanrl/cleanrl_utils/evals/dqn_eval.py

Lines 43 to 57 in 4074eee

    
           from huggingface_hub import hf_hub_download 
        
           from cleanrl.dqn import QNetwork, make_env 
        
           model_path = hf_hub_download(repo_id="cleanrl/CartPole-v1-dqn-seed1", filename="q_network.pth") 
        
           evaluate( 
        
               model_path, 
        
               make_env, 
        
               "CartPole-v1", 
        
               eval_episodes=10, 
        
               run_name=f"eval", 
        
               QNetwork=QNetwork, 
        
               device="cpu", 
        
               epsilon=0.05, 
        
               capture_video=False, 
        
           )

vwxyzjn · 2022-10-14T15:54:10Z

CC @ThomasSimonini for review :) Thanks!

kinalmehta

LGTM.

cleanrl_utils/evals/dqn_eval.py

simoninithomas

First, I want to thank you for this integration and all the work behind 🤗. The result is fantastic, especially the model card with visible hyperparameters.

I gave some insights based on Omar and Lucain and my review on how we can improve the push_to_hub part. I'm happy to help with this.**

In addition to this, we can:

Having a downstream part ( load_from_hub ), I can help with that.
Generating a json/yaml file containing the hyperparameters for reproducibility?
Adding the library to our Hub list so that it creates a tag for people searching for cleanrl models.

cleanrl_utils/huggingface.py

vwxyzjn · 2022-10-18T15:10:34Z

@kinalmehta @simoninithomas and @Wauplin, thanks for the review. The CommitOperation suggestion is really helpful. Regarding some further comments:

Having a downstream part ( load_from_hub ), I can help with that.
Appreciate the help! That said, we are only downloading a single file from the hub, so having a customized load_from_hub might be unnecessary, right?

cleanrl/cleanrl_utils/evals/dqn_eval.py

Line 48 in b430540

    
           model_path = hf_hub_download(repo_id="cleanrl/CartPole-v1-dqn-seed1", filename="q_network.pth")

Generating a json/yaml file containing the hyperparameters for reproducibility?

Are you thinking of loading from the yaml file somehow to run the script like python dqn.py --load-yaml hyper.yaml?

Adding the library to our Hub list so that it creates a tag for people searching for cleanrl models.

That would be great! Thank you!

simoninithomas · 2022-10-20T08:09:07Z

Hi @vwxyzjn , yes for yaml I was thinking what you mentioned.

That said, we are only downloading a single file from the hub, so having a customized load_from_hub might be unnecessary, right?

Yes and no, because it has two advantages:

We are able to count how many download of the model each month.
We can cache the model without using hf_hub_download directly.

For instance with SB3 integration here's the code for load_from_hub:

def load_from_hub(repo_id: str, filename: str) -> str:
    """
    Download a model from Hugging Face Hub.
    :param repo_id: id of the model repository from the Hugging Face Hub
    :param filename: name of the model zip file from the repository
    """
    try:
        from huggingface_hub import hf_hub_download
    except ImportError:
        raise ImportError(
            "You need to install huggingface_hub to use `load_from_hub`. "
            "See https://pypi.org/project/huggingface-hub/ for installation."
        )

    # Get the model from the Hub, download and cache the model on your local disk
    downloaded_model_file = hf_hub_download(
        repo_id=repo_id,
        filename=filename,
        library_name="huggingface-sb3",
        library_version="2.1",
    )

    return downloaded_model_file

simoninithomas · 2022-10-24T10:31:49Z

FIY From our side, we started to work on the frontend integration 🤗
huggingface/hub-docs#447

vwxyzjn · 2022-10-24T21:42:05Z

Thank you @simoninithomas

We are able to count how many download of the model each month.

Does this mean hf_hub_download(repo_id="cleanrl/CartPole-v1-dqn-seed1", filename="q_network.pth") would not trigger the download stats?

We can cache the model without using hf_hub_download directly.

Does hf_hub_download not cache models? I ran the dqn_eval.py and noticed the download progress bar only presents once and it did not appear again during the second run, so I assumed hf_hub_download caches automatically.

FIY From our side, we started to work on the frontend integration 🤗
huggingface/hub-docs#447

Awesome thanks! :)

Wauplin · 2022-10-25T09:25:27Z

Hi @vwxyzjn

Does this mean hf_hub_download(repo_id="cleanrl/CartPole-v1-dqn-seed1", filename="q_network.pth") would not trigger the download stats?

I'll let @simoninithomas answer on that as I am 100% sure what is counted in # downloads / month.
Worth noticing that the example from @simoninithomas uses 2 kwargs library_name and library_version to make the Hub know which lib is downloading the model (e.g. a cleanrl user and not a random user).

Does hf_hub_download not cache models?

Yes it does ! No matter if you use hf_hub_download or snapshot_download , your files will be downloaded only once.

vwxyzjn · 2023-01-04T20:23:08Z

@simoninithomas @kinalmehta @Wauplin thanks so much for helping with this PR. I think everything looks good at this point. We also have a good notebook ready to go https://colab.research.google.com/github/vwxyzjn/cleanrl/blob/hf-integration/docs/get-started/CleanRL_Huggingface_Integration_Demo.ipynb. Documentation can be previewed at https://cleanrl-git-hf-integration-vwxyzjn.vercel.app/get-started/zoo/ (the embed link is broken in it because it's pointing to the master branch).

vwxyzjn · 2023-01-12T16:03:13Z

Merging this as is, subjecting to future PRs. We'd also probably use huggingface/blog#616 to make the announcement. Thanks for the great work, folks!

Wauplin · 2023-01-12T17:29:15Z

Congrats ! That was a big piece of work 🎉🎉

simoninithomas · 2023-01-16T16:13:32Z

Congratulations 👏 I was off at the end of last week. I'm preparing the blogpost for next week and we're going to have a unit using CleanRL on PPO with Edward and me using GodotRL we will have the PR this week I'll mention you to put you in the loop.

vwxyzjn added 2 commits October 13, 2022 17:51

initial commit

1b585d6

pre-commit

fa82356

vwxyzjn requested review from kinalmehta and dosssman October 13, 2022 21:53

Add hub integration

4074eee

vercel bot deployed to Preview October 13, 2022 22:00 View deployment

pre-commit

4436ce4

vercel bot deployed to Preview October 14, 2022 14:53 View deployment

kinalmehta approved these changes Oct 15, 2022

View reviewed changes

cleanrl_utils/evals/dqn_eval.py Show resolved Hide resolved

simoninithomas reviewed Oct 17, 2022

View reviewed changes

cleanrl_utils/huggingface.py Outdated Show resolved Hide resolved

cleanrl_utils/huggingface.py Outdated Show resolved Hide resolved

use CommitOperation

df41e3d

vercel bot deployed to Preview October 18, 2022 14:45 View deployment

Fix pre-commit

a98383d

vercel bot deployed to Preview October 18, 2022 14:48 View deployment

refactor

b430540

vercel bot deployed to Preview October 18, 2022 14:51 View deployment

Merge branch 'master' into hf-integration

dd8ee86

vercel bot deployed to Preview October 18, 2022 18:34 View deployment

simoninithomas mentioned this pull request Oct 23, 2022

(Do not merge yet) Add CleanRL to Hub Documentation huggingface/hub-docs#447

Closed

push changes

8144562

vercel bot deployed to Preview October 27, 2022 00:21 View deployment

refactor

2f20e17

vwxyzjn added 2 commits January 3, 2023 10:57

support capture video

4a1f72a

Add notebook

7f22c25

vercel bot deployed to Preview January 3, 2023 15:57 View deployment

update docs

5331287

vercel bot deployed to Preview January 3, 2023 16:02 View deployment

kinalmehta added 2 commits January 4, 2023 08:27

support c51_atari and c51_atari_jax

9aec97e

Merge remote-tracking branch 'origin/hf-integration' into hf-integration

bc8c014

vercel bot deployed to Preview January 4, 2023 02:57 View deployment

kinalmehta added 2 commits January 4, 2023 16:58

typo fix

b202985

add c51 to zoo docs

54fd64a

vercel bot deployed to Preview January 4, 2023 11:29 View deployment

add colab badge

9e5841b

vercel bot deployed to Preview January 4, 2023 20:10 View deployment

fix broken colab svg

9178763

vercel bot deployed to Preview January 4, 2023 20:12 View deployment

pypi release

07961f4

vercel bot deployed to Preview January 4, 2023 20:17 View deployment

vwxyzjn added 2 commits January 4, 2023 15:18

typo fix

c09a80d

update pre-commit

a18ffdb

vercel bot deployed to Preview January 4, 2023 20:18 View deployment

remove hf-integration reference

ba7053a

vercel bot deployed to Preview January 4, 2023 20:20 View deployment

vwxyzjn requested a review from simoninithomas January 4, 2023 20:21

vwxyzjn merged commit 30381ee into master Jan 12, 2023

vwxyzjn mentioned this pull request Jan 12, 2023

Qdagger: Reincarnate RL #344

Merged

20 tasks

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Huggingface Integration #292

Huggingface Integration #292

vwxyzjn commented Oct 13, 2022 •

edited

vercel bot commented Oct 13, 2022 •

edited

vwxyzjn commented Oct 13, 2022

vwxyzjn commented Oct 14, 2022

kinalmehta left a comment

simoninithomas left a comment

vwxyzjn commented Oct 18, 2022 •

edited

simoninithomas commented Oct 20, 2022

simoninithomas commented Oct 24, 2022

vwxyzjn commented Oct 24, 2022

Wauplin commented Oct 25, 2022 •

edited

vwxyzjn commented Jan 4, 2023

vwxyzjn commented Jan 12, 2023

Wauplin commented Jan 12, 2023

simoninithomas commented Jan 16, 2023 •

edited

Huggingface Integration #292

Huggingface Integration #292

Conversation

vwxyzjn commented Oct 13, 2022 • edited

Description

Types of changes

Checklist:

vercel bot commented Oct 13, 2022 • edited

vwxyzjn commented Oct 13, 2022

vwxyzjn commented Oct 14, 2022

kinalmehta left a comment

Choose a reason for hiding this comment

simoninithomas left a comment

Choose a reason for hiding this comment

vwxyzjn commented Oct 18, 2022 • edited

simoninithomas commented Oct 20, 2022

simoninithomas commented Oct 24, 2022

vwxyzjn commented Oct 24, 2022

Wauplin commented Oct 25, 2022 • edited

vwxyzjn commented Jan 4, 2023

vwxyzjn commented Jan 12, 2023

Wauplin commented Jan 12, 2023

simoninithomas commented Jan 16, 2023 • edited

vwxyzjn commented Oct 13, 2022 •

edited

vercel bot commented Oct 13, 2022 •

edited

vwxyzjn commented Oct 18, 2022 •

edited

Wauplin commented Oct 25, 2022 •

edited

simoninithomas commented Jan 16, 2023 •

edited