Conversion to HF checkpoint should generate a checkpoint format that can be loaded directly #1359

awaelchli · 2024-04-25T18:49:53Z

The conversion we have with litgpt convert to a huggingface checkpoint creates a model.pth file. But then you have to load it like so as described in the tutorial:

import torch
from transformers import AutoModel

state_dict = torch.load("output_dir/model.pth")
model = AutoModel.from_pretrained(
    "output_dir/", local_files_only=True, state_dict=state_dict
)

But we should make it work like this:

model = AutoModel.from_pretrained("output_dir")

The only blocker for this is that from_pretrained requires the pytorch_model.bin to be loaded with weights_only=True. Our checkpoints don't satisfy this constraint, because we save checkpoints using the incremental pickle save. See #1357 for more context where we had to work around this.

The text was updated successfully, but these errors were encountered:

rasbt · 2024-04-25T19:40:42Z

Yes I agree, there were a few people on Discord recently struggling with this

carmocca added enhancement New feature or request conversion For model weight-map conversions labels Apr 25, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Conversion to HF checkpoint should generate a checkpoint format that can be loaded directly #1359

Conversion to HF checkpoint should generate a checkpoint format that can be loaded directly #1359

awaelchli commented Apr 25, 2024 •

edited

rasbt commented Apr 25, 2024

Conversion to HF checkpoint should generate a checkpoint format that can be loaded directly #1359

Conversion to HF checkpoint should generate a checkpoint format that can be loaded directly #1359

Comments

awaelchli commented Apr 25, 2024 • edited

rasbt commented Apr 25, 2024

awaelchli commented Apr 25, 2024 •

edited