
perform secondary fine-tuning on the basis of fine-tuning the first model #942

Closed
xjy2020 opened this issue May 6, 2024 · 4 comments

Comments

@xjy2020

xjy2020 commented May 6, 2024

How can I run a second round of fine-tuning on top of a model I have already fine-tuned once?

@ebsmothers
Contributor

@xjy2020 thanks for creating the issue. In general you should be able to run a second fine-tuning script using the output paths of your first fine-tune. So e.g. for llama3/8B_lora you would want to modify these lines of the config to point to the directory and filename(s) of your first fine-tuned checkpoint.
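For illustration, a minimal sketch of what that change might look like in the YAML config. The directories and the checkpoint filename here are placeholders, so point them at whatever your first run actually wrote out (e.g. the meta_model_0.pt that FullModelMetaCheckpointer saves per epoch):

```yaml
checkpointer:
  _component_: torchtune.utils.FullModelMetaCheckpointer
  # Point these at the outputs of the first fine-tuning run
  checkpoint_dir: /tmp/first_finetune/
  checkpoint_files: [meta_model_0.pt]  # filename written by the first run
  output_dir: /tmp/second_finetune/
  model_type: LLAMA3
```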

@bjohn22

bjohn22 commented May 11, 2024

Would the user also need to swap out torchtune.utils.FullModelMetaCheckpointer here, especially if the fine-tuned Llama3 was downloaded from HF?

@ebsmothers
Contributor

Hi @bjohn22 it depends on the type of checkpoint that you download. So FullModelMetaCheckpointer checkpoints can still be downloaded from HF. For instance the tune download command given for our Llama3-8B configs (see e.g. here) will download Meta format checkpoints from HF. In that case you would still use FullModelMetaCheckpointer. Note that for Llama3-8B-Instruct the same model page contains both HF format and Meta format checkpoints. See here -- the HF format weights are in the safetensors files, while the Meta format weights are under the subdirectory original/.
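To make the Meta-format case concrete, here is a minimal sketch of the checkpointer section for a checkpoint fetched with tune download (paths assume a /tmp/Meta-Llama-3-8B-Instruct output directory; adjust to your setup):

```yaml
checkpointer:
  _component_: torchtune.utils.FullModelMetaCheckpointer
  # Meta format weights live under original/ in the HF model repo
  checkpoint_dir: /tmp/Meta-Llama-3-8B-Instruct/original/
  checkpoint_files: [consolidated.00.pth]
  output_dir: /tmp/Meta-Llama-3-8B-Instruct/
  model_type: LLAMA3
```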

For other community fine-tuned checkpoints on the hub, it may vary, but I suspect many will be in HF format. Btw you can also read our checkpointing deep-dive which covers this topic in more detail.
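If you do end up with HF format weights (the safetensors files), a sketch of the corresponding config would swap in the HF checkpointer and list the safetensors shards instead. The filenames below assume the four-shard Llama3-8B-Instruct layout on the hub:

```yaml
checkpointer:
  _component_: torchtune.utils.FullModelHFCheckpointer
  checkpoint_dir: /tmp/Meta-Llama-3-8B-Instruct/
  checkpoint_files: [
    model-00001-of-00004.safetensors,
    model-00002-of-00004.safetensors,
    model-00003-of-00004.safetensors,
    model-00004-of-00004.safetensors
  ]
  output_dir: /tmp/Meta-Llama-3-8B-Instruct/
  model_type: LLAMA3
```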

@kartikayk
Contributor

@xjy2020 let us know if you have any more questions or if the comments were not helpful! I'm closing this issue, but please feel free to reopen if there are more follow-ups.
