Hi there 👋
This PR adds functionality to dequantize a model that was initially quantized, in case anyone wants to use these weights elsewhere.

The logic is to dequantize to `float16` dtype, which is hardcoded in `quant_state`, and then cast to the compute dtype, i.e. the one that was used for the trainable LoRA parameters and data inputs. This is exactly how it's done in BNB.
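For illustration, here is a minimal sketch of that logic, assuming the quantized weights live in a `bnb.nn.Linear4bit` layer; the helper name `dequantize_and_cast` is hypothetical and not part of this PR:

```python
import torch
import bitsandbytes as bnb

def dequantize_and_cast(module: bnb.nn.Linear4bit, compute_dtype: torch.dtype) -> torch.Tensor:
    """Hypothetical helper: recover a dense weight from a 4-bit quantized layer.

    dequantize_4bit first yields the dtype recorded in `quant_state`
    (float16), which is then cast to `compute_dtype`, matching the dtype
    of the trainable LoRA parameters and data inputs.
    """
    dense = bnb.functional.dequantize_4bit(
        module.weight.data, quant_state=module.weight.quant_state
    )
    return dense.to(compute_dtype)
```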
Note: the PR is not yet finalized as, for completeness, I also want to add dequantization for `bnb.int8`.