
YAML configuration for mlx_lm.lora #503

Merged · 19 commits into ml-explore:main · Mar 8, 2024
Conversation

chimezie (Contributor):

Changes the LoRA tuning module to use a YAML configuration, with defaults taken from the original command-line parameters.

Supersedes #235

awni (Member) left a comment:

Thanks for getting this started! I suggest a few changes (which I'm happy to help with):

  1. Keep support for the existing command line arguments. It can be nice to run with just the CLI and it's useful to be back-compatible here IMO.
  2. Allow a config arg for the yaml.
  3. I think a nice behavior is to overwrite flags from the config with the CLI (so prefer the command line to the same parameter set in the config). Makes it easy to experiment without needing to update the config. Another option which I think is ok is to simply disallow setting both (if the config is provided then you can't also provide the same flag on the CLI).

> The main command is `mlx_lm.lora`. The argument is a YAML file with the training parameters in the following format: […]

awni (Member) commented on Mar 3, 2024:

Let's keep the LORA.md as it was. Instead of putting this here, let's put an example config in the examples directory (call it lora_config.yaml). https://github.com/ml-explore/mlx-examples/tree/main/llms/mlx_lm/examples

awni (Member) added:

And maybe link to the example config in this readme and mention how to use it.

chimezie (Contributor, Author) replied:

Ok. I reverted LORA.md, added the example YAML, and added a mention of it in LORA.md.
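
For reference, a config in that spirit might look like the following sketch; the keys mirror the long-form CLI flags, but the exact contents of the merged lora_config.yaml may differ:

```yaml
# Hypothetical sketch of a lora_config.yaml; values are illustrative,
# not necessarily those in the merged example.
model: "mistralai/Mistral-7B-v0.1"  # model path or HF repo (assumed value)
train: true                  # run LoRA fine-tuning
data: "data/"                # directory containing train/valid/test .jsonl
batch_size: 4
iters: 1000
learning_rate: 1e-5          # needs the float resolver discussed below
steps_per_eval: 200
adapter_file: "adapters.npz" # matches CONFIG_DEFAULTS discussed below
```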

```python
from mlx.utils import tree_flatten

from .tuner.trainer import TrainingArgs, TrainingCallback, evaluate, train
from .tuner.utils import linear_to_lora_layers
from .utils import load

yaml_loader = yaml.SafeLoader
yaml_loader.add_implicit_resolver(
```
awni (Member) asked:

What is the purpose of this?
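
For context: by default, PyYAML's SafeLoader resolves scientific-notation scalars such as `1e-5` as strings rather than floats, because its stock float resolver requires a "." in e-notation. Widening the implicit float resolver is the common workaround. A sketch of that pattern, not necessarily the exact arguments used in this PR:

```python
import re

import yaml

yaml_loader = yaml.SafeLoader
# Without this, "learning_rate: 1e-5" would load as the string "1e-5".
# The widened regex lets plain scientific notation resolve to float.
yaml_loader.add_implicit_resolver(
    "tag:yaml.org,2002:float",
    re.compile(
        r"""^(?:
          [-+]?(?:[0-9][0-9_]*)\.[0-9_]*(?:[eE][-+]?[0-9]+)?
        | [-+]?(?:[0-9][0-9_]*)(?:[eE][-+]?[0-9]+)
        | \.[0-9_]+(?:[eE][-+][0-9]+)?
        | [-+]?\.(?:inf|Inf|INF)
        | \.(?:nan|NaN|NAN)
        )$""",
        re.X,
    ),
    list("-+0123456789."),
)

assert isinstance(yaml.load("lr: 1e-5", Loader=yaml_loader)["lr"], float)
```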

awni (Member) commented on Mar 6, 2024:

@chimezie do you plan to come back to this? Let me know if I can help here!

chimezie (Contributor, Author) commented on Mar 6, 2024:

Yes, I was working on it a few hours ago and intended to finish it tomorrow (with some questions).

chimezie (Contributor, Author) commented on Mar 6, 2024:

> 1. Keep support for the existing command line arguments. It can be nice to run with just the CLI and it's useful to be back-compatible here IMO.

I agree. I have updated this.

> 2. Allow a config arg for the yaml.

There was already a positional argument for the config (it is now an optional positional argument). Is it fine as it is, or did you want it as an optional argument like (for example) -c/--config?

> 3. I think a nice behavior is to overwrite flags from the config with the CLI (so prefer the command line to the same parameter set in the config). Makes it easy to experiment without needing to update the config. Another option which I think is ok is to simply disallow setting both (if the config is provided then you can't also provide the same flag on the CLI).

Ok. I went with the first option, so the CLI flags take precedence.

awni (Member) commented on Mar 6, 2024:

Thank you for all the updates!

> There was already a positional argument for the config (it is now an optional positional argument). Is it fine as it is, or did you want it as an optional argument like (for example) -c/--config?

I think it would be better as a flag (`-c/--config`) because it is more consistent with all the other arguments.
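
A minimal sketch of such a flag in lora.py's parser; the help text and the surrounding parser setup are assumed, not taken from the PR:

```python
import argparse

parser = argparse.ArgumentParser(description="LoRA fine-tuning with MLX.")
# Optional flag replacing the earlier positional config argument.
parser.add_argument(
    "-c",
    "--config",
    type=str,
    help="Path to a YAML file with the training parameters",
)
# ... the remaining training flags (--model, --iters, etc.) would go here ...
args = parser.parse_args()
```

With a flag rather than a positional argument, an invocation like `mlx_lm.lora -c lora_config.yaml --learning-rate 2e-5` reads consistently with the other options and keeps the CLI-overrides-YAML semantics visible.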

chimezie (Contributor, Author) commented on Mar 6, 2024:

Ok. I have updated the config option accordingly. I also ended up having to remove the defaults from the CLI argument definitions because they were overriding the YAML-based options due to their taking precedence. Currently, the order of precedence is:

  1. CLI options (if specified)
  2. YAML options (if specified)
  3. Defaults in CONFIG_DEFAULTS

It is ready for your review.

chimezie (Contributor, Author) commented on Mar 6, 2024:

One tricky issue came up afterwards.

Apparently, even if you define options this way:

```python
import argparse

parser = argparse.ArgumentParser(description="")
parser.add_argument("--sum", help="", required=False)
args = parser.parse_args()

if __name__ == "__main__":
    print(args.__dict__)
```

The argument dict will still have an entry for the 'sum' option, mapped to None. So, as it stands, if -c/--config is not provided, all the defaults will be None, even for CLI options that were not specified. What I had in mind was the following, though I'm reluctant for no better reason than a feeling that argparse should exclude options that were not specified on the CLI if they are defined as not required:

```python
if args.config:
    # [..]
else:
    args.__dict__.update(
        {
            arg: CONFIG_DEFAULTS[arg]
            for arg, value in args.__dict__.items()
            if value is None and arg != "config"
        }
    )
```

Thoughts?
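
Judging by the follow-up commits listed below ("Prevent None values of unspecified CLI options from overwriting values from CONFIG_DEFAULTS"), the resolution was essentially the complementary filter: start from the defaults, layer the YAML values on top, then apply only the CLI values that are not None. A sketch of that merge order, with an assumed helper name and an abbreviated CONFIG_DEFAULTS:

```python
import yaml

# Abbreviated; the real CONFIG_DEFAULTS covers every training option.
CONFIG_DEFAULTS = {
    "adapter_file": "adapters.npz",
    "steps_per_eval": 200,
}

def resolve_options(args) -> dict:
    # resolve_options is a hypothetical helper illustrating the precedence.
    options = dict(CONFIG_DEFAULTS)  # 3. lowest precedence: defaults
    if args.config:
        with open(args.config) as f:
            options.update(yaml.safe_load(f) or {})  # 2. YAML config values
    # 1. highest precedence: CLI flags the user actually passed.
    # argparse maps unspecified optional flags to None (see above),
    # so None values must not clobber YAML or default values.
    options.update(
        {k: v for k, v in vars(args).items() if v is not None and k != "config"}
    )
    return options
```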

```yaml
# Number of training steps between validations.
"steps_per_eval": 200

# Load path to resume training with the given adapter weights.
```
Another contributor commented:

Can you just set it to adapter.npz to make sure it works out of the box?

chimezie (Contributor, Author) replied:

That file is just an example of the format of the YAML configuration, and mlx_lm/lora.py pulls its defaults from the CONFIG_DEFAULTS defined there:

```python
CONFIG_DEFAULTS = {
    "adapter_file": "adapters.npz",
    # [...]
}
```

Since 'adapters.npz' is already the default for adapter_file there, if --adapter-file is not specified on the command line or in the configuration (as "adapter_file"), then adapters.npz will be the default.

awni (Member) left a comment:

Thanks for the additions!

awni merged commit 8c2cf66 into ml-explore:main on Mar 8, 2024, with 3 checks passed.
devonthomas35 pushed a commit to devonthomas35/mlx-examples that referenced this pull request on Mar 11, 2024:
* Convert mlx_lm.lora to use YAML configuration

* pre-commit run fixes

* Fix loading of config file

* Remove invalid YAML from doc

* Update command-line options and YAML parameter overriding, per feedback in ml-explore#503

* Minor wording change

* Positional argument

* Moved config to a (-c/--config) flag

* Removed CLI option defaults (since CLI options take precedence and their defaults are in CONFIG_DEFAULTS)

* pre-commit format updates

* Fix handling of CLI option defaults

* Prevent None values of unspecified CLI options from overwriting values from CONFIG_DEFAULTS

* nits

---------

Co-authored-by: Awni Hannun <awni@apple.com>
chimezie deleted the yaml-config branch on March 11, 2024 at 15:37.