
Finetune_LLMs

Overview

This repo contains code to fine-tune GPT-J-6B on a famous-quotes dataset. Originally, the repo downloaded and converted the model weights itself, before GPT-J had been added to the Hugging Face Transformers package; that code can still be seen on the original_youtube branch.

/quotes_dataset contains the dataset, properly formatted for fine-tuning. See the repo for building this dataset here.

/finetuning_repo contains code, originally from the repo here, that I have modified to work with GPT-J-6B.

Old Video Walkthroughs

See the old video here for a tutorial on the original repo code.

A more recent video covering the Hugging Face model can be seen here.

  1. First, create a conda environment and activate it.
  2. Run the ./install_requirements.sh script.
  3. Copy train.csv and validation.csv from /quotes_dataset into the /finetuning_repo folder.
  4. Run the fine-tuning code with the appropriate flags to fine-tune the model. See example_run.txt inside /finetuning_repo, and the sketch below.
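Putting those steps together, a minimal shell session might look like the following. The environment name finetune-gptj and the Python version are placeholders of mine, and the actual training flags live in example_run.txt rather than here.

conda create -n finetune-gptj python=3.10 -y
conda activate finetune-gptj
./install_requirements.sh
cp quotes_dataset/train.csv quotes_dataset/validation.csv finetuning_repo/
cd finetuning_repo
# now run the training command from example_run.txt with your chosen flags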

Updated Docker Walkthrough

The updated walkthrough uses NVIDIA Docker to take the headache out of much of the process.

Requirements

  1. A sufficient NVIDIA GPU (typically at least 24 GB of VRAM and support for fp16). If using cloud offerings, I recommend an A100; though it costs more, its speed and VRAM make up for it.
  2. A Linux machine. I recommend Ubuntu.
  3. A sufficiently modern version of Docker (when in doubt, update to the latest).
  4. nvidia-docker, to allow GPU passthrough to the Docker container. See the install guide here.
  5. Make sure you have the latest NVIDIA drivers installed. Check out the tool here. A quick sanity check for the whole stack is shown below.
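Once Docker and nvidia-docker are set up, you can verify that containers see your GPUs by running nvidia-smi inside a stock CUDA image; the exact tag below is only an example, and any recent CUDA base image should work.

docker run --rm --gpus all nvidia/cuda:12.2.0-base-ubuntu22.04 nvidia-smi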

CUDA Drivers Example

If you have a 64-bit Linux system and need drivers for an A100, you can run a command like this to get set up.

wget https://us.download.nvidia.com/tesla/515.86.01/NVIDIA-Linux-x86_64-515.86.01.run

You will then make the installer executable and run it with sudo.

chmod 777 NVIDIA-Linux-x86_64-515.86.01.run

sudo ./NVIDIA-Linux-x86_64-515.86.01.run
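After the installer finishes, nvidia-smi should report the new driver version.

nvidia-smi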

Usage

  1. First, build the Docker image by running build_image.sh. If you receive an error about not being able to find the base image, update to a newer CUDA version; the images are periodically deprecated. Then open a PR so the fix helps others. Building the Docker image can take many minutes.
  2. Run run_image.sh. This script runs the Docker image that was just built and mounts the current directory to /workspace inside the container. All GPUs in the system are passed through. Additionally, to avoid re-downloading models each time the container is run, your .cache is also passed through. A sketch of what the script amounts to is shown after this list.
  3. This image can now be used for fine-tuning a model with GPUs, or for DeepSpeed inference. Navigate to the relevant folder for more information.
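For reference, run_image.sh boils down to a docker run invocation along these lines; the image name finetune_llms is a placeholder for whatever tag build_image.sh gives the image.

docker run --gpus all -it --rm \
  -v $(pwd):/workspace \
  -v ~/.cache:/root/.cache \
  finetune_llms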

Notes

Hi, I am Adam, and I forked this project and made very minor tweaks. Here are some notes on errors I ran into that may help on your journey.

  • NVMe loading not working: install libaio-dev with both sudo apt-get AND conda. You should only need one, but I believe that otherwise it struggles to pick up the CFLAGS and LDFLAGS environment variables for some reason. Failing that, check those environment variables (see the first sketch after this list).
  • OOM error (error code 1; it doesn't tell you that it's an OOM error)? Add more swap space. 128 GB of RAM is not enough, even if you calculate it to be enough (swap commands below).
  • AdamW and Adam optimisers not working? Use NAdam with the option "zero_allow_untested_optimizer": true in the DeepSpeed config .json you're using (see the config sketch below).
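For the libaio fix, this is roughly what I mean; the conda-forge package name and the include/lib paths are assumptions for a typical conda environment.

sudo apt-get install libaio-dev
conda install -c conda-forge libaio
# if the build still cannot find libaio, point it at the conda copy (assumption)
export CFLAGS="-I$CONDA_PREFIX/include/"
export LDFLAGS="-L$CONDA_PREFIX/lib/"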
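For the swap-space fix, a standard way to add swap on Linux looks like this; adjust the size to your machine.

sudo fallocate -l 128G /swapfile
sudo chmod 600 /swapfile
sudo mkswap /swapfile
sudo swapon /swapfile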
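For the optimiser fix, a minimal DeepSpeed config fragment using NAdam might look like the following, written here as a shell heredoc; the file name and learning rate are placeholders, and the rest of your existing config stays as it was.

cat > ds_config_nadam.json <<'EOF'
{
  "zero_allow_untested_optimizer": true,
  "optimizer": {
    "type": "NAdam",
    "params": {
      "lr": 1e-5
    }
  }
}
EOF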
