
Smaller LitGPT-like configs #398

Closed
wants to merge 4 commits into from

Conversation

carmocca
Member

@carmocca carmocca commented May 10, 2024

What does this PR do?

Alternative to #219

Weight memory usage before:

Config               Parameters   Weight memory
gpt-neox-like           6532864   26.131456 MB
llama1-like              602432    2.409728 MB
long-context-like       4301120   17.20448 MB
llama2-like              602432    2.409728 MB
falcon-7b-like          4359040   17.43616 MB
falcon-40b-like         1589760    6.35904 MB
codellama2-like          819520    3.27808 MB
mixtral-like            3308800   13.2352 MB

After:

Config               Parameters   Weight memory
gpt-neox-like            141056    0.564224 MB
llama1-like              602432    2.409728 MB
long-context-like        602432    2.409728 MB
llama2-like              602432    2.409728 MB
falcon-7b-like            77700    0.3108 MB
falcon-40b-like          398080    1.59232 MB
codellama2-like          401568    1.606272 MB
mixtral-like             257856    1.031424 MB
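For readers checking the numbers: the MB column in both tables is consistent with fp32 weights, i.e. 4 bytes per parameter and decimal megabytes (1e6 bytes). A minimal sketch of that conversion (this helper is illustrative, not code from the PR):

```python
def weight_mb(num_params: int, bytes_per_param: int = 4) -> float:
    """Weight memory in decimal MB, assuming fp32 (4 bytes per parameter)."""
    return num_params * bytes_per_param / 1e6

# Parameter counts taken from the before/after tables above.
configs = {
    "gpt-neox-like": (6_532_864, 141_056),
    "llama2-like": (602_432, 602_432),
}

for name, (before, after) in configs.items():
    print(f"{name}: {weight_mb(before):.6f} MB -> {weight_mb(after):.6f} MB")
```

Note that some configs (e.g. llama2-like) are unchanged because they were already at the minimum size.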

@carmocca carmocca self-assigned this May 10, 2024
@carmocca carmocca force-pushed the carmocca/smaller-litgpt-test-configs branch from 25989b6 to 123df4f on May 10, 2024 at 15:38
@carmocca carmocca force-pushed the carmocca/smaller-litgpt-test-configs branch from 123df4f to bba9e86 on May 10, 2024 at 15:39
@t-vi
Collaborator

t-vi commented May 11, 2024

Thank you @carmocca, I don't think we need to go smaller than the current configs. Anything under 500 MB of GPU memory should be more than fine.

@carmocca
Member Author

Okay. We can always revisit this.

@carmocca carmocca closed this May 13, 2024
@carmocca carmocca deleted the carmocca/smaller-litgpt-test-configs branch May 13, 2024 10:29