TF: GPT-J compatible with XLA generation #17986

gante · 2022-07-01T17:18:33Z

What does this PR do?

This PR modifies TF GPT-J to be compatible with XLA generation. It borrows the new code from FLAX: previously, the embedded positions (sincos) were computed at each call from the size of the sequence (which could be obtained from the size of the past). Now the model pre-computes the embedded positions once and gathers them given the position_ids.
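To illustrate the idea, here is a minimal, framework-agnostic sketch in plain Python rather than the actual TF code from this PR. The names `create_sinusoidal_positions`, `gather_positions`, `MAX_POSITIONS`, and `ROTARY_DIM` are assumptions chosen for the example, and the sin/cos column layout is one plausible convention. The table is built once, and each call only looks up rows by position id, so the amount of work per call no longer depends on the length of the past.

```python
import math


def create_sinusoidal_positions(num_pos, dim):
    """Build a [num_pos, dim] table: first dim // 2 columns are sin, the rest cos."""
    # One inverse frequency per pair of channels (assumed layout for this sketch).
    inv_freq = [1.0 / (10000 ** (i / dim)) for i in range(0, dim, 2)]
    table = []
    for pos in range(num_pos):
        angles = [pos * f for f in inv_freq]
        table.append([math.sin(a) for a in angles] + [math.cos(a) for a in angles])
    return table


def gather_positions(table, position_ids):
    """Look up one pre-computed row per position id."""
    return [table[p] for p in position_ids]


# Hypothetical sizes, chosen small for illustration only.
MAX_POSITIONS = 16
ROTARY_DIM = 4

# Pre-computed once, e.g. when the layer is built.
TABLE = create_sinusoidal_positions(MAX_POSITIONS, ROTARY_DIM)

# Prompt pass: positions 0..2 are looked up together.
prompt_rows = gather_positions(TABLE, [0, 1, 2])

# Decoding step: a single new token at position 3. Nothing is recomputed
# from the length of the past; only the position id changes.
step_rows = gather_positions(TABLE, [3])
```

In a TF model the lookup would presumably be a gather over a constant tensor, which keeps the shapes seen by the compiled function independent of how many tokens have already been generated.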

⚠️ The integration tests are disabled with @tooslow, due to the size of the model. I've reworked the tests BEFORE touching GPT-J code, to test all needed features correctly. All but the XLA test were passing before GPT-J was changed, and all tests pass after the changes. We still have two XLA tests being run in CI frequently (test_xla_generate_fast and test_xla_generate_slow), as well as a couple of generic generate tests -- they just don't use the trained model weights.

gante · 2022-07-01T17:19:58Z

Related issue: #17935

HuggingFaceDocBuilderDev · 2022-07-01T17:29:58Z

The documentation is not available anymore as the PR was closed or merged.

patrickvonplaten

Looks good to me!

gante added 3 commits July 1, 2022 16:42

tmp commit

8ca86ba

XLA GPT-J

9c99ec9

add missing test config option

bbd7c4b

gante requested review from patrickvonplaten and Rocketknight1 July 1, 2022 17:19

patrickvonplaten approved these changes Jul 4, 2022


gante merged commit 360719a into huggingface:main Jul 6, 2022

gante deleted the xla_gptj branch July 6, 2022 14:02

viclzhu pushed a commit to viclzhu/transformers that referenced this pull request Jul 18, 2022

TF: GPT-J compatible with XLA generation (huggingface#17986)

e0ffe75

njhill mentioned this pull request Mar 10, 2023

Fix position embeddings for GPT-J and CodeGen #22069

Merged

5 tasks
