Add XGLM models #14876

Merged

patil-suraj merged 49 commits into huggingface:master from patil-suraj:xglm

Jan 28, 2022

Contributor

patil-suraj commented Dec 22, 2021

What does this PR do?

This PR adds the XGLM model: code, paper

patil-suraj changed the title ~~Add XGLM models~~ [WIP] Add XGLM models

patil-suraj force-pushed the xglm branch 2 times, most recently from 52fb413 to 2fe52b4 Compare

December 28, 2021 12:30

patil-suraj requested review from patrickvonplaten, sgugger and LysandreJik

December 28, 2021 13:34

patrickvonplaten reviewed

src/transformers/models/xglm/modeling_flax_xglm.py (outdated, resolved)

patrickvonplaten reviewed

src/transformers/models/xglm/configuration_xglm.py (outdated, resolved)

patrickvonplaten reviewed

docs/source/model_doc/xglm.mdx

		in social value tasks such as hate speech detection in five languages and find it has limitations similar to comparable sized GPT-3 models.*


		This model was contributed by [Suraj](https://huggingface.co/valhalla). The original code can be found [here](https://github.com/pytorch/fairseq/tree/main/examples/xglm).

Contributor

patrickvonplaten Dec 29, 2021

Maybe we can add some prompt examples here - but happy to do this in a follow-up PR
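
A prompt example of the kind suggested above could be as simple as the following sketch. It only builds a few-shot prompt string in plain Python; the "input => output" template, the demonstration pairs, and the `build_few_shot_prompt` helper are hypothetical illustrations, not taken from the XGLM docs or paper. The resulting string would still need to be tokenized and passed to the model.

```python
from typing import List, Tuple


def build_few_shot_prompt(
    demonstrations: List[Tuple[str, str]],
    query: str,
    separator: str = "\n",
) -> str:
    """Join (input, output) demonstrations and a final query into one prompt.

    Hypothetical helper: the "input => output" template is an assumption
    chosen for illustration, not a format prescribed by XGLM.
    """
    lines = [f"{source} => {target}" for source, target in demonstrations]
    # The query line is left open-ended so the model continues after "=>".
    lines.append(f"{query} =>")
    return separator.join(lines)


prompt = build_few_shot_prompt(
    demonstrations=[("cheese", "fromage"), ("bread", "pain")],
    query="water",
)
print(prompt)
```

Keeping the template in one small function makes it easy to swap the separator or the demonstration language when trying prompts in other languages.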

patrickvonplaten reviewed

src/transformers/models/xglm/modeling_flax_xglm.py (resolved)

patrickvonplaten reviewed

src/transformers/models/xglm/modeling_flax_xglm.py (resolved)

patrickvonplaten reviewed

src/transformers/models/xglm/modeling_flax_xglm.py (outdated, resolved)

patrickvonplaten reviewed

src/transformers/models/xglm/configuration_xglm.py (outdated, resolved)

patrickvonplaten reviewed

src/transformers/models/xglm/modeling_flax_xglm.py (outdated, resolved)

patrickvonplaten reviewed

src/transformers/models/xglm/modeling_flax_xglm.py (outdated, resolved)

patrickvonplaten reviewed

src/transformers/models/xglm/modeling_xglm.py (resolved)

patrickvonplaten reviewed

tests/test_modeling_flax_xglm.py (resolved)

patrickvonplaten reviewed

tests/test_modeling_xglm.py (resolved)

patrickvonplaten approved these changes

Contributor

patrickvonplaten left a comment

Nice, looks very clean to me already!

sgugger approved these changes

Collaborator

sgugger left a comment

Thanks a lot for adding this new model!

docs/source/model_doc/xglm.mdx (outdated, resolved)

docs/source/model_doc/xglm.mdx (outdated, resolved)

src/transformers/__init__.py (outdated, resolved)

src/transformers/__init__.py (outdated, resolved)

src/transformers/__init__.py (outdated, resolved)

src/transformers/models/xglm/modeling_flax_xglm.py (outdated, resolved)

src/transformers/models/xglm/modeling_xglm.py (outdated, resolved)

src/transformers/models/xglm/modeling_xglm.py (outdated, resolved)

src/transformers/models/xglm/modeling_xglm.py (outdated, resolved)

tests/test_tokenization_xglm.py (outdated, resolved)

patil-suraj changed the title ~~[WIP] Add XGLM models~~ Add XGLM models

LysandreJik approved these changes

Member

LysandreJik left a comment

This is great, thanks @patil-suraj!

src/transformers/convert_slow_tokenizer.py

		@@ -910,6 +910,35 @@ def converted(self) -> Tokenizer:
		return tokenizer


		class XGLMConverter(SpmConverter):

Member

LysandreJik Jan 3, 2022

Nice!

src/transformers/models/xglm/configuration_xglm.py (outdated, resolved)

tests/test_modeling_flax_xglm.py

+        for model_class_name in self.all_model_classes:
+            model = model_class_name.from_pretrained("facebook/xglm-564M")
+            outputs = model(np.ones((1, 1)))
+            self.assertIsNotNone(outputs)

Member

LysandreJik Jan 3, 2022

Could there be an integration test here as well?

huggingface deleted a comment from github-actions bot

patil-suraj added the WIP label

patil-suraj added 6 commits

January 28, 2022 13:24

add xglm (07d9dbc)
update vocab size (2dd3e9d)
fix model name (0bd4a1b)
style and tokenizer (229f424)
typo (69a051b)
no mask token (6f7cb55)

patil-suraj and others added 24 commits

January 28, 2022 13:26

add tokenizer test (d607d08)
update checkpoint names (70bc407)
fix tokenizer tests (fd06165)
fix slow tests (4640a0c)
add copied from comments (a61cc98)
rst -> mdx (a142b37)
flax model (eacdb40)
update flax tests (9bfe482)
quality (0f86fdc)
style (8c0ce4c)
doc (d72ae18)
update index and readme (e144a93)
fix copies (5714c54)
fix doc (4370f25)
update toctrr (9704e27)
fix indent (2d9594a)
minor fixes (722db64)
fix config doc (46f8878)
don't save embed_pos weights (f4f5a0a)
Apply suggestions from code review (6c6e78d)
    Co-authored-by: Sylvain Gugger <35901082+sgugger@users.noreply.github.com>
    Co-authored-by: Patrick von Platen <patrick.v.platen@gmail.com>
address Sylvains commnets, few doc fixes (09ba1c2)
fix check_repo (27ce46f)
align order of arguments (33c53e5)
fix copies (d3bbe4f)

patil-suraj force-pushed the xglm branch from 3e0dab5 to d3bbe4f Compare

January 28, 2022 12:28

patil-suraj added 3 commits

January 28, 2022 16:48

fix labels (2dc4949)
remove unnecessary mapping (dacc786)
fix saving tokenizer (199e1b4)

patil-suraj merged commit d25e25e into huggingface:master

patil-suraj deleted the xglm branch

January 28, 2022 17:55
