
Add Wav2Vec2 & Hubert ForSequenceClassification #13153

Merged
merged 11 commits into from Aug 27, 2021

Conversation

@anton-l (Member) commented Aug 17, 2021

What does this PR do?

This adds a Hubert extension for sequence classification.
Ultimately, this classification head should be compatible with the s3prl UtteranceLevel implementation, in order to support classification tasks from SUPERB (such as Keyword Spotting) and to transfer their pretrained models.
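As a rough illustration of the utterance-level head described above (a sketch in the spirit of s3prl's UtteranceLevel module, not the PR's actual code; all names and dimensions are made up): frame-level encoder states are pooled over time into a single utterance vector, which a linear layer maps to class logits.

```python
import numpy as np

# Illustrative sketch only: mean-pooled utterance-level classification head.
rng = np.random.default_rng(0)
hidden_size, num_labels, num_frames = 8, 4, 50

# Fake frame-level hidden states from a speech encoder: (frames, hidden)
hidden_states = rng.standard_normal((num_frames, hidden_size))

# Mean-pool over the time axis to get one utterance-level vector
pooled = hidden_states.mean(axis=0)      # (hidden,)

# Linear classification head: logits = pooled @ W + b
W = rng.standard_normal((hidden_size, num_labels))
b = np.zeros(num_labels)
logits = pooled @ W + b                  # (num_labels,)

predicted_class = int(np.argmax(logits))
```

For padded batches one would mean-pool only over non-padding frames (using the attention mask), but the single-utterance case above shows the core idea.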

Before submitting

  • This PR fixes a typo or improves the docs (you can dismiss the other checks if that's the case).
  • Did you read the contributor guideline,
    Pull Request section?
  • Was this discussed/approved via a GitHub issue or the forum? Please add a link
    to it if that's the case.
  • Did you make sure to update the documentation with your changes? Here are the
    documentation guidelines, and
    here are tips on formatting docstrings.
  • Did you write any new necessary tests?

Who can review?

@patrickvonplaten @patil-suraj

…add-speech-classification

# Conflicts:
#	src/transformers/models/hubert/configuration_hubert.py
#	src/transformers/models/hubert/convert_hubert_original_s3prl_checkpoint_to_pytorch.py
#	src/transformers/models/hubert/modeling_hubert.py
#	tests/test_modeling_hubert.py
#	utils/check_repo.py
@anton-l anton-l changed the base branch from master to hubert-test August 25, 2021 10:59
@anton-l anton-l changed the base branch from hubert-test to master August 25, 2021 11:00
@@ -122,6 +122,8 @@
"TFRagTokenForGeneration",
"Wav2Vec2ForCTC",
"HubertForCTC",
"Wav2Vec2ForSequenceClassification",
Contributor
Yes! We have to discuss a bit with @Narsil how to best add those models to pipelines

Contributor

# Update this list for models that are not in any of the auto MODEL_XXX_MAPPING. Being in this list is an exception and
# should **not** be the rule.

Seems like the exception has grown quite a bit :)

Contributor

@patrickvonplaten left a comment

This PR looks to be in very good shape already!
Before merging it would be great if we could:

  • add a # Copied from Wav2Vec2 ... comment to the HuBERT code if it's 1-to-1 the same
  • add one test per task for Wav2Vec2 as well
  • add at least one model for each task to either https://huggingface.co/superb or facebook (let's check with others here)
  • run eval of the models on the datasets to check which models should be normalized and which shouldn't and adapt configs accordingly
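The last bullet amounts to a small sweep: run eval once per normalization setting and keep the better one in the config. A hypothetical sketch (the predictions below are faked stand-ins; a real run would use the actual models and SUPERB datasets):

```python
# Hedged sketch of the requested eval sweep; not the repository's code.
def accuracy(predictions, labels):
    """Fraction of predictions that match the reference labels."""
    assert len(predictions) == len(labels)
    return sum(p == l for p, l in zip(predictions, labels)) / len(labels)

# Toy stand-ins for model outputs under each normalization setting
labels = [0, 1, 1, 0, 2]
preds_normalized = [0, 1, 1, 0, 1]   # e.g. outputs with normalize=True
preds_raw = [0, 1, 1, 0, 2]          # e.g. outputs with normalize=False

# Pick the setting with the higher accuracy and record it for the config
best = max([(accuracy(preds_normalized, labels), True),
            (accuracy(preds_raw, labels), False)])
do_normalize = best[1]
```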

@anton-l (Member, Author) commented Aug 27, 2021

Accuracy evaluation on SUPERB tasks:

  • KS (Keyword Spotting) has uniform-length samples, so no padding
  • ER (Emotion Recognition) has non-uniform padded batches
  • SID (Speaker Identification) is evaluated with batch_size=1, as in s3prl
| Task | Model         | normalize=True | normalize=False | Paper  |
|------|---------------|----------------|-----------------|--------|
| KS   | Wav2Vec2-base | 0.9627         | 0.9643          | 0.9623 |
| KS   | Hubert-base   | 0.9669         | 0.9672          | 0.9630 |
| ER   | Wav2Vec2-base | 0.5281         | 0.6258          | 0.6343 |
| ER   | Hubert-base   | 0.5502         | 0.6359          | 0.6492 |
| SID  | Wav2Vec2-base | 0.7360         | 0.7518          | 0.7518 |
| SID  | Hubert-base   | 0.8071         | 0.8071          | 0.8142 |

So far normalize=False is always better, as expected (s3prl never used normalization during eval).
There's also some slight variation from the official results, but it's of the same magnitude as the difference between s3prl and the paper.
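For context, the normalize toggle above corresponds roughly to per-utterance zero-mean/unit-variance scaling of the raw waveform before it enters the model. A minimal sketch of that behavior (illustrative only, not the library's exact code; the eps value is an assumption):

```python
import numpy as np

def normalize_waveform(waveform, do_normalize=True, eps=1e-7):
    """Scale a raw waveform to zero mean and unit variance (sketch)."""
    waveform = np.asarray(waveform, dtype=np.float64)
    if not do_normalize:
        return waveform                       # pass through unchanged
    return (waveform - waveform.mean()) / np.sqrt(waveform.var() + eps)

raw = np.array([0.5, -0.25, 0.75, 0.0])
norm = normalize_waveform(raw, do_normalize=True)
```

Whether this scaling helps depends on how the upstream model was pretrained, which is exactly why the eval above sweeps both settings per model.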

@anton-l anton-l changed the title [WIP] Add HubertForSequenceClassification [WIP] Add Wav2Vec2 & Hubert ForSequenceClassification Aug 27, 2021
@anton-l (Member, Author) commented Aug 27, 2021

  • Passed integration test for all 4 tasks on both models
  • Added # Copied from comments where possible (the script just inserts a full copy of W2V2.forward() before End copy, so I didn't use it there)
  • Added dummy examples to forward() docs
  • Moved the models to https://huggingface.co/superb

@patrickvonplaten everything should be ready to merge now :)

@anton-l anton-l changed the title [WIP] Add Wav2Vec2 & Hubert ForSequenceClassification Add Wav2Vec2 & Hubert ForSequenceClassification Aug 27, 2021
@patrickvonplaten (Contributor) commented:
Awesome job @anton-l ! Feel free to merge the PR whenever you want

@anton-l anton-l merged commit b6f332e into huggingface:master Aug 27, 2021
@anton-l anton-l deleted the add-speech-classification branch September 8, 2021 21:06
3 participants