FSD50K Speech Model Fine-tuning Tutorial #201

Open
wants to merge 19 commits into base: main

Conversation


@FlorentMeyer commented Oct 22, 2022

Before submitting

  • Was this discussed/approved via a GitHub issue? (not needed for typos and docs improvements)
  • Did you make sure to update the docs?
  • Did you write any new necessary tests?

What does this PR do?

Add FSD50K Speech Model Fine-tuning Tutorial.

PR review

Did you have fun?

A lot 🙃

@codecov

codecov bot commented Oct 22, 2022

Codecov Report

Merging #201 (9794cfc) into main (89b94ba) will not change coverage.
Report is 1 commit behind head on main.
The diff coverage is n/a.

Additional details and impacted files
@@         Coverage Diff         @@
##           main   #201   +/-   ##
===================================
  Coverage    73%    73%           
===================================
  Files         2      2           
  Lines       382    382           
===================================
  Hits        280    280           
  Misses      102    102           

@rohitgr7
Contributor

hey @FlorentMeyer, mind checking the file you uploaded? It looks like it's too big and there might be some redundant stuff in it. Could you clean it up?

@Borda
Member

Borda commented Oct 25, 2022

not sure what happened, but GH does not want to show me the diff :/

@FlorentMeyer
Author

FlorentMeyer commented Oct 26, 2022

Good evening,

I should mention that the code in the converted notebook was exactly the same as in this Colab notebook (with the !pip installs removed). I had also kept the outputs, but from reading other people's examples I gather that the outputs printed inside the docs are the ones obtained by running the converted .py notebooks on your side.

My last commit therefore makes these changes to the linked Colab notebook:

  • remove all cell outputs
  • remove the conditions around bash instructions (a single bash command as the only body of an if was causing a syntax error due to the absence of Python code; see the sketch at the end of this comment)
  • remove the %% magics
  • comment out the tensorboard cell (which was responsible for creating such a large file; I am sorry I hadn't checked it before)

Changes to the .yaml file:

  • add gdown as a requirement
  • remove the brackets around my name (I didn't know they were special characters)

I'm just not sure whether the Pandas dataframes with the audio players will get rendered.
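
For reference, the problematic pattern looked roughly like this (a sketch only; the directory name and file ID are placeholders, not the actual ones from the notebook):

# Original Colab cell (illustration):
#     if not os.path.exists("FSD50K.dev_audio"):
#         !gdown <file_id>
# The `!` shell escape is not valid Python in the jupytext-converted .py,
# which leaves the `if` without a body and triggers the syntax error.

# A plain-Python equivalent that survives the conversion:
import os
import subprocess

if not os.path.exists("FSD50K.dev_audio"):
    subprocess.run(["gdown", "<file_id>"], check=True)  # placeholder ID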

# ## Compute metrics

# %% id="zlTooqqp8FWk"
mAP_micro = average_precision_score(

Author

At the time I wrote the code, torchmetrics.functional.average_precision's target took "integer labels" and therefore did not accept multi-hot labels. Just let me check whether this has been fixed and whether I get the same results as with scikit-learn!

Author

OK, the new implementations of multilabel_average_precision give the same results as scikit-learn
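
For the record, a quick check along these lines (a sketch; shapes and values are made up, only to illustrate the comparison):

import torch
from sklearn.metrics import average_precision_score
from torchmetrics.functional.classification import multilabel_average_precision

num_classes = 5
preds = torch.rand(8, num_classes)              # predicted scores
target = torch.randint(0, 2, (8, num_classes))  # multi-hot labels

sk_map = average_precision_score(target.numpy(), preds.numpy(), average="micro")
tm_map = multilabel_average_precision(preds, target, num_labels=num_classes, average="micro")
print(sk_map, tm_map.item())  # the two values should agree up to floating-point precision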

Contributor

let's use that :)

# ## Compute metrics

# %% id="zlTooqqp8FWk"
mAP_micro = average_precision_score(
Contributor

also, it looks like you have true values for preds; I'd recommend using test_step instead to show the metrics.

Author

What do you mean? Something like this?

With on_step=False, on_epoch=True to only log at the end of the epoch, according to https://pytorch-lightning.readthedocs.io/en/stable/extensions/logging.html#logging-from-a-lightningmodule:

The above config for validation applies for test hooks as well.

Suggested change
mAP_micro = average_precision_score(
# In __init__ (num_classes is the number of FSD50K labels):
self.mAP = torchmetrics.classification.MultilabelAveragePrecision(num_labels=num_classes)
# In test_step:
self.mAP(preds, y)
self.log('mAP', self.mAP, on_step=False, on_epoch=True)
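
Spelled out a bit more, it could look like this (a sketch; the class name, num_classes, and the backbone are placeholders, not taken from the notebook):

import torch
from pytorch_lightning import LightningModule
from torchmetrics.classification import MultilabelAveragePrecision

class FSD50KClassifier(LightningModule):
    def __init__(self, backbone, num_classes=200):
        super().__init__()
        self.backbone = backbone  # the fine-tuned speech model (placeholder)
        # The multilabel metric needs the number of labels up front.
        self.mAP = MultilabelAveragePrecision(num_labels=num_classes, average="micro")

    def test_step(self, batch, batch_idx):
        x, y = batch
        preds = torch.sigmoid(self.backbone(x))
        self.mAP(preds, y)
        # on_step=False / on_epoch=True: accumulate over the whole test set
        # and log a single epoch-level value.
        self.log("mAP", self.mAP, on_step=False, on_epoch=True)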

Author

Then the class version of the torchmetrics metric should be preferred over the functional one, I'd say?

Contributor

okay, it's fine; let's use functional metrics here since you already have all the targets and predictions.
Modular metrics are useful when you are aggregating the metrics, say at the step level.
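
A minimal sketch of the two styles (illustrative names and shapes, not from the notebook):

import torch
from torchmetrics.functional.classification import multilabel_average_precision
from torchmetrics.classification import MultilabelAveragePrecision

num_classes = 200  # FSD50K label count
all_preds = torch.rand(1000, num_classes)
all_targets = torch.randint(0, 2, (1000, num_classes))

# Functional: all targets and predictions are already materialised, compute once.
map_once = multilabel_average_precision(all_preds, all_targets, num_labels=num_classes, average="micro")

# Modular: accumulate batch by batch (e.g. inside test_step) and read out at the end.
metric = MultilabelAveragePrecision(num_labels=num_classes, average="micro")
for preds, targets in zip(all_preds.split(100), all_targets.split(100)):
    metric.update(preds, targets)
map_accumulated = metric.compute()  # matches map_once, since the metric stores all preds/targets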

@akihironitta added the Example Example / Demo / Tutorial label Oct 27, 2022
Co-authored-by: Rohit Gupta <rohitgr1998@gmail.com>
@Borda changed the title from "Upload whole project." to "FSD50K Speech Model Fine-tuning Tutorial" Nov 4, 2022
@FlorentMeyer
Author

FlorentMeyer commented Nov 7, 2022

I see there are still problems with:

  1. the git+<my_repo> in requirements.txt
  2. the docs build saying some cells are missing IDs, even though I used jupytext as requested

@FlorentMeyer
Author

OK, I also saw that there were bizarre things happening in the notebook: it looks like the pre-commit hooks are moving stuff around, causing duplication every time I pull their changes into my own code before being able to push again (example), and it's easy to miss things when reading a notebook as a .py file.

Anyway, I read the whole file carefully and this should be fixed now. Also, all cells have an ID, so I'm not sure where the "cells are missing IDs" error comes from :/

@FlorentMeyer removed the request for review from kaushikb11 November 12, 2022 09:53
@FlorentMeyer
Author

A small bump!

# name: python3
# ---

# %% [markdown] id="CI0JECKA9AnY"
Member

let's remove all the IDs

@Borda assigned ethanwharris and unassigned rohitgr7 Dec 16, 2022
@mergify bot requested a review from Borda April 6, 2024 19:30
Labels
Example Example / Demo / Tutorial

5 participants