
Minimal loading pretrain BEiT model #373

Closed
thunanguyen opened this issue Aug 1, 2021 · 3 comments

Comments

@thunanguyen

I want to ask whether there is a minimal way to load a pre-trained BEiT model, because I want to use BEiT as a backbone in my research on multi-label classification. I don't need all the code in this repo, only the loading code for BEiT, but the current loading code is quite confusing to me. I think you should provide a torch hub model so that everyone can access your model more easily.

@donglixp
Contributor

donglixp commented Aug 4, 2021

Hi @thunanguyen, thanks for the good suggestion! Hugging Face is working on merging BEiT models into their model hub (huggingface/transformers#12994 (comment)). That would make it much easier to load the checkpoints with user-friendly APIs.

donglixp closed this as completed Aug 4, 2021
@NielsRogge

BEiT has been merged: https://huggingface.co/transformers/master/model_doc/beit.html :)
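As an illustration of the "minimal loading" the thread asks about, here is a sketch using the BEiT classes in transformers. The model identifier microsoft/beit-base-patch16-224 and the example image URL are assumptions for illustration, not taken from this thread, and the exact class names may differ across transformers versions.

# Minimal sketch of loading a pre-trained BEiT backbone with Hugging Face transformers.
# Assumes a transformers release that includes BEiT support and the publicly hosted
# "microsoft/beit-base-patch16-224" checkpoint.
from PIL import Image
import requests
import torch
from transformers import BeitFeatureExtractor, BeitModel

# Load the image preprocessor and the backbone weights.
feature_extractor = BeitFeatureExtractor.from_pretrained("microsoft/beit-base-patch16-224")
model = BeitModel.from_pretrained("microsoft/beit-base-patch16-224")

# Example RGB image; any image works here.
url = "http://images.cocodataset.org/val2017/000000039769.jpg"
image = Image.open(requests.get(url, stream=True).raw)

# Preprocess and run a forward pass to get patch-level hidden states,
# which can serve as backbone features for a downstream multi-label head.
inputs = feature_extractor(images=image, return_tensors="pt")
with torch.no_grad():
    outputs = model(**inputs)
features = outputs.last_hidden_state  # shape: (batch, num_patches + 1, hidden_size)

For fine-tuning on a classification task, BeitForImageClassification can be loaded the same way instead of BeitModel.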

@thunanguyen
Author

thunanguyen commented Aug 4, 2021

Great! I will try it immediately! One more question, though: do you think using a Squeeze layer, as in SqueezeBERT, could make the model smaller and faster, @donglixp? Your large model is quite big.
