Add XLM-V to Model Doc #21498
Conversation
The documentation is not available anymore as the PR was closed or merged.
Thanks for adding this!
docs/source/en/model_doc/xlm-v.mdx
> Large multilingual language models typically rely on a single vocabulary shared across 100+ languages.
> As these models have increased in parameter count and depth, vocabulary size has remained largely unchanged.
> This vocabulary bottleneck limits the representational capabilities of multilingual models like XLM-R.
> In this paper, we introduce a new approach for scaling to very large multilingual vocabularies by
> de-emphasizing token sharing between languages with little lexical overlap and assigning vocabulary capacity
> to achieve sufficient coverage for each individual language. Tokenizations using our vocabulary are typically
> more semantically meaningful and shorter compared to XLM-R. Leveraging this improved vocabulary, we train XLM-V,
> a multilingual language model with a one million token vocabulary. XLM-V outperforms XLM-R on every task we
> tested, ranging from natural language inference (XNLI), question answering (MLQA, XQuAD, TyDiQA), and
> named entity recognition (WikiAnn) to low-resource tasks (Americas NLI, MasakhaNER).
Should be in italics.
Fixed :)
library had to be converted.
- The `XLMTokenizer` implementation is used to load the vocab and performs tokenization.

This model was contributed by [stefan-it](https://huggingface.co/stefan-it), including detailed experiments with XLM-V on downstream tasks.
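Since the converted checkpoint lives on the Hub under the `facebook/xlm-v-base` identifier mentioned later in this thread, a minimal usage sketch might look like the following. This assumes the checkpoint resolves through the standard `Auto*` classes (as XLM-R-style masked language models usually do); it is an illustration, not part of the added doc page.

```python
from transformers import AutoModelForMaskedLM, AutoTokenizer

# Assumed Hub identifier, taken from this PR discussion.
checkpoint = "facebook/xlm-v-base"

# AutoTokenizer resolves the concrete tokenizer class from the checkpoint config.
tokenizer = AutoTokenizer.from_pretrained(checkpoint)
model = AutoModelForMaskedLM.from_pretrained(checkpoint)

# Tokenize a sentence and run a forward pass.
inputs = tokenizer("Multilingual models share one vocabulary.", return_tensors="pt")
outputs = model(**inputs)

# The last logits dimension equals the (roughly one million token) vocabulary size.
print(outputs.logits.shape)
```

The large vocabulary is visible directly in the logits: the final dimension matches `model.config.vocab_size`, which is what distinguishes XLM-V from the much smaller XLM-R vocabulary.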
Could you point to one canonical checkpoint on the Hub as well?
Thanks, I added a reference to facebook/xlm-v-base :)
CI is failing, I'm going to read the Flan-T5 PR (#19892) to see how it should be done!
You just need to add the model type (same as what you picked for the page in the doc) and name in the configuration_auto file. The PR you mention also does it :-)
Failures are unrelated to this PR, so merging!
* doc: introduce new section for XLM-V model
* doc: mention more details for XLM-V integration
* docs: paper abstract in italics, model identifier for base model added
* doc: mention new XLM-V support
* auto: add XLM-V mapping
* doc: run make fix-copies ;)
Hi,
as discussed in #21330 it would be good to have an extra entry for the new XLM-V model in the Model Doc.
This PR adds that entry, along with additional information about the model and the experiments conducted with it.