Update v2.3.x from master #5755

Merged
merged 14 commits into explosion:v2.3.x on Jul 13, 2020

adrianeboyd
Contributor

Description

Update v2.3.x from master

Types of change

Checklist

  • I have submitted the spaCy Contributor Agreement.
  • I ran the tests, and all new and existing tests passed.
  • My changes don't require a change to the documentation, or if they do, I've added all required information.

honnibal and others added 14 commits July 5, 2020 15:48
* Use cosine loss in Cloze multitask

* Fix char_embed for gpu

* Call resume_training for base model in train CLI

* Fix bilstm_depth default in pretrain command

* Implement character-based pretraining objective

* Use chars loss in ClozeMultitask

* Add method to decode predicted characters

* Fix number characters

* Rescale gradients for mlm

* Fix char embed+vectors in ml

* Fix pipes

* Fix pretrain args

* Move get_characters_loss

* Fix import

* Fix import

* Mention characters loss option in pretrain

* Remove broken 'self attention' option in pretrain

* Revert "Remove broken 'self attention' option in pretrain"

This reverts commit 56b820f.

* Document 'characters' objective of pretrain
…rhead of mecab (explosion#5701)

* speed up Korean nlp 100x by stopping mecab from reloading on each doc

* add contributor agreement

* rename variables to improve code readability
* Add texthero to universe.json

* Add spaCy contributor Agreement
* Adding spaczz package to universe.json

* Adding contributor agreement.
Remove English-specific is_base_form from French lemmatizer.
* Fix lemmatizer init args for python2.7

* Move English is_base_form to a class method

* Skip test pickling PhraseMatcher for python2
Co-authored-by: Mark Neumann <markng@allenai.org>
adrianeboyd merged commit bf778f5 into explosion:v2.3.x on Jul 13, 2020

8 participants