New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Renamed omw to omw-1.4 #2907
Renamed omw to omw-1.4 #2907
Conversation
This PR assumes nltk/nltk_data#175. Otherwise, the tests fail because OMW-1.4 cannot be downloaded. |
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
Looks good to me.
We should merge nltk/nltk_data#175 quickly, and then we can restart the tests for this PR, and get this merged too.
Thanks for jumping on this so quickly @ekaf
I compared the output of:
using respectively the old nltk-3.5 (with old omw.zip) vs. nltk-3.6.5 and this PR (with new omw-1.4.zip):
This shows that less than half the languages had a (usually small) increase in the number of lemmas, while the number of synsets stayed the same between the two OMW versions. |
This is going to have cache issues again. I've tried only caching if the commit message doesn't contain |
I suspect the secrets trick only works on branches within |
@tomaarsen: near the top of that page, it says "Secrets are not passed to workflows that are triggered by a pull request from a fork". So I suspect there will be a better chance if I renew the key in ci.yaml. We can always restore it afterwards. |
@ekaf sounds good. I suppose that makes sense - if PR's were passed secrets, then someone could run a github actions workflow in a PR that just prints the secret in plaintext, which sounds like a giant vulnerability when secrets are used as authentication etc. |
@tomaarsen, yes, that sounds likely to be the reason. The same applies to wordnet2021, then. I am to going add .xml files for them, run "make pkg_index" and submit a PR in nltk_data. |
@ekaf I'm already on it. I'll tag you in a PR in a few minutes, and you can review it if possible. |
The Ubuntu cache was updated, but the Windows one was not, hence the fail here. With other words, I have no issues merging this as it stands. On Thank you for your work in getting us to OMW 1.4! |
We really need a new version to pick up changes introduced in nltk/nltk#2907
This PR selects omw-1.4 as the default package to use together with Wordnet. This implements the third point in @tomaarsen's plan from #2905 (comment), in order to ensure a smooth transition to the new OMW-1.4 data.