New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
issues related to CHILDES Corpus Reader #3079
Comments
Hello @Arthur-Kan, I'm struggling a bit to reproduce these issues, and I think the reason is that the fileids that you must specify to To debug that, I'd like to ask whether this script work for you? from nltk.corpus.reader import CHILDESCorpusReader
valian = CHILDESCorpusReader('./Valian', '.*.xml')
fileids = valian.fileids()
#print words
print(valian.words(fileids[0]))
#print sentences
print(valian.sents(fileids[0]))
#print MLU
print(valian.MLU(fileids[0])) If not, could you also specify your NLTK version, so I can figure out if it's version specific? I mention this, because somewhat recently we had some issues with the CHILDES corpus not parsing correctly (#2997).
|
Hello Tom, Thank you for your response. My NLLTK version is 3.7. I have just tried the codes you suggested, the output is still the same. I tried other functions as well, such as tagged_words() and tagged_sent(), they all return an empty list, except for MLU which returns a zero. I have asked my friends to try the same codes, and also the ones you suggested, they all don't seem to be working. Could you please take a look? Thank you so much for your help!
Best, |
Hey @Arthur-Kan, NLTK 3.7 is the most recent release, although we've been working on various fixes since then on the We plan to create a new release some time in the next week, and until then you can use the
|
Hello Tom, Thank you very much for the information, I think I can wait till the new release next week. May I ask if the information related to the release will be said on NLTK website, or how could I tell when it is released? Looking forward to the updated version, thank you for your help! Best wishes, |
Hello @Arthur-Kan, The newest update, NLTK 3.8, is out now! See the Release Notes on the website or the ChangeLog on the repo for more information on the release. I think I can close this now, as it should be fixed. Let us know if you experience issues still!
|
Greetings. I am working on a child language project and would like to use the CHILDES Corpus Reader package to analyze children's language data. However, the methods do not output anything. I am trying with the Valian Corpus in the XML version (the link for downloading the XML version of Valian corpus is [(https://childes.talkbank.org/data-xml/Eng-NA/)]
Heres the code I used:
Here is what the output for words, sentences and MLU look like:
Thank you very much for your help!!
The text was updated successfully, but these errors were encountered: