This repository is under construction. Current progress is shown in the table below:

NE Tags | Count | Goal | Progress |
---|---|---|---|
PER | 359 | 5109 | 7 % |
LOC | 389 | 2476 | 15 % |
ORG | 448 | 4181 | 10 % |
Total | 1196 | 11766 | 10 % |
Corpus size | 24160 | 250000 | 9 % |

The original texts were gathered from various news websites, including farsnews, yjc, and tabnak. The tagging mechanism is described in a paper that will be uploaded in the future. This repository has so far been only roughly revised by human annotators; a more accurate version of the corpus will be published soon.
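
The Progress column in the table above appears to be the count divided by the goal, expressed as a whole percentage and rounded down. A minimal sketch of that arithmetic follows, assuming the rounding-down convention; the names `progress_table` and `percent_done` are hypothetical and not part of this repository.

```python
# Counts and goals copied from the progress table above.
progress_table = {
    "PER": (359, 5109),
    "LOC": (389, 2476),
    "ORG": (448, 4181),
}


def percent_done(count, goal):
    """Whole-number percentage of the goal reached, rounded down (assumed convention)."""
    return count * 100 // goal


# Totals across the three named-entity tags.
total_count = sum(count for count, _ in progress_table.values())
total_goal = sum(goal for _, goal in progress_table.values())

for tag, (count, goal) in progress_table.items():
    print(f"{tag}: {count} / {goal} = {percent_done(count, goal)} %")
print(f"Total: {total_count} / {total_goal} = {percent_done(total_count, total_goal)} %")
```

Updating the counts in `progress_table` as tagging continues would keep the table's percentages consistent with the raw numbers.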