【File properties】:
File name: RDRsegmenter: A fast and accurate Vietnamese word segmenter (LREC 2018)
File size: 394KB
File format: ZIP
Last updated: 2021-05-27 21:10:13
vietnamese word-segmentation vietnamese-nlp vietnamese-tokenizer Java
A fast and accurate Vietnamese word segmenter
The implementation of RDRsegmenter, as described in the following paper:
@InProceedings{NguyenNVDJ2018,
author={Dat Quoc Nguyen and Dai Quoc Nguyen and Thanh Vu and Mark Dras and Mark Johnson},
title={{A Fast and Accurate Vietnamese Word Segmenter}},
booktitle={Proceedings of the 11th International Conference on Language Resources and Evaluation (LREC 2018)},
pages={2582--2587},
year={2018}
}
Please cite the paper above whenever RDRsegmenter is used to produce published results or is incorporated into other software.
【File preview】:
RDRsegmenter-master
----Node.java(1KB)
----Model.RDR(125KB)
----RDRsegmenter.java(11KB)
----VnVocab(514KB)
----Vocabulary.java(62KB)
----License.md(565B)
----FWObject.java(436B)
----Utils.java(4KB)
----WordTag.java(270B)
----Readme.md(3KB)
----train()
--------Utility()
--------Train_gold.txt(409KB)
--------RDRsegmenter.py(1KB)
--------Readme.md(2KB)
--------SCRDRlearner()
----Tokenizer.java(17KB)
----DataPreprocessor.java(2KB)