jiangqn/KSTER

Missing train.en file in Medical dataset.

czwlines opened this issue · 1 comments

Could author provide this file? thanks.

https://github.com/roeeaharoni/unsupervised-domain-clusters 这个网址下载没有预处理过的medical领域训练数据,用moses脚本做分词,然后用subword-nmt切分子词,需要用到的bpe codes和词表文件在网盘中