ajinkyakulkarni14/TED-Multilingual-Parallel-Corpus

Problem downloading the Bilingual_Parallel_Corpus

Opened this issue · 4 comments

Hi, I want to download the files named Chinese, Simplified-French.txt and Chinese, Simplified-Spanish.txt from the Bilingual_Parallel_Corpus folder, but these files only show the following:

version https://git-lfs.github.com/spec/v1
oid sha256:7dc4c4d364f2d05f575ec62e3663b558d01ed2c87baf53adb6ab29bd46571129
size 48043685

and they cannot be used.

So could you tell me how I can download the original files?

Best!

This repository is over its data quota. Account responsible for LFS bandwidth should purchase more data packs to restore access.
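For context, the three lines quoted in the question are a Git LFS pointer file, not the corpus text itself; the real content is stored on the LFS server, which is what the quota message refers to. A minimal sketch for checking whether a downloaded file is such a pointer, assuming the `version` / `oid` / `size` layout shown above (the function name `parse_lfs_pointer` is hypothetical, not part of any Git LFS tooling):

```python
from typing import Optional

# First line of every pointer file quoted in this thread.
LFS_SPEC_PREFIX = "version https://git-lfs.github.com/spec/"


def parse_lfs_pointer(text: str) -> Optional[dict]:
    """Return the pointer fields if `text` looks like a Git LFS pointer, else None."""
    lines = text.strip().splitlines()
    if not lines or not lines[0].startswith(LFS_SPEC_PREFIX):
        return None

    # Each pointer line is "<key> <value>".
    fields = {}
    for line in lines:
        key, _, value = line.partition(" ")
        fields[key] = value

    # A usable pointer needs the object id and a numeric size.
    if "oid" not in fields or not fields.get("size", "").isdigit():
        return None

    return {
        "version": fields["version"],
        "oid": fields["oid"],
        "size": int(fields["size"]),  # size of the real file, in bytes
    }


pointer_text = (
    "version https://git-lfs.github.com/spec/v1\n"
    "oid sha256:7dc4c4d364f2d05f575ec62e3663b558d01ed2c87baf53adb6ab29bd46571129\n"
    "size 48043685\n"
)
info = parse_lfs_pointer(pointer_text)
```

Applied to the pointer quoted in the question, `info["size"]` is 48043685, which is the size of the real file rather than of the small pointer that was downloaded. Ordinary corpus text returns `None`.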

@Zachary-YL in the /Multilingual_Parallel_Corpus directory there are files that you can use to reproduce all the parallel text files.

@averkij thank you. Do the files in the zip contain the same parallel sentences as those in the "Bilingual_Parallel_Corpus" folder?

Thank you.

In the coming six months, this repo will be moved to https://zenodo.org/ with documentation and additional modalities in the dataset.