clarin-eric/ParlaMint

Great directory rename

Closed this issue · 3 comments

Currently the things in the Corpora directory are rather a mess and it is time to bring some order, this issue documents the steps in this process.

What needs to be done (incl. fixing all the scripts):

  • rename Corpora/ to Build/
  • inside rename Master/ to Distro/
  • inside make directory Sources-TSV/ in move into it all the files from Ministers/, Orientations/, Sex/ and remove these
  • inside rename Docs/ to Sources-Build/
  • in Sources-Build/README.md/ change copied README files to symlinks
  • inside rmdir Sources-MT/, rename Sources-Sem/ to Sources-CoNLLU/
  • inside rename bin/ to Scripts/
  • in Scripts/ symlink to parlamint-lib.xsl

move to Scripts/ all build-only scripts (tei2tsv, tsv2tei, taxonomy, parlamint2release.xsl, parlamint2distro.pl)

tsv scripts have already been moved, but I can't move the rest:

  • taxonomy scripts are used by the main ParlaMint Makefile
  • parlamint2distro.pl calls a bunch of scripts that are also used by the main ParlaMint Makefile
  • parlamint2distro.pl also calls parlamint2release.xsl

In short, I will not move these scripts (and the hope remains that @matyaskopp might use some of them), and delete this line from the To Do list.

I think this has now been finished, i.e. managed to build a couple of corpora in the original language and MTed in the new directory structure. So, closing.