> make
> ./tgnews <option> <source_dir>
The source_dir should contain only HTML files or directories, for example:
## For all HTML files inside a sample
> ./tgnews languages ./DataClusteringSample0107/
## Or choose a smaller sample
> ./tgnews top ./DataClusteringSample0107/20191101/00/
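
Since source_dir is expected to hold only HTML files or directories, it can help to check a sample before passing it to tgnews. The helper below is a minimal sketch and not part of tgnews: the name `find_non_html` is hypothetical, and it assumes that an "html file" is any regular file whose name ends in `.html` (case-insensitive), which may be stricter or looser than what tgnews itself accepts.

```python
import os


def find_non_html(source_dir):
    """Return a sorted list of files under source_dir whose names do not end in .html.

    Assumption: the file extension is what identifies an HTML file.
    Directories are allowed at any depth and are walked recursively.
    """
    offenders = []
    for root, _dirs, files in os.walk(source_dir):
        for name in files:
            if not name.lower().endswith(".html"):
                offenders.append(os.path.join(root, name))
    return sorted(offenders)
```

Calling `find_non_html("./DataClusteringSample0107/")` returns an empty list when the directory satisfies the assumption above; any paths it returns are candidates to move out before running tgnews.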