TDA based Summary and Distance between Sets of Texts and Distance between Literary Styles

The dataset used is part of the CorpusSonetosSigloDeOro repository.

  • In poets_comparison.py, the bottleneck distance computation of the experimentation is done.

  • In poets_comparison_entropy.py, the bottleneck distance computation of the experimentation is done.

  • In the folder Results, the bottleneck distance values of 100 iterations of the algorithm is provided.

  • In the folder Entropy_values, the entropy values of 100 iteration of the algorithm is provided.

An illustrative example using circumferences datasets can be seen in the jupyter notebook Example circumferences.ipynb