/genre_analysis

scripts and notebooks for genre analytics on corpora

Primary LanguageJupyter Notebook

genre_analysis

scripts and notebooks for genre analytics on corpora

Working process and data manipulation, also corpus to graph

https://github.com/TatianaShavrina/genre_analysis/blob/master/networks_project.ipynb

Sentences in graph form

Here in .rar archives

Final dataframes for classification and PCA, tSNE

https://github.com/TatianaShavrina/genre_analysis/tree/master/data

scripts for crawling the data are here:

https://github.com/TatianaShavrina/crawlers

scripts for tagging the data

https://github.com/TatianaShavrina/taiga/tree/master/tagging_pipeline

putting the corpus into databases

https://github.com/TatianaShavrina/taiga/tree/master/database