Timeline retrieval of academic papers from ACL Anthology collection
- LDA Model for Topic Modelling
- HDP Model for Topic Modelling
- ACL Dataset
- Analyzing NLP Research over time
- Random Walk with restart
- DivRank
- [Slides] (https://docs.google.com/presentation/d/1oJz0S1t27yeFW5U0F750fuiW3Oh97I41tP352O_HgQ0/edit?usp=sharing)
- run
python doc_tp.py
to get all the topics and get topic proportions. This will createdoc_tp_scores.npy
object. - run
python doc_tp_icp.py
to get TP-ICP scores for all the documents. This will createICP.npy
andTP_ICP_DOC.npy
object.
- run
python doc_similarity.py
to get similarity between documents. This will createDOC_SIMILARITY.npy
object. - run
python doc_similarity_normalised.py
to get similarity between documents. This will createDOC_SIMILARITY_NORMALIZED.npy
object.
- run
python random_walk_parallel.py
to get similarity between documents. This will create 9916 numpy objects based on random walk being started from each node.
- run
python manage.py runserver 0:8000
to start the server. Open your browser and goto127.0.0.1:8000