Graph displaying article topics
Closed this issue · 0 comments
dvmorozov commented
Task
Implement the main graph displaying article topics and the relations between them.
- The graph should display a set of topics (keywords) and their relations. ✔️
- The size of a node should correspond to the number of articles related to the topic. ✔️
- #82.
- Hovering the mouse over a node should display a pop-up window showing the topic name and the number of articles. ✔️
- Graph edges should connect topics that are related by articles. ✔️ Only the most important links are displayed.
- #86.
- #85.
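The node and edge requirements above can be sketched in Python as a counting step. This is only an illustration, not the script used in the project: it assumes each article record carries a space-separated `categories` string (as in the Kaggle arXiv metadata) and treats those categories as topics. The function name `build_graph` and the sample records are hypothetical.

```python
from collections import Counter
from itertools import combinations


def build_graph(articles):
    """Count articles per topic (node size) and per topic pair (edge weight)."""
    node_counts = Counter()
    edge_counts = Counter()
    for article in articles:
        # Assumption: topics are the space-separated arXiv categories.
        topics = sorted(set(article["categories"].split()))
        node_counts.update(topics)
        # Every pair of topics sharing this article gets one more co-occurrence.
        edge_counts.update(combinations(topics, 2))
    return node_counts, edge_counts


# Hypothetical sample records, for illustration only.
sample = [
    {"categories": "cs.LG stat.ML"},
    {"categories": "cs.LG cs.AI"},
    {"categories": "cs.LG stat.ML cs.AI"},
]
nodes, edges = build_graph(sample)
```

Sorting the topics before pairing keeps each edge key in a canonical order, so `("cs.AI", "cs.LG")` and `("cs.LG", "cs.AI")` are never counted separately.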
Solution
- Implement a Python script that extracts graph data from arXiv metadata. ✔️ Related: #75.
- Filter the data to show only the most important relations. ✔️
- Implement the graph page and the JavaScript code to load and visualize the data. ✔️
- Use the force-directed graph provided by d3.js. ✔️
- Use GitHub as the hosting platform. Use JavaScript as the data format. No back-end code. ✔️
- Add references to the components used on the page. ✔️
- Add a hyperlink that opens the graph in a separate browser tab. ✔️
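The filtering and "JavaScript as the data format" steps could look roughly like the sketch below. It assumes "most important" means the edges with the highest article counts, and that the page loads a script file defining a single global variable. The names `to_js_data`, `graphData`, `max_edges`, and the `articles`/`weight` fields are hypothetical and not taken from the project.

```python
import json


def to_js_data(node_counts, edge_counts, max_edges=2, var_name="graphData"):
    """Keep the heaviest edges and render the graph as a JavaScript assignment."""
    # Assumption: importance of a link = number of shared articles.
    top_edges = sorted(edge_counts.items(), key=lambda kv: (-kv[1], kv[0]))[:max_edges]
    # Keep only the topics that still take part in at least one edge.
    used = {topic for (a, b), _ in top_edges for topic in (a, b)}
    graph = {
        "nodes": [{"id": t, "articles": node_counts[t]} for t in sorted(used)],
        "links": [{"source": a, "target": b, "weight": w} for (a, b), w in top_edges],
    }
    # A plain script file needs no back end: the page includes it with <script>.
    return f"const {var_name} = {json.dumps(graph, indent=2)};\n"


# Hypothetical counts, for illustration only.
node_counts = {"a": 3, "b": 2, "c": 1}
edge_counts = {("a", "b"): 5, ("a", "c"): 1, ("b", "c"): 2}
js_text = to_js_data(node_counts, edge_counts)
```

The `nodes`/`links` layout with `id`, `source`, and `target` fields follows the shape commonly fed to d3.js force simulations, so the page script can pass the object on with little transformation.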
Dependencies
https://pypi.org/project/ijson/
Data
https://www.kaggle.com/datasets/Cornell-University/arxiv
https://arxiv.org/help/bulk_data
Commands
Extract the downloaded data:
gzip -d arxiv-public-datasets.gz