/wikipedia-graph

🦉Graphical representation of connections between articles on Wikipedia

Primary LanguagePythonMIT LicenseMIT

Contributors Forks Stargazers Issues MIT License LinkedIn


Wikipedia Graph

Crawl wikipedia and search for connection between 2 articles

Crawler

details page

login page

Path finder

main

About The Project

This project has 2 modules:

  1. crawler.py allows to crawl all links from the given page, add them to graph and then visit all scraped urls. After crawling is finished you can see generated graph by opening the html file.
  2. main.py allows to search a connection between 2 urls - START_URL and END_URL the program works as long as the connection is not found.

Built With

  • Python
  • Networkx
  • pyvis
  • requests
  • bs4

Installation

Development

  1. Clone git repository
git clone https://github.com/DEENUU1/wikipedia-graph.git
  1. Install all requirements
pip install -r requirements.txt
  1. Run selected script
python crawler.py

python main.py

License

See LICENSE.txt for more information.