/DarkSpider

Anatomy and Visualization of the Network structure of the Dark web using multi-threaded crawler

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Multithreaded Crawler and Extractor for Dark Web

Version Python license Documentation


Introduction

DarkSpider is a multithreaded crawler and extractor for regular or onion webpages through the TOR network, written in Python.

Full Documentation

See the Docs for full documentation, examples, and other information.

Bugs and Feedback

For bugs, questions and discussions please use the GitHub Issues.

Credits

License

“GPL” stands for “General Public License”. Using the GNU GPL will require that all the released improved versions be free software. source & more

Changelog

v2.0.1:

What's Changed

  • Add: Project IEEE citation by @PROxZIMA in PROxZIMA#8
  • Fix: Ignore Gooey installation by default by @PROxZIMA in PROxZIMA#9
  • Add: CLI command to include/exclude external links by @PROxZIMA in PROxZIMA#11
  • Fixed Issue related to Graphical Analysis by @knightster0804 in PROxZIMA#10
  • Revert "Fixed Issue related to Graphical Analysis" by @PROxZIMA in PROxZIMA#12
  • Creating json by @r0nl in PROxZIMA#14
  • Added 'exclusion' Argument by @r0nl in PROxZIMA#16
  • Added required functionalities and Images by @knightster0804 in PROxZIMA#13
  • White box unit test cases for modules by @PROxZIMA in PROxZIMA#7
  • New graphical visualization using seaborn library by @PROxZIMA in PROxZIMA#17
  • Added image and script crawler by @ytatiya3 in PROxZIMA#15
  • Crawler multi-threaded implementation by @PROxZIMA in PROxZIMA#18

New Contributors

  • @PROxZIMA made their first contribution in PROxZIMA#8
  • @knightster0804 made their first contribution in PROxZIMA#10
  • @r0nl made their first contribution in PROxZIMA#14
  • @ytatiya3 made their first contribution in PROxZIMA#15

Full Changelog: https://github.com/PROxZIMA/DarkSpider/compare/1.0.0...2.0.1

v1.0.0:
  • Initial project setup

Full Changelog: https://github.com/PROxZIMA/DarkSpider/commits/1.0.0