/wikispecies-crawler

Crawling https://species.wikimedia.org/wiki/Main_Page

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Wiki Species Data Extractor

Python Parsers for : https://species.wikimedia.org/wiki/Main_Page Parsing all sub-species for any given species or taxon group

Requirements:

  • Python
  • BS4
  • urllib

Sample (Plantae)

Tested on Python3 / MacOS X

>>> extract_taxons_rec('https://species.wikimedia.org/wiki/Main_Page/Plantae')
<<< 

OP: Plantae data output

Contributions

El bouchti Alaa
Hilaly Mohammed-Amine