This repository contains the drug images crawled from DrugBank.
- Install Jupyter Notebook using Anaconda or from any other source
- Ensure you have Python 3 installed
- Start the notebook from the current folder using the command `jupyter notebook`
- Ensure that the following packages are installed
  - svglib
  - reportlab
  - scrapy
  - urllib (part of the Python 3 standard library)
  - requests
  - pathlib (part of the Python 3 standard library)
- Open `Drugbank_Crawler.ipynb` and execute all the cells
  - This will crawl all the data from DrugBank and store the images in the `svg` folder in .svg format
- Open `PNG_Convert.ipynb` and execute all the cells
  - This will convert the images in the `svg` folder to .png format and store them in the `png` folder inside the same directory. This will also remove images containing "No Structures Found"
- You will have all the images in the `png` folder
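
The conversion step above can be sketched roughly as follows. This is a minimal illustration, not the contents of `PNG_Convert.ipynb`: the helper names (`is_placeholder`, `png_path_for`, `convert_folder`) are hypothetical, and detecting placeholder images by searching the raw SVG source for the text "No Structures Found" is an assumption about how those files are written. The notebook may instead filter after conversion.

```python
from pathlib import Path

# Marker text mentioned in the steps above. Matching it in the raw SVG
# source is an assumption about how the placeholder images are written.
PLACEHOLDER_TEXT = "No Structures Found"


def is_placeholder(svg_text: str) -> bool:
    """Return True if the SVG source contains the placeholder message."""
    return PLACEHOLDER_TEXT in svg_text


def png_path_for(svg_path: Path, png_dir: Path) -> Path:
    """Map <svg_dir>/<name>.svg to <png_dir>/<name>.png."""
    return png_dir / (svg_path.stem + ".png")


def convert_folder(svg_dir: str = "svg", png_dir: str = "png") -> int:
    """Convert every usable .svg in svg_dir to a .png in png_dir.

    Returns the number of files converted.
    """
    # Imported here so the helpers above work without these packages.
    from svglib.svglib import svg2rlg
    from reportlab.graphics import renderPM

    src, dst = Path(svg_dir), Path(png_dir)
    dst.mkdir(exist_ok=True)
    converted = 0
    for svg_path in sorted(src.glob("*.svg")):
        svg_text = svg_path.read_text(encoding="utf-8", errors="ignore")
        if is_placeholder(svg_text):
            continue  # skip placeholder images
        drawing = svg2rlg(str(svg_path))
        if drawing is None:
            continue  # svglib could not parse this file
        renderPM.drawToFile(drawing, str(png_path_for(svg_path, dst)), fmt="PNG")
        converted += 1
    return converted
```

Calling `convert_folder()` from the repository folder, with `svglib` and `reportlab` installed, would read from `svg` and write to `png`.
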

| Classes | Count |
|---|---|
| cardiovascular | 419 |
| central nervous system | 963 |
| anti-infective | 671 |
| gastrointestinal | 209 |
| anti-inflammatory | 233 |
| dermatological | 85 |
| hematologic | 111 |
| lipid regulating | 49 |
| reproductive control | 70 |
| respiratory system | 112 |
| urological | 21 |
| antineoplastic | 494 |
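
To see how unevenly the classes are populated, the counts in the table can be tallied with a few lines of Python. This is only an illustrative sketch: the `counts` dictionary is copied by hand from the table rather than read from the repository, and the variable names are arbitrary.

```python
# Class counts copied by hand from the table above.
counts = {
    "cardiovascular": 419,
    "central nervous system": 963,
    "anti-infective": 671,
    "gastrointestinal": 209,
    "anti-inflammatory": 233,
    "dermatological": 85,
    "hematologic": 111,
    "lipid regulating": 49,
    "reproductive control": 70,
    "respiratory system": 112,
    "urological": 21,
    "antineoplastic": 494,
}

total = sum(counts.values())

# Fraction of the total that falls in each class.
shares = {name: n / total for name, n in counts.items()}

# Print the classes from largest to smallest with their share of the total.
for name, n in sorted(counts.items(), key=lambda kv: kv[1], reverse=True):
    print(f"{name:<24}{n:>5}{shares[name]:>8.1%}")
print(f"{'total':<24}{total:>5}")
```
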