/digikrawl

Data is the new gold. This repository will help you gain some! Using Python, Pandas, and BeautifulSoup4, you can scrape Digikala (Iranian E-commerce company) comments. You can simply choose your desired category of products and run code cells and get a .csv file containing the comments.

Primary LanguageJupyter Notebook

Digikrawl: DigiKala Comment Crawler

Data is the new gold. This repository will help you gain some! Using Python, Pandas, and BeautifulSoup4, you can scrape Digikala (Iranian E-commerce company) comments. You can simply choose your desired category of products and run code cells and get a .csv file containing the comments. You could aquire great dataset for NLP (Natural Language Processing) and NLU (Natural Language Understanding) tasks in Farsi.

All credits belong to DigiKala.