/dataset_collection

A collection of scripts and datasets related to dataset collection activities

Primary language: Jupyter Notebook

Dataset Collection Repository

This repository contains Python scripts for scraping datasets related to COVID-19 in Indonesia.

How to use?

This project was developed with Python 3. To run it, install the requirements with pip (pip install -r requirements.txt) or with pipenv (pipenv install).

Requirements

  1. Python 3.*
  2. Pandas 1.0.3
  3. BeautifulSoup 4 (PyPI package bs4 0.0.1)
  4. LXML 4.2.1
  5. Selenium 3.141.0
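
To confirm that the installed packages match the versions listed above, a small check along these lines may help. This is only a sketch: the distribution names (pandas, bs4, lxml, selenium) are assumed PyPI names inferred from the list, the helper check_requirements is hypothetical and not part of this repository, and importlib.metadata requires Python 3.8 or newer.

```python
from importlib import metadata

# Versions copied from the Requirements list; the distribution names are
# assumed PyPI names and are not confirmed by this repository.
PINNED = {
    "pandas": "1.0.3",
    "bs4": "0.0.1",
    "lxml": "4.2.1",
    "selenium": "3.141.0",
}


def check_requirements(pinned):
    """Return {name: (expected, installed_or_None)} for every mismatch."""
    mismatches = {}
    for name, expected in pinned.items():
        try:
            installed = metadata.version(name)
        except metadata.PackageNotFoundError:
            # The package is not installed at all.
            installed = None
        if installed != expected:
            mismatches[name] = (expected, installed)
    return mismatches


if __name__ == "__main__":
    for name, (expected, installed) in check_requirements(PINNED).items():
        print(f"{name}: expected {expected}, found {installed or 'not installed'}")
```

An empty result means every listed package is installed at the listed version.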

How to Run the Scripts

  1. Install all of the requirements using pip3 install package-name
  2. ipynb Files
    1. Open the ipynb files in Jupyter Notebook or Google Colab
    2. Choose Kernel > Restart & Run All
  3. py Files
    1. Open terminal
    2. Run using python3 script_file_name.py
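
If several py files need to be run in one go, the terminal step above could be wrapped in a short helper such as the following. It is a sketch under assumptions: run_scripts is a hypothetical helper, not something shipped in this repository, and it assumes the target folder holds only the scripts you want to run (it should not contain the helper itself).

```python
import subprocess
import sys
from pathlib import Path


def run_scripts(folder):
    """Run each .py file in `folder` with the current Python interpreter.

    Returns a dict mapping file name to the script's exit code.
    """
    results = {}
    for script in sorted(Path(folder).glob("*.py")):
        # Equivalent to typing: python3 script_file_name.py
        completed = subprocess.run([sys.executable, str(script)])
        results[script.name] = completed.returncode
    return results


# Example call; "scrapers" is a hypothetical folder name:
# print(run_scripts("scrapers"))
```

An exit code of 0 indicates that the script finished without an unhandled error; any other value points to a script worth re-running by hand to read its output.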