/odloader

Open Data Loader for Python.

Primary LanguagePythonApache License 2.0Apache-2.0

ODLoader

ODLoader (odloader) is an open source Python library for open data analysis.

Functionality

We have many open data, but it is difficult to use them effectively because of different data formats and download methods. Hirokazu Takaya and ODLoader Development Team published COVID-19 data in Japan and covsirphy Python library for COVID-19 data analysis here and found many challenges in data engineering.

ODLoader has the following functionalities to overcome the challenges in data science.

  • to download datasets and save them at local directry
    • (expected) with URLs
    • (expected) pre-registered open data providers
  • (expected) to record data source information to configuration file
  • (expected) to update local data if reqired
  • to provide data engineering tools
    • (expected) tools for exploratory data analysis
    • (expected) tools for data cleaning

If you have ideas or need new functionalities, please join this project. Any suggestions via Github Issues and Twitter: @lisphilar are always welcomed. Questions are also great.

Installation

To be confirmed.

Demo

To be confirmed.

Tutorial

To be confirmed.

Release notes

Refer to GitHub Releases.

Developers

ODLoader is developed by a community of volunteers. Please see the full list here.

License: Apache License 2.0

Please refer to LICENSE file.

Citation

Please cite this library as follows with version number.

Hirokazu Takaya and ODLoader Development Team (2023), ODLoader version [version number]: Open source Python library for open data analysis, https://github.com/lisphilar/odloader