/bubbles

[NOT MAINTAINED] Bubbles – Python ETL framework

Primary LanguagePythonOtherNOASSERTION

Bubbles

Bubbles is a Python ETL Framework and set of tools. It can be used for processing, auditing and inspecting data. Focus is on understandability and transparency of the process.

Project page: http://bubbles.databrewery.org

Blog: http://blog.databrewery.org

About

Bubbles is a Python framework for:

  • ETL (extraction, transformation and loading)
  • preparation of data for further analysis
  • data probing – analysing properties of data, mostly categorical in nature
  • data quality monitoring
  • virtual data objects – abstraction of table-like structured datasets. Datasets are treated the same, no matter whether the source is a text file or a database table.

Installation

Requires at least Python 3.3.

To install Bubbles framework type:

pip install bubbles

To install Bubbles from sources, you can get it from Github:

https://github.com/Stiivi/bubbles

Documentation

Introduction to bubbles (Slideshare presentation)

Operations (Scribd document)

Documentation can be found at: http://packages.python.org/bubbles

Sources

Project source repository is being hosted at Github: https://github.com/Stiivi/bubbles

git clone git://github.com/Stiivi/bubbles.git

Support

If you have questions, problems or suggestions, you can send a message to the Google group or write to the author.

Author

Stefan Urbanek stefan.urbanek@gmail.com

License

Bubbles is licensed under MIT license with following addition:

If your version of the Software supports interaction with it remotely 
through a computer network, the above copyright notice and this permission 
notice shall be accessible to all users.

Simply said, that if you use it as part of software as a service (SaaS) you have to provide the copyright notice in an about, legal info, credits or some similar kind of page or info box.

For full license see the LICENSE file.