/NADEEF

A Generalized Data Cleaning System

Primary LanguageJavaScriptOtherNOASSERTION

What is NADEEF?

NADEEF (or ''clean'' in Arabic, نظيف) is a generalized data cleaning system developed by the data analytic group at Qatar Computing Research Institute.

See it in Action

Launch NADEEF using command

    Usage: nadeef.sh [OPTIONS]
    Options are:
        console : start the NADEEF console.
        dashboard : start the NADEEF dashboard.

A demo page can be accessed via here.

More details on the instructions can be found in the User Guide.

Goals of NADEEF

Being a commodity data cleaning system, NADEEF aims to be extensible, generic and easy-to-deploy.

Most existing data cleaning methods and systems, either in industry or academia, employ different types of data quality rules in isolation, each time a new data cleaning problem arises or a new type of rules is considered, practitioners either build a new system from scratch or go through a painful process of customizing an existing tool. The NADEEF team designed a new data cleaning system that distinguishes between a programming interface and a core to achieve generality and extensibility.

Publications

http://da.qcri.org/publications

License

NADEEF is released under the terms of the MIT License.

Contact

For any issues or enhancement please use the issue pages in Github, or contact us. We will try our best to help you sort it out.

Acknowledgement

We would like to thank JetBrains' support for their wonderful IntelliJ IDEA product. We are using it through all the development of NADEEF.

IntelliJ