NADEEF (or ''clean'' in Arabic, نظيف) is a generalized data cleaning system developed by the data analytic group at Qatar Computing Research Institute.
Launch NADEEF using command
Usage: nadeef.sh [OPTIONS]
Options are:
console : start the NADEEF console.
dashboard : start the NADEEF dashboard.
A demo page can be accessed via here.
More details on the instructions can be found in the User Guide.
Being a commodity data cleaning system, NADEEF aims to be extensible, generic and easy-to-deploy.
Most existing data cleaning methods and systems, either in industry or academia, employ different types of data quality rules in isolation, each time a new data cleaning problem arises or a new type of rules is considered, practitioners either build a new system from scratch or go through a painful process of customizing an existing tool. The NADEEF team designed a new data cleaning system that distinguishes between a programming interface and a core to achieve generality and extensibility.
http://da.qcri.org/publications
NADEEF is released under the terms of the MIT License.
For any issues or enhancement please use the issue pages in Github, or contact us. We will try our best to help you sort it out.
We would like to thank JetBrains' support for their wonderful IntelliJ IDEA product. We are using it through all the development of NADEEF.