/neatdata

Shell scripts for cleaning csv, tsv, txt, and other types of data files.

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

Neatdata

This is a collection of python scripts I use to clean databases in csv, tsv, and txt format.

Experience shows that cleaning data directly on bash is efficient and might free your RAM Memory. It would be okay to deal with 10GB files. You don't have to load a file in memory before working on it, you just have to clean it.


They are inspired by the package 'tidyr' created by Hadley Wickham (check it out - https://github.com/hadley/tidyr).