This program semi-automates a manual process that normally takes an estimated 5 hours to complete, reducing the processing time to under 6 minutes.
The program is divided into two stages:
In this stage, the program:
- Loads/Extracts multiple CSV files
- Resolves encoding errors in the files
- Cleans and transforms the data
- Loads/Writes the files into categorized .csv files labeled as 'Inward' and 'Outward'
In this stage, the program:
- Loads the previously exported Inward and Outward .csv files
- Further breaks down the data into sub-categories for easier user understanding
- Fork the Repo to your PC
- Go to the
main
folder and edit theinit.py
file in your Code editor - Go to Line 29 and change the
base_dir
variable according to the location of the Repo on your PC - Run the code and enjoy
- io
- os
- glob
- pandas
- pandasql
- datetime
- nbformat
- nbconvert
- ipykernel
To install these Python modules, you can use the pip
package manager. Open your terminal or command prompt and run the following commands:
pip install pandas pandasql nbformat nbconvert ipykernel
For the other modules (io
, os
, glob
, and datetime
), they are part of the Python standard library, so you don't need to install them separately. You can directly import them in your Python scripts or Jupyter notebooks.
If you encounter any issues during installation or need further assistance, feel free to ask! 😊