drf_excel_processing

An example DRF project on how to upload and process excel files using openpyxl

Requirements

The requirements.txt/environment.yml lists all Python moduls required. In short:

Python 3.10
- Django==4.0.3
- djangorestframework==3.13.1
- openpyxl==3.0.9

Note: Project was created under Windows. Installing from requirements.txt under Linux might fail due to possible differences in module versions/version naming

Install from source

Clone repository or download as zip. Optional: create a virtualenv / conda env to use an isolated Python environment

Install using pip:

pip install -r requirements.txt

Install using conda:

conda env create -f environment.yml
conda activate drf_excel_processing

Usage

Make sure your environment is activated
Run the Django development server from the "excel_project" folder using:

python .\manage.py runserver

open your browser and navigate to: http://127.0.0.1:8000/api/v1/
Use the provided example_excel_sheet.xlsx for upload and processing
Use "Raw data" to post columns in the Summary view since DRF does not provide a lists component in the HTML form

Notice

this is just an example project that would need some modifications for production use
this project uses the DRF Browsable API accessible via browser at http://127.0.0.1:8000/api/v1/ to allow for easier API discoverability
authentication and authorization was disabled/skipped for ease of use
as stated in https://docs.djangoproject.com/en/4.0/howto/static-files/ - the way static files are served in this project is not suitable for production use. Follow https://docs.djangoproject.com/en/4.0/howto/static-files/deployment/ in order to serve static files in production.
the summary endpoint can be accessed either via the summary URL, or via the "Extra Actions" dropdown in a detail view
since it was just required to create a summary of the provided Excel file, this could have been solved by just implementing one endpoint - however the current approach seems "nicer" overall and allows for easier extensibility
the current create_summary method assumes that the column names can be found in the first row of the first available sheet

Improvements