
:desktop_computer: The back-end code for the UOS HUB website. All web scraping and data processing happens here, in Python.


UOS HUB Website Back-End

This repository contains the server-side code of the UOS HUB website. We're using Django, a Python web framework, connected to an SQLite database that stores users' data, such as their preferences and additions.

Our Django project contains two apps. The Website app holds all the static files of the client-side, which are served on request. The API app contains the REST API views that provide access to the server's services.

Aside from that, the Requests package (not a Django app) contains all the code and logic for data retrieval and scraping. The get.py scripts use the Requests library to log in, get data from, and post data to the UOS websites (myUDC, Blackboard, Outlook & the UOS homepage). The scrape.py scripts use the lxml library to process the retrieved pages, extracting and filtering the required data. The extracted data is then structured and sent to the client-side as JSON objects through our RESTful API. The values.py scripts hold the possible values for the package they're in, and obsolete.py holds code that's no longer used.
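For illustration, here is a minimal sketch of the shape a get.py module takes. Everything in it (the login URL, form field names, and the schedule-page address) is a hypothetical placeholder rather than the project's actual code:

```python
# Hypothetical sketch of a get.py module; the URLs and form field
# names are placeholders, not the real UOS endpoints.
import requests

LOGIN_URL = "https://myudc.example.edu/login"  # placeholder URL


def login(student_id, password):
    """Log into a UOS website and return an authenticated session."""
    session = requests.Session()
    # The session keeps the authentication cookies for later requests
    session.post(LOGIN_URL, data={"sid": student_id, "pin": password})
    return session


def get_schedule_page(session):
    """Fetch the raw HTML of the page holding the student's schedule."""
    response = session.get("https://myudc.example.edu/schedule")  # placeholder
    response.raise_for_status()  # fail early on HTTP errors
    return response.text
```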


Diagram of our Back-End's scraping flow (the main functionality)


Details of the Back-End

What's happening in the diagram above?

It shows the data scraping request flow in our back-end. The flow starts by logging into one of the UOS websites, then navigating to the page that contains the desired data and fetching it. That's done with the Requests library, which passes the retrieved page to the lxml library to process and extract exactly what we need from it. The extracted data is then formatted into JSON objects and passed to the client-side through the Django REST Framework API. Before the data is sent, some of it might be stored in our SQLite database for later use (by that student only).
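As a rough illustration of the lxml step, a scrape.py module might look like the sketch below; the XPath expressions and table layout are assumptions, since the real UOS pages differ:

```python
# Hypothetical sketch of a scrape.py module; the XPath expressions and
# table structure are assumptions about the fetched page's layout.
from lxml import html


def scrape_schedule(page_html):
    """Extract course rows from the fetched schedule page."""
    tree = html.fromstring(page_html)
    courses = []
    # Assume one <tr> per registered course inside a schedule table
    for row in tree.xpath('//table[@id="schedule"]//tr'):
        cells = [cell.text_content().strip() for cell in row.xpath('./td')]
        if len(cells) >= 3:
            courses.append({"code": cells[0], "title": cells[1], "time": cells[2]})
    # The returned dict is JSON-serializable, ready for the REST API
    return {"courses": courses}
```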

What's inside the folders?

The Requests folder contains the code which handles the data retrieval and scraping.

The Requests/* folders are packages containing the get.py and scrape.py modules for UOS Websites.

The API folder contains the API app, which handles REST API calls to UOS HUB's views, services and database.

The API/view/* files contain the logic executed when API views are called (a sketch of one such view follows below).

The Website folder contains the Website app which handles the static files of the front-end.

The Website/static folder links to the FrontEnd repository, which contains the front-end's static files.

The UOSHUB folder is the Django project root which contains the general configurations of the project.
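To give a feel for how these pieces connect, here's a hedged sketch of what an API/view/* module might look like. The import path, view name, and the way credentials are read from the Django session are all assumptions, not the actual code:

```python
# Hypothetical sketch of an API/view/* module; the import path, view
# name, and credential handling are assumptions, not the actual code.
from rest_framework.views import APIView
from rest_framework.response import Response

from Requests.myudc import get, scrape  # assumed package layout


class ScheduleView(APIView):
    """Serves the student's scraped schedule as JSON."""

    def get(self, request):
        # Reading credentials from the Django session is an assumption
        session = get.login(request.session["sid"], request.session["pin"])
        page = get.get_schedule_page(session)
        return Response(scrape.scrape_schedule(page))
```

Such a view would then be mapped to a URL in the project's urls.py, for example with path('api/schedule/', ScheduleView.as_view()).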


Things to do in the future

There are still a lot of things we need to do in the back-end:

  • Finish writing the request logic for the different pages we need from the UOS websites.
  • Write the scraping logic for the data fetched from the UOS websites.
  • Provide a RESTful API that the front-end can communicate with in a stateless manner.
  • Log in once to access all UOS websites without a time limit (UDC limits sessions to 15 minutes); one possible approach is sketched after this list.
  • Implement the logic needed for an easy schedule-designing experience for students.
  • Filter out duplicate and spam emails.
  • Build the tasks system to allow adding events, quizzes, tasks, etc.
  • Integrate Blackboard announcements and due dates into the schedule.
  • View and notify about course grades as they come out, and calculate GPA.
  • Allow changing the password and profile photo without affecting the original ones.
  • Add the ability to mark all spam emails as read at once.
  • Allow pushing information to the UOS websites, like dismissing an update or submitting homework.
  • Compress static files before sending them to the client-side.
  • Add a refresh mechanism to update the website's content every 5 minutes or so.
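For the session time-limit item above, one conceivable workaround is to keep the authenticated session warm with a periodic lightweight request. The interval, URL, and timer mechanism below are pure assumptions, sketched around the requests.Session returned by the get.py login above:

```python
# Hypothetical keep-alive sketch for the 15-minute UDC session limit;
# the interval, URL, and timer mechanism are assumptions.
import threading

KEEPALIVE_URL = "https://myudc.example.edu/home"  # placeholder URL


def keep_alive(session, interval=600):
    """Ping the site every `interval` seconds so the session stays fresh."""
    def ping():
        try:
            session.get(KEEPALIVE_URL)
        finally:
            # Schedule the next ping whether or not the request succeeded
            threading.Timer(interval, ping).start()
    ping()
```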
