# Logs Analysis Project
An internal reporting tool that uses information from a large newspaper website database to draw business conclusions.

Lawrence Nara

The tool is a Python class that queries this database. The database contains newspaper articles and the web server log for the site; the log has one row for each time a reader loaded a web page. The database contains three tables:
- The authors table includes information about the authors of articles.
- The articles table includes the articles themselves.
- The log table includes one entry for each time a user has accessed the site.

The tool reports on three questions:

- The three most popular articles of all time.
- The most popular article authors of all time.
- The days on which more than 1% of requests led to errors.
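
The three questions above can be sketched as SQL queries. The example below is only an illustration, not the project's actual code: it uses an in-memory SQLite database as a stand-in for the PostgreSQL `news` database so that it runs on its own, and the column names (`slug`, `path`, `status`, `time`), the `/article/<slug>` path format, the status strings, and the sample rows are all assumptions made for the example.

```python
import sqlite3

# Hypothetical, simplified schema. The real tables may have other columns.
SCHEMA = """
CREATE TABLE authors (id INTEGER PRIMARY KEY, name TEXT);
CREATE TABLE articles (id INTEGER PRIMARY KEY, author INTEGER,
                       title TEXT, slug TEXT);
CREATE TABLE log (id INTEGER PRIMARY KEY, path TEXT, status TEXT, time TEXT);
"""

# Question 1: the three articles with the most successful page loads.
TOP_ARTICLES = """
SELECT articles.title, COUNT(*) AS views
FROM articles
JOIN log ON log.path = '/article/' || articles.slug
WHERE log.status = '200 OK'
GROUP BY articles.title
ORDER BY views DESC
LIMIT 3;
"""

# Question 2: authors ranked by the total views of their articles.
TOP_AUTHORS = """
SELECT authors.name, COUNT(*) AS views
FROM authors
JOIN articles ON articles.author = authors.id
JOIN log ON log.path = '/article/' || articles.slug
WHERE log.status = '200 OK'
GROUP BY authors.name
ORDER BY views DESC;
"""

# Question 3: days where more than 1% of requests were not '200 OK'.
# date() here is SQLite's date function applied to a text timestamp.
ERROR_DAYS = """
SELECT day, 100.0 * errors / total AS error_pct
FROM (
    SELECT date(time) AS day,
           COUNT(*) AS total,
           SUM(CASE WHEN status != '200 OK' THEN 1 ELSE 0 END) AS errors
    FROM log
    GROUP BY date(time)
) AS daily
WHERE 100.0 * errors / total > 1.0
ORDER BY day;
"""

conn = sqlite3.connect(":memory:")
conn.executescript(SCHEMA)

# Made-up sample data, only to exercise the queries.
conn.executemany("INSERT INTO authors VALUES (?, ?)",
                 [(1, "Ann"), (2, "Bob")])
conn.executemany("INSERT INTO articles VALUES (?, ?, ?, ?)",
                 [(1, 1, "First", "first"), (2, 2, "Second", "second"),
                  (3, 1, "Third", "third"), (4, 2, "Fourth", "fourth")])
log_rows = (
    [("/article/first", "200 OK", "2016-07-01 10:00:00")] * 3
    + [("/article/second", "200 OK", "2016-07-01 11:00:00")] * 2
    + [("/article/third", "200 OK", "2016-07-02 09:00:00")]
    + [("/article/missing", "404 NOT FOUND", "2016-07-02 09:05:00")]
)
conn.executemany("INSERT INTO log (path, status, time) VALUES (?, ?, ?)",
                 log_rows)

top_articles = conn.execute(TOP_ARTICLES).fetchall()
top_authors = conn.execute(TOP_AUTHORS).fetchall()
error_days = conn.execute(ERROR_DAYS).fetchall()

print(top_articles)
print(top_authors)
print(error_days)
```

In this sample data, the first day has five requests and no errors, so it is excluded; the second day has one error out of two requests (50%), so it is the only day reported.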

Requirements:

- Python 2
- Vagrant
- VirtualBox

Setup:

- Install Vagrant and VirtualBox.
- Clone this repository into your `vagrant` folder.
- Launch the Vagrant VM by running `vagrant up`. You can then log in with `vagrant ssh` on Linux, or `winpty vagrant ssh` on Windows.
- Download the data. You will need to unzip this file after downloading it. The file inside is called `newsdata.sql`. Put this file into the `vagrant` directory, which is shared with your virtual machine.
- To load the data into your database, use the command `psql -d news -f newsdata.sql`.
- Run `python log_analysis.py` from the command line.
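
As a rough idea of what a reporting script of this kind does when run, here is a minimal sketch of a helper that runs one query and formats the result as text. It is not the contents of `log_analysis.py`: the function name `run_report`, the output layout, and the sample table are hypothetical. The helper accepts any Python DB-API connection; the example uses an in-memory SQLite database so it can run anywhere, whereas against the loaded data you would presumably pass a PostgreSQL connection to the `news` database (for example one created with the `psycopg2` driver, which is an assumption here).

```python
import sqlite3


def run_report(conn, title, query):
    """Run one query on a DB-API connection and return the report as text.

    The query is expected to return two columns per row: a label and a value.
    """
    cursor = conn.cursor()
    cursor.execute(query)
    rows = cursor.fetchall()
    cursor.close()
    lines = [title]
    for label, value in rows:
        lines.append("  {0}: {1}".format(label, value))
    return "\n".join(lines)


# Stand-in connection with made-up data so the sketch is self-contained.
# With PostgreSQL you might use psycopg2.connect(dbname="news") instead.
conn = sqlite3.connect(":memory:")
conn.execute("CREATE TABLE log (path TEXT)")
conn.executemany("INSERT INTO log VALUES (?)", [("/a",), ("/a",), ("/b",)])

report = run_report(
    conn,
    "Requests per path",
    "SELECT path, COUNT(*) FROM log GROUP BY path ORDER BY path",
)
print(report)
```

Keeping the connection as a parameter means the same helper can be exercised against a small test database before pointing it at the full data set.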

For any errors or other issues, please contact me at lawrence.naraAtgmail.com.