Project Proposal Video : https://www.youtube.com/watch?v=IeKPIaOxPJI
RShiny Deployed Link: https://outbrainproject.shinyapps.io/WebOutBrain/
Tableau Online link: https://us-east-1.online.tableau.com/#/site/ads_outbrain_finalproject/workbooks (Note: since data file was big tableau server was used. Please use your credentials to login in view the dashboard)
Final Report document: ADS_Final_Project_Report_Team8.docx
DockerHub Link : : https://hub.docker.com/r/sweta/outbrain_prediction/
Project Demo Link: https://www.youtube.com/watch?v=19ihFCw8RIM
Final Presentation Link :ADS_Final_Project_Team8.pptx
Tools used: R, R-shiny for web devolpment, Python, Plotly and Tableau for analysis, Docker, IBM Data Science, Microsoft Azure, AWS
In Exploratory analysis we have used plotly in offline mode, hence the graph will not be visible unless the code is run. The Report has all the graph images along with the explanation.
Millions of user use social network for surfing, visiting countless websites and clicking on countless ads/recommendations on these website. Knowing what the users are interested in and what the users are using in real world would be of great significance for future recommendations used by marketing team to attract potential users as well as ad placements and real time bidding Predicting the likelihood of users clicking on a particular content Ranking the recommendations in each group by decreasing predicted likelihood of being clicked
We have made Kaggle submission of random forest model. The model is evaluated as shown in the image below
and below is the model evaulation
- Below is a screenshot of rest API that will predict advertisement ids for an individual display id.
- Below is the screenshot of rest API that will predict individual ad click probablity
-
Maximum percentage of clicks were made through mobile phones, followed by desktop and then tablets. Mainly because:
-
App Availability: the app is available that is available on desktop is now present on model too.
-
Convenience: Games or social networking apps frequently serve as a way to pass the time while on the subway commuting home or in a cab or surfing net. This directly reflects the increase in uses of mobile devices
-
We observed that the click frequency is very high between 10-15 hrs. Around 10-11 a.m. time people are usually on their commute to work, school etc. and pass their time surfing net. Around 1-3 p.m. is usually lunch time where people get time to surf internet and that’s the frequency is high.
-
The frequency in the bucket 15-20 is again when people are traveling back home and around dinner time when they have time to surf internet.
- The number of clicks increase from 9.am. and reaches highest at around 10 a.m. and remains high till 12p.m. and then gradually decreases till it hits 3.p.m. It again rises from 3p.m.
From the dashboard we can conclude that USA that the maximum number of view rate. The frequency of clicks gradually increases from 6 am and was high around 10 a.m. to 12 p.m. The click was highest on Day 0 which is June 14th. It is a US national holiday
- California state has the maximum number of views.
- The maximum number of views were from mobile devices at around 11 a.m.