- In this mini-project, we will focus on analyzing different aspects and statistics of the sport Soccer and visualizing historical data in such a way that it will help study different patterns and strategies various clubs/teams use and also study the strengths and weakeness of individual players.
- We have used a special and powerful opensource data-set by StatsBomb. It provides highly detailed information of several games event by event with record of evry time-stamp. Click here to explore the data set
- Firstly, we prepare a canvas to plot our data in the shape of a football field as shown below.
- Now we will plot Passes in form of vector and also analyze the density of pass origin using a kdeplot.
- Now let's target a particular team (here, FCB) and observe the patterns in the shots attempted using the vectors.
- We can easily conclude from the above image that maximum shots on target were originated within the 6-yard box. Let us understand this more clearly using the color coordinated scatter plot shown below. The red dots represents the origin place of the shots that were able to meet the back net whereas the blue ones were either deflected, saved by the keeper, or simply off-target.
- For the final part, we will firstly create a new dataframe to fit different skills of each individual player as a seperate attribute and compare it with standard deviation to get player rating. The normalized data of each individual player is the plotted on a spider-plot that display's the player's strength and weakness. Thus we can derive areas of excellence and areas for potential improvement.