project-fifa-moneyball

Project details: FIFA - MoneyBall

The challenge

Perform an end-to-end analysis putting into practice what you have learned so far. You will apply statistical or machine learning techniques and present your results to the class.

Possible Outcomes

  • Rank players by market value.
  • Highlight the top players for their outstanding performances over a discrete season.
  • Decide when to transfer a player.
  • Decide the best replacement for a transferred player.

You might suggest your own outcomes. Check with instructional staff before committing to a new option.

Objectives

  • Ask interesting and thoughtful questions and find the data to answer them.
  • Focus on improving in areas that are hard for you or learning more about something with which you feel comfortable.
  • Apply the statistical and machine learning techniques we have learned.
  • Create useful and clear graphs.
  • Present your insights in a thoughtful, clear, and accurate way.

Dataset

In this project, you will use the provided fifa23_players_dataset dataset.

Details about the dataset can be found here as well kaggle description

In case you want to find more information on how each column relates to a player, take a look at this website sofifa website

This data set includes:

  1. EA Sports FIFA 23 Game data:
  • Player Name
  • Club of the Player
  • League
  • Position
  • Pace
  • Shooting
  • Passing
  • Dribbling
  • Defending
  • Physical
  1. Transfermarkt extra info by player:
  • Date of Birth
  • Nationality
  • Height
  • Foot
  • Day Joined the current club
  • Day of Contract End
  • Market Value of the Player
  1. ESPN FC data from the past 5 years performance of each player
  • GS: Games Started
  • SB: Games Substituted
  • G: Goals Scored
  • A: Assists
  • SH: Shots
  • SG: Shots on Goal
  • FC: Fouls Committed
  • FS: Fouls Suffered
  • YC: Yellow Cards
  • RC: Red Cards

Instructions & Scope

  • You must plan your project. Creating a Kanban or using Trello or a similar app for a digital board is mandatory.
  • You CAN'T CODE until your project is planned.
  • Create a *.gitignore* file and include it in your repository.
  • As an optional you can include a linear regression as a way to answer question(s) on the data.

Deliverables

  • A well-commented Jupyter notebook with your analysis.
  • The final dataset after all cleaning and transformations.
  • Repository with your workflow + documentation + code.
  • Visual documentation of Kanban or Trello board link.

Tips & Tricks

  • Organize yourself (don't get lost!). Respect deadlines.
  • Ask for help but don't forget that Google is your friend.
  • Define a simple approach first. You never know how the data can betray you. 😉
  • Document your work.
  • Learn about the problem and what research has been done before you.
  • Before making a graph, think about what you want to represent.