This repository contains an in-depth analysis of online gaming behavior using a comprehensive dataset. The dataset includes various metrics and demographics related to player behavior, game-specific details, engagement metrics, and a target variable reflecting player retention.
The dataset captures the following features:
- PlayerID: Unique identifier for each player.
- Age: Age of the player.
- Gender: Gender of the player.
- Location: Geographic location of the player.
- GameGenre: Genre of the game the player is engaged in.
- PlayTimeHours: Average hours spent playing per session.
- InGamePurchases: Indicates whether the player makes in-game purchases (0 = No, 1 = Yes).
- GameDifficulty: Difficulty level of the game.
- SessionsPerWeek: Number of gaming sessions per week.
- AvgSessionDurationMinutes: Average duration of each gaming session in minutes.
- PlayerLevel: Current level of the player in the game.
- AchievementsUnlocked: Number of achievements unlocked by the player.
- EngagementLevel: Categorized engagement level reflecting player retention ('High', 'Medium', 'Low').
This dataset can be found in Kaggle: 🎮 Predict Online Gaming Behavior Dataset
- Perform exploratory data analysis (EDA) to understand the dataset.
- Identify key factors influencing player engagement levels.
- Build predictive models to classify player engagement.
- Analyze feature importance and provide insights for game design optimization.
- Summary statistics of numerical features.
- Distribution plots for age, play time, session duration, etc.
- Correlation matrix to identify relationships between features.
- Develop models to predict
EngagementLevel
using features like age, gender, location, game genre, play time, etc. - Models include decision trees, random forests, logistic regression, and neural networks.
- Hyperparameter Tuning: Optimize the performance of the Random Forest model.
- Model Comparison: Evaluate different machine learning models including SVM, Gradient Boosting, and Neural Networks.
- Cross-Validation: Ensure robustness of the models.
- Summarize the findings from the analysis.
- Provide recommendations for future work.
online_gaming_behavior_analysis.ipynb
: Jupyter Notebook containing the analysis.online_gaming_behavior_dataset.csv
: Dataset used for the analysis.README.md
: This file.
-
Clone the repository to your local machine:
git clone https://github.com/debjit-mandal/online-gaming-behavior-analysis.git cd online-gaming-behavior-analysis
-
Ensure you have the necessary dependencies installed. You can create a virtual environment and install the dependencies using:
python -m venv venv source venv/bin/activate # On Windows use `venv\Scripts\activate` pip install -r requirements.txt
-
Open the Jupyter Notebook:
jupyter notebook online_gaming_behavior_analysis.ipynb
-
Run the cells in the notebook to perform the analysis.
If you would like to contribute to this project, please fork the repository and create a pull request with your proposed changes.
This project is licensed under the MIT License - see the LICENSE file for details.
- This project is inspired by the growing interest in understanding player behavior in online gaming environments.
- Thanks to the open-source community for providing tools and libraries that make data analysis and machine learning accessible.