/SteamDataHarvester

SteamDataHarvester is a multi-threaded Python scraper for Steam user data. It efficiently navigates user networks, collecting game data and avoiding duplicates, ideal for Steam community analysis and trend tracking.

Primary LanguagePythonApache License 2.0Apache-2.0

SteamDataHarvester

Description

SteamDataHarvester is a multi-threaded Python tool for efficiently scraping Steam user data. It navigates through user networks, collects game data, and maintains a unique tracking system to avoid duplicate processing. This tool is ideal for analyzing trends and social networks within the Steam gaming community.

Features

  • Multi-threaded data scraping for optimized performance
  • Collects game playtime data from user profiles
  • Navigates through friends' networks to gather extended data
  • Avoids duplicate processing with a smart tracking system

Prerequisites

Before you begin, ensure you have met the following requirements:

  • Python 3.x installed
  • requests and pandas libraries installed
  • A Steam API key

Installation

Clone the repository to your local machine:

git clone https://github.com/hpekkan/SteamDataHarvester.git

SteamDataHarvester

Description

SteamDataHarvester is a multi-threaded Python tool for efficiently scraping Steam user data. It navigates through user networks, collects game data, and maintains a unique tracking system to avoid duplicate processing. This tool is ideal for analyzing trends and social networks within the Steam gaming community.

Features

  • Multi-threaded data scraping for optimized performance
  • Collects game playtime data from user profiles
  • Navigates through friends' networks to gather extended data
  • Avoids duplicate processing with a smart tracking system

Prerequisites

Before you begin, ensure you have met the following requirements:

  • Python 3.x installed
  • 'requests' and 'pandas' libraries installed
  • A Steam API key

Installation

Clone the repository to your local machine:

git clone https://github.com/hpekkan/SteamDataHarvester.git

Usage

To use SteamDataHarvester, follow these steps:

  1. Add your Steam API key and desired initial SteamID to the script.
  2. Run python scrapper.py in your terminal.
  3. Data will be saved in the 'user_data' folder.
  4. Run python merger.py in your terminal.
  5. Merged csv file be saved in './{counter}_user_data.csv'.

Contributing to SteamDataHarvester

To contribute, follow these steps:

  1. Fork this repository.
  2. Create a branch: git checkout -b <branch_name>
  3. Make your changes and commit them: git commit -m '<commit_message>
  4. Push to the original branch: git push origin <project_name>/<location>
  5. Create the pull request.

Alternatively, see the GitHub documentation on creating a pull request.

Contributors

Contact

If you want to contact me, you can reach me at huseyinpekkan1163@gmail.com.

License

This project uses the following license: Apache License 2.0.