GitHub Spider is a Python script that interacts with the GitHub API to crawl user profiles, follow users based on follower count, display user repositories, and star repositories that meet popularity criteria.
- User Crawling: Fetches and displays GitHub user profiles.
- Following Users: Automatically follows GitHub users with over 300 followers.
- Repository Handling: Displays and stars repositories based on popularity.
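The follow and star rules above can be read as two small predicates. The following is a minimal sketch, not the script's actual code: the function names are hypothetical, and `MIN_STARS` is a placeholder because this README does not state the popularity threshold. The `followers` and `stargazers_count` keys are the fields the GitHub REST API returns on user and repository objects.

```python
from typing import Any, Mapping

# "Over 300 followers" comes from the feature list above.
MIN_FOLLOWERS = 300
# Hypothetical placeholder: the README does not say what counts as "popular".
MIN_STARS = 1000


def should_follow(user: Mapping[str, Any], min_followers: int = MIN_FOLLOWERS) -> bool:
    """Return True when the user has strictly more than min_followers followers."""
    followers = user.get("followers") or 0  # missing or null counts as zero
    return int(followers) > min_followers


def should_star(repo: Mapping[str, Any], min_stars: int = MIN_STARS) -> bool:
    """Return True when the repository has at least min_stars stars."""
    stars = repo.get("stargazers_count") or 0  # missing or null counts as zero
    return int(stars) >= min_stars
```

Keeping the thresholds as keyword arguments makes it easy to experiment with different criteria without touching the decision logic.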
- Clone the repository:
    git clone https://github.com/Ian729/GithubSpider.git
    cd GithubSpider
- Install dependencies:
    pip install -r requirements.txt
- Set up your GitHub token as an environment variable:
    export GITHUB_TOKEN=your-github-token
  Replace `your-github-token` with your actual GitHub personal access token.
- Modify the script (`github_spider.py`) to customize the crawling depth (`DEPTH_LIMIT`) and other parameters as needed.
- Run the script:
    python github_spider.py
  The script starts crawling from a predefined GitHub user (`torvalds` by default), follows users, displays repositories, and stars popular repositories.
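To illustrate how a depth limit bounds a crawl, here is a minimal sketch of a breadth-first traversal. It rests on assumptions: this README does not say whether `github_spider.py` walks the graph breadth-first or depth-first, or which relation (followers or following) it expands, so `crawl` and `get_neighbors` are hypothetical names and the default of `2` for `DEPTH_LIMIT` is a placeholder. The neighbor lookup is passed in as a function so the sketch runs without network access.

```python
from collections import deque
from typing import Callable, Iterable, Iterator, Tuple

# Placeholder default; the value used in github_spider.py may differ.
DEPTH_LIMIT = 2


def crawl(
    start: str,
    get_neighbors: Callable[[str], Iterable[str]],
    depth_limit: int = DEPTH_LIMIT,
) -> Iterator[Tuple[str, int]]:
    """Yield (login, depth) pairs breadth-first, never going past depth_limit."""
    seen = {start}
    queue = deque([(start, 0)])
    while queue:
        login, depth = queue.popleft()
        yield login, depth
        if depth >= depth_limit:
            continue  # do not expand users at the depth limit
        for neighbor in get_neighbors(login):
            if neighbor not in seen:  # visit each user at most once
                seen.add(neighbor)
                queue.append((neighbor, depth + 1))
```

In a real run, `get_neighbors` would wrap an API call that lists the logins connected to a user. For a dry run it can be a dictionary lookup, for example `crawl("torvalds", lambda u: graph.get(u, []))` with a hand-written `graph`.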
Contributions are welcome! If you'd like to contribute to this project, please fork the repository and submit a pull request.
This project is licensed under the MIT License - see the LICENSE file for details.
- GitHub Token: Ensure your GitHub token (`GITHUB_TOKEN`) is set properly in your environment variables to avoid unauthorized access errors.
- Customization: Modify the script (`github_spider.py`) to adjust parameters like `DEPTH_LIMIT`, follower count criteria, and repository popularity thresholds according to your requirements.
- Dependencies: Check and update `requirements.txt` as your project evolves and new dependencies are added.
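One way to catch a missing token early is to check the environment before making any request. The sketch below is an assumption about how such a check could look, not the script's actual code; `build_headers` is a hypothetical helper name. The `Authorization: Bearer` scheme and the `application/vnd.github+json` media type are the ones documented for the GitHub REST API.

```python
import os
from typing import Dict, Mapping, Optional


def build_headers(env: Optional[Mapping[str, str]] = None) -> Dict[str, str]:
    """Build GitHub API request headers from GITHUB_TOKEN, failing fast if unset."""
    env = os.environ if env is None else env
    token = env.get("GITHUB_TOKEN", "").strip()
    if not token:
        # Fail before any request is sent instead of getting a 401 later.
        raise RuntimeError("GITHUB_TOKEN is not set; export it before running the script.")
    return {
        "Authorization": f"Bearer {token}",
        "Accept": "application/vnd.github+json",
    }
```

Accepting an optional mapping instead of reading `os.environ` directly keeps the helper easy to test without changing the real environment.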