basic API sergo programs demonstrating using web scraping with Colly, GOquery, Selenium using proxy servers
This repository contains two branches:
-
The
basic
branch contains the base code that will be modified in the article Go Proxy Servers. -
The
main
branch is the result of the article tutorial.
This project demonstrates setting up proxy servers in Web Scraping in Go, Proxies protect your digital identity during web scraping by using their IP address, circumventing IP bans and geoblocking.
To use this project, you need to have Go installed on your machine. You can download and install Go from the official website: https://golang.org/
-
Clone the repository:
git clone https://github.com/shacharbd/proxy-scrape-go.git
-
Navigate to the project directory:
cd proxy-scrape-go
-
Install the dependencies:
go mod download
This project demonstrates how to use web scraping with proxy servers in Go. It utilizes the following libraries:
- Colly - a scraping framework for Go
- Goquery - a library for parsing HTML in Go
- Selenium - a browser automation tool
To run the project, make sure you have a valid proxy server. You can obtain proxy server details from a provider like Bright Data. Once you have the proxy server details, update the main.go
file with the appropriate proxy configuration.
Contributions are welcome! If you find any issues or have suggestions for improvement, please open an issue or submit a pull request.
This project is licensed under the MIT License. See the LICENSE file for more information.