/Linkedin-Java-Scraper

A scraper that can perform login into your LinkedIn Account and scrap email ids from particular post comments.

Primary LanguageJava

Linkedin-Java-Scraper

The Scraper is built with Java and Selenium. You can enter Username and password of LinkedIn account along with the URL of the post and scraper will start a chrome instance and scrap all the comments(Top level), emails, and users profile URL. The data will be available in the folder where you .jar file in an excel format.

Dependencies

  • Java
  • Chrome
  • Selenium chrome driver (According to your chrome version)
For scrapy runtime Server
Clone the repo

1) Create a new folder on your system

2) Copy latest .jar file available in the "EXE" folder of the repo and put it into the folder you created in the above step

3) Install Chrome

4) Find your chrome version and download selenium chrome driver for that version (Driver link: https://chromedriver.chromium.org/downloads)

5) Copy downloaded chromedriver in the same folder where you .jar file is

6) Run the .jar file

7) all done

Note

The application runs in two steps. In the first, it will perform login and load all the comments on the web page by opening the post URL(It will wait for 60 sec after loading all the comments to handle network issue). Now, in the second part, it will start scraping the comments (By waiting random time between 1 to 30 seconds).

Home

Screenshot (125)

config

Screenshot (142)

Performing login

Screenshot (132)

Loading comments

Screenshot (138)

Scraping comments

Screenshot (139)

Scraped data (Excel file)

Screenshot (141)