Instagram-Crawler

Non-API crawler. Crawls posts (photo, likes, comments, date, ...) by username or hashtag.

Primary language: Python. License: MIT.

Installation

  1. Make sure you have the Chrome browser installed.
  2. Download chromedriver and put it into the driver folder: ./driver/chromedriver
  3. Install the requirements: pip install -r requirements.txt

Examples:

Results are saved under the ./data folder.

Download the first 10 photos and their information for the username "instagram":

$ python3 crawl.py -q 'instagram' -n 10

Download the first 7 photos and their information (including all comments) for the hashtags #hello and #hi:

$ python3 crawl.py -q '#hello, #hi' --a -n 7

You can enter multiple usernames or hashtags by separating them with commas.
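
The comma-separated query format can be illustrated with a short sketch. This is not taken from crawl.py; the function name split_query is hypothetical, and the sketch assumes only what the text states: items are separated by commas, and a leading '#' marks a hashtag.

```python
def split_query(query):
    """Split a comma-separated query string into usernames and hashtags.

    Hypothetical helper for illustration; crawl.py may parse its -q
    argument differently.
    """
    usernames, hashtags = [], []
    for item in query.split(","):
        item = item.strip()
        if not item:
            continue  # skip empty entries such as a trailing comma
        if item.startswith("#"):
            hashtags.append(item[1:])  # drop the leading '#'
        else:
            usernames.append(item)
    return usernames, hashtags


print(split_query("username1, username2, #hashtag"))
```

Stripping whitespace around each item means that both 'a,b' and 'a, b' are accepted.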

Example of a file's data

likes: 
5,326

comments: 
923

caption: 
Art of @kendricklamar by @illestration
Bold, bright and colorful. 

commentMessages: 
tttt: Amazing 😉
this_is_t.rs: my name says it all

dateTime: 
2018-05-30T19:42:03.000Z

The photo of each post is downloaded to the ./data folder.
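
For downstream processing, a data file in the layout shown above could be read back roughly as follows. This is a minimal sketch, not part of the project: parse_post and FIELDS are hypothetical names, the field list is taken from the example, and each comment is assumed to fit on one line in the form "user: message".

```python
from datetime import datetime

# Field names as they appear in the example data file.
FIELDS = ("likes", "comments", "caption", "commentMessages", "dateTime")


def parse_post(text):
    """Parse one data file into a dict (hypothetical helper)."""
    sections, current = {}, None
    for line in text.splitlines():
        stripped = line.strip()
        if stripped.endswith(":") and stripped[:-1] in FIELDS:
            current = stripped[:-1]  # a header line such as "likes:"
            sections[current] = []
        elif current is not None and stripped:
            sections[current].append(stripped)

    post = {name: "\n".join(lines) for name, lines in sections.items()}
    for name in ("likes", "comments"):
        if post.get(name):
            post[name] = int(post[name].replace(",", ""))  # "5,326" -> 5326
    if post.get("dateTime"):
        post["dateTime"] = datetime.strptime(
            post["dateTime"], "%Y-%m-%dT%H:%M:%S.%fZ"
        )
    # Assumption: one comment per line, written as "user: message".
    post["commentMessages"] = [
        (user, message)
        for user, _, message in (
            entry.partition(": ") for entry in sections.get("commentMessages", [])
        )
    ]
    return post


# Sample abridged from the example data above.
sample = """likes:
5,326

comments:
923

caption:
Art of @kendricklamar by @illestration
Bold, bright and colorful.

commentMessages:
tttt: Amazing
this_is_t.rs: my name says it all

dateTime:
2018-05-30T19:42:03.000Z
"""

post = parse_post(sample)
print(post["likes"], post["comments"], post["dateTime"].isoformat())
```

Headers are matched against the known field names rather than any line ending in a colon, so that comment lines such as "tttt: Amazing" are not mistaken for a new section.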

Usage

Usage:
    crawl.py [-q QUERY] [-n NUMBER] [--a] [-h HELP]
    
Options:
    -q QUERY  username, add '#' to search for hashtags, e.g. 'username' or '#hashtag'
                  For multiple queries, separate them with commas, e.g. 'username1, username2, #hashtag'

    -n NUM    number of returned posts [default: 10000]

    --a       collect all comments
    
    -h HELP   show this help message and exit
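
To make the option semantics concrete, here is a sketch of an equivalent command-line interface built with the standard argparse module. It is an illustration only: crawl.py may use a different argument parser, and build_parser and the dest names (query, number, all_comments) are hypothetical.

```python
import argparse


def build_parser():
    """Build a parser mirroring the options listed above (illustrative)."""
    parser = argparse.ArgumentParser(prog="crawl.py")
    parser.add_argument(
        "-q", dest="query", metavar="QUERY",
        help="username, add '#' to search for hashtags",
    )
    parser.add_argument(
        "-n", dest="number", metavar="NUM", type=int, default=10000,
        help="number of returned posts",
    )
    parser.add_argument(
        "--a", dest="all_comments", action="store_true",
        help="collect all comments",
    )
    return parser  # argparse adds -h automatically


# Same arguments as the hashtag example: -q '#hello, #hi' --a -n 7
args = build_parser().parse_args(["-q", "#hello, #hi", "--a", "-n", "7"])
print(args.query, args.number, args.all_comments)
```

When -n is omitted, the sketch falls back to the documented default of 10000 posts.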

Contributing

This is an open source project so feel free to contribute by