/instagramy

Python Package for Instagram Users, Posts, and Hashtag data.

Primary LanguagePythonMIT LicenseMIT

Instagramy

Python Package for Instagram Without Any external dependencies

PyPi Downloads GitHub stars GitHub forks GitHub license Code style GitHub Repo size GitHub Actions GitHub Actions

Scrape Instagram Users Information, Posts data, Hashtags and Locations data. This Package scrapes the user's recent posts with some information like likes, comments, captions and etc. No external dependencies.

Features

Download

Installation

pip install instagramy

Upgrade

pip install instagramy --upgrade

Sample Usage

Getting Session Id of Instrgram

For Login into Instagram via instagramy session id is required. No username or password is Needed. You must be login into Instagram via Browser to get session id

  1. Login into Instagram in default webbrowser
  2. Move to Developer option
  3. Copy the sessionid
    • Move to storage and then to cookies and copy the sessionid (Firefox)
    • Move to Application and then to storage and then to cookies and copy the sessionid (Chrome)

Note: Check for session id frequently, It may be changed by Instagram

Instagram User details

Class InstagramUser scrape some of the information related to the user of the Instagram

>>> from instagramy import InstagramUser

>>> session_id = "38566737751%3Ah7JpgePGAoLxJe%334"

>>> user = InstagramUser('google', sessionid=session_id)

>>> user.is_verified
True

>>> user.biography
'Google unfiltered—sometimes with filters.'

>>> user.user_data # More data about user as dict

If you get the data of the user onetime, instagramy store the data as cache file for avoid the error. you can get the data from cache also. Don't provide the sessionid.

>>> from instagramy import InstagramUser

>>> user = InstagramUser('google', from_cache=True)

>>> user.is_verified
True

It is opt of all classes InstagramUser, InstagramHashTag and InstagramPost.

Show all Properties

  • biography
  • connected_fb_page
  • followed_by_viewer
  • follows_viewer
  • fullname
  • has_blocked_viewer
  • has_country_block
  • has_requested_viewer
  • is_blocked_by_viewer
  • is_joined_recently
  • is_private
  • is_verified
  • no_of_mutual_follower
  • number_of_followers
  • number_of_followings
  • number_of_posts
  • other_info
  • posts
  • posts_display_urls
  • profile_picture_url
  • requested_by_viewer
  • restricted_by_viewer
  • username
  • website

InstagramUser.user_data has more data other than defined as Properties

Instagram Hashtag details

Class InstagramHashTag scrape some of the information related to the hash-tag of the Instagram

you can also set your sessionid as env variable

$ export SESSION_ID="38566737751%3Ah7JpgePGAoLxJe%er40q"
>>> import os

>>> from instagramy import InstagramHashTag

>>> session_id = os.environ.get("SESSION_ID")

>>> tag = InstagramHashtag('google', sessionid=session_id)

>>> tag.number_of_posts
9556876

>>> tag.tag_data # More data about hashtag as dict
Show all Properties

  • number_of_posts
  • posts_display_urls
  • profile_pic_url
  • tagname
  • top_posts

InstagramHashTag.tag_data has more data other than defined as Properties

Instagram Post details

Class InstagramPost scrape some of the information related to the particular post of Instagram. It takes the post id as the parameter. You can get the post id from the URL of the Instagram posts from the property of InstagramUser.posts. or InstagramHagTag.top_posts

>>> from instagramy import InstagramPost

>>> session_id = "38566737751%3Ah7JpgePGAoLxJe%334"

>>> post = InstagramPost('CLGkNCoJkcM', sessionid=session_id)

>>> post.author
'ipadpograffiti'

>>> post.number_of_likes
1439

>>> post.post_data # More data about post as dict
Show all Properties

  • author
  • caption
  • display_url
  • get_json
  • number_of_comments
  • number_of_likes
  • post_source
  • text
  • type_of_post
  • upload_time

InstagramPost.post_data has more data other than defined as Properties

Instagram Location details

Class InstagramLocation scrape some of the information and posts related to the given Location . It takes the location id and slug as the parameter. You can get the location id and slug from the URL of the Instagram Location or from the property of InstagramPost.location.id and InstagramPost.location.slug.

>>> from instagramy import InstagramPost

>>> session_id = "38566737751%3Ah7JpgePGAoLxJe%334"

>>> post = InstagramPost('CLGkNCoJkcM', sessionid=session_id)

>>> location_id, slug = post.location.id, post.location.slug

>>> from Instagramy import InstagramLocation

>>> location = InstagramLocation(location_id, slug, session_id)

>>> location.latitude
28.6139

>>> location.longitude
77.2089

>>> location.address
{'street_address': 'T2, Indira Gandhi International Airport', 'zip_code': '', 'city_name': 'New Delhi', 'region_name': '', 'country_code': 'IN', 'exact_city_match': False, 'exact_region_match': False, 'exact_country_match': False}

you can also get the location id and slug from the instagram url

https://www.instagram.com/explore/locations/977862530/mrc-nagar
https://www.instagram.com/explore/locations/<location_id>/<slug>
Show all Properties

  • address
  • id
  • latitude
  • location_data
  • longitude
  • name
  • number_of_posts
  • phone
  • profile_pic_url
  • sessionid
  • slug
  • top_posts
  • url
  • viewer
  • website

InstagramLocation.location_data has more data other than defined as Properties

Plugins

Instagramy has some plugins for ease

Plugins for Data Analyzing

  • analyze_users_popularity
  • analyze_hashtags
  • analyze_user_recent_posts
>>> import pandas as pd
>>> from instagramy.plugins.analysis import analyze_users_popularity

>>> session_id = "38566737751%3Ah7JpgePGAoLxJe%334"

>>> teams = ["chennaiipl", "mumbaiindians",
        "royalchallengersbangalore", "kkriders",
        "delhicapitals", "sunrisershyd",
        "kxipofficial"]
>>> data = analyze_users_popularity(teams, session_id)
>>> pd.DataFrame(data)

                   Usernames  Followers  Following  Posts
0                 chennaiipl    6189292        194   5646
1              mumbaiindians    6244961        124  12117
2  royalchallengersbangalore    5430018         59   8252
3                   kkriders    2204739         68   7991
4              delhicapitals    2097515         75   9522
5               sunrisershyd    2053824         70   6227
6               kxipofficial    1884241         67   7496

Plugins for Downloading Posts

  • download_hashtags_posts
  • download_post
  • download_profile_pic
>>> import os

>>> from instagramy.plugins.download import *

>>> session_id = os.environ.get('SESSION_ID')

>>> download_profile_pic(username='google', sessionid=session_id, filepath='google.png')

>>> download_post(id="ipadpograffiti", sessionid=session_id, filepath='post.mp4')

>>> download_hashtags_posts(tag="tamil", session_id=session_id, count=2)

Use Without Login

You can use this package without login. Sessionid is not required but it may rise RedirectionError error after four to five requests.

>>> from instagramy import *

>>> user = InstagramUser('google')
>>> user.fullname
'Google'
>>> tag = InstagramHashTag('python')
>>> tag.tag_data

Caching Feature

from version 4.3, Added the new feature that is caching the required data. If you get the data of the user onetime, instagramy store the data as cache json file for avoid the error. you can get the data from cache also. Don't need to provide the sessionid. Instead of sessionid add the optional parameter from_cache=True.

>>> from instagramy import InstagramUser

>>> user = InstagramUser('google', from_cache=True)

>>> user.is_verified
True

It is opt of all classes InstagramUser, InstagramHashTag, InstagramPost and InstagramLocation.

Clear all Caches created by instagramy in current dir by

>>> from instagramy.core.cache import clear_caches

>>> clear_caches() # clear all caches of instagramy

List of all Cache files created by instagramy in current dir

>>> from instagramy import list_caches

>>> list_caches() # list all caches of instagramy

Sample Scripts

Getting Email address and phone number

user = InstagramUser('username')
email, phone_number = user.user_data['business_email'], user.user_data['business_phone_number']

✏️ Important Notes

  • Don't send huge request to Instagram with sessionid, Instagram may ban you.
  • You can use this package without sessionid (Login). But it may RedirectionError after four to five requests.
  • class Viewer provide the data about currently logged in user.
  • Check for session id frequently, It may be changed by Instagram
  • If code execution is never gets completed, check and change your session id and try again.
  • Don't provide the wrong session_id.
  • InstagramUser.user_data, InstagramPost.post_data, InstagramHashtag.tag_data and InstagramLocation.location_data which is python dict has more and more data other than defined as Properties.
  • This Package does not scrap all the posts from an account, the limit of the post only 12 (For non-private account)
  • This Package not scrap all the posts of given hash-tag and location it only scrapes the top 60 - 72 posts.

Disclaimer

If you send the huge request to the Instagram with session id Instagram may ban you. I am not responsible for any misuse or damage caused by this program.

License

MIT License

Contributions

Contributions are Welcome. Feel free to report bugs in issue and fix some bugs by creating pull requests. Comments, Suggestions, Improvements and Enhancements are always welcome. Let disscuss about it Here.

Made with Python ❤️