/quantifying-greenwashing

Predicting a company's sustainability rating based on their social media.

Primary LanguageJupyter Notebook

The aim of this project is to analyze the language and behaviours of Global 2000 companies around greenwashing.

Selected Company Data

  • Every Global 2000 Company with:
    • a verified instagram account
    • captions that are 80% in english
  • Instagram posts per company:
    • every post in 2020

Instagram Metrics

The Instagram scraper built for this project was built using the extraordinary Instaloader package. The fifteen attributes below were collected for each companies Instagram feeds.

post_info = {
				"shortcode": post.shortcode,
				"username": company,
				"date_utc": post.date_utc.strftime("%Y-%m-%d %H:%M"),
				"is_video": "yes" if post.is_video else "no",
				"is_sponsored": post.is_sponsored,
				"hashtags": ",".join(post.caption_hashtags),
				"mentions": ",".join(post.caption_mentions),
				"caption": (emoji.demojize(post.caption)).encode('utf-8'),
				"video_view_count": post.video_view_count if post.is_video else 0,
				"video_length": post.video_duration if post.is_video else 0,
				"likes": post.likes,
				"comments": post.comments,
				"likes+comments": (post.likes + post.comments),
				"location_name": post.location.name if post.location else "",
				"location_latlong": " ".join((str(post.location.lat), str(post.location.lng))) if post.location else ""
				}

Sustainability Score

Each companies ESG & sustainbility score was obtained from the student version of CSRhub.

Alternate image text