This tool serves anyone who owns, or manages, a Facebook page and wants to know what people are talking about in a given post.
A simple front-end that runs a basic keyword extraction on Facebook posts. In addition, it employs spaCy's default models to extract named entities from the comments; visit the spaCy page to learn more about named entities. The tool is essentially a word counter that applies standard NLP pre-processing, plus the NER step performed by spaCy.
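As a rough illustration of the NER step, here is a minimal sketch of how a comment could be run through a default spaCy pipeline (it assumes the small English model `en_core_web_sm` is installed; the actual back-end code may differ):

```python
import spacy

# Load spaCy's default small English model
# (install it first with: python -m spacy download en_core_web_sm)
nlp = spacy.load("en_core_web_sm")

comment = "Al Gore and Bill Nye talked about Hawaii for days."
doc = nlp(comment)

# Each entity carries its text span and a label such as PERSON or GPE
for ent in doc.ents:
    print(ent.text, ent.label_)
```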
It brings up a web app supported by a Python back-end, which is a tailored version of whats-the-topic. It requires an access token to get people's comments on a selected post. Additional info on how to get a token can be found at this link. In short, once a Facebook developers account has been created, the access token can be generated through the Facebook Graph API.
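For reference, fetching the comments with the facebook-sdk package (which the back-end relies on) could look roughly like the sketch below; the token string and post ID are placeholders, and the Graph API version is an assumption:

```python
import facebook

# Placeholder values: use your own token and a real <page_id>_<post_id> pair
ACCESS_TOKEN = "YOUR_ACCESS_TOKEN"
POST_ID = "PAGEID_POSTID"

# Graph API version 3.1 is assumed here; pick whatever your token supports
graph = facebook.GraphAPI(access_token=ACCESS_TOKEN, version="3.1")

# Retrieve the comments attached to the selected post
comments = graph.get_connections(id=POST_ID, connection_name="comments")
for comment in comments["data"]:
    print(comment["message"])
```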
The tool performs text preprocessing (tokenization, stopword filtering, stemming) to produce a keyword-count plot plus a word cloud image, using the awesome word_cloud library.
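To make the pipeline concrete, the preprocessing and word cloud steps could be approximated as follows with NLTK and word_cloud; this is an illustrative sketch, not the project's exact code, and the Snowball stemmer and NLTK stopword list are assumptions:

```python
from collections import Counter

import nltk
from nltk.corpus import stopwords
from nltk.stem.snowball import SnowballStemmer
from wordcloud import WordCloud

# Tokenizer and stopword data are needed on first run
nltk.download("punkt")
nltk.download("stopwords")

comments = [
    "Climate change is not a hoax!",
    "Stop eating meat to save the planet.",
]

stemmer = SnowballStemmer("english")
stop_words = set(stopwords.words("english"))

# Tokenize, drop stopwords and punctuation, and stem what is left
tokens = []
for text in comments:
    for tok in nltk.word_tokenize(text.lower()):
        if tok.isalnum() and tok not in stop_words:
            tokens.append(stemmer.stem(tok))

# Count the stemmed keywords and render the word cloud from the frequencies
counts = Counter(tokens)
print(counts.most_common(10))

cloud = WordCloud(width=800, height=400).generate_from_frequencies(counts)
cloud.to_file("wordcloud.png")
```

Note that stemming is why the tables below show truncated forms such as "clim" and "chang" rather than "climate" and "change".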
This tool has been developed on Ubuntu 18.04 and macOS High Sierra, but has never been seriously tested. It requires Python 3+ and all the packages listed in requirements.txt.
Here are two images, the keyword-count bar plot and the word cloud, produced by running the tool on this post:
The data
word | count |
---|---|
clim | 11 |
years | 9 |
hoax | 6 |
chang | 5 |
stop | 4 |
planet | 4 |
10 | 4 |
biggest | 4 |
ever | 4 |
giv | 3 |
meat | 3 |
increasing | 3 |
species | 3 |
volcanoes | 3 |
guess | 3 |
This is a bar plot of the top N entities extracted from this post:
The data
entities | count |
---|---|
Hawaii | 3 |
Earth | 2 |
Norm Haas | 2 |
Earths | 1 |
the First Law of Thermodynamics | 1 |
Rene | 1 |
Al Gore | 1 |
Bill Nye | 1 |
only one | 1 |
97 % | 1 |
at least 30 | 1 |
the First Law | 1 |
100% | 1 |
Maula Loa | 1 |
days | 1 |
Time | 1 |
Thanks to the people at spaCy for the NER part, to the people who produced facebook-sdk for the ease of access to the data, and finally to the folks who made word_cloud for the awesome word-cloud images it can produce.