/SocialScraper

Social Scraper is a python tool meant for Detection of Child Predators/Cyber Harassers on Social Media

Primary LanguagePythonGNU General Public License v3.0GPL-3.0

SocialScraper


build status Analyze version Coverage
Test Python V3.7 License status-up

Social Scraper is a python tool meant for Detection of Child Predators/Cyber Harassers on Social Media


Tool intends to identify the cyber predators/child harassers on social media with a malevolent intend. The posts, comments and followers on the social media are subjected to analysis using Artificial Intelligence, Machine Learning with IGPL and NSFW(Not Safe For Work) to categorise the offensive contents.

This system is capable of analyzing all social media platforms like Instagram, Twitter, Facebook, etc., and other outlets seeking the same suspect. If the suspect doesn’t have the same user ID on different platforms, then Reverse Image Searching is done to identify the suspect. A set of user_id is used as a key to grab their personal information and their post information(Post ID, Comments, Timestamp, location, Captions) from multiple social platforms using ​ OSINT(Open Source INTelligence) and Beautifulsoup Python Package. The above data of various posts are subjected to analyze malevolent contents using Machine Learning and Pandas Python library.Based on the statistical analysis, suspects are categorized based on their behavior(also Polite harassment). The users whose suspect level is greater than the threshold value will be scrutinized and monitored for further analysis. The suspected user’s post information(media like Image, Audio, and Video) is retrieved and analyzed using the ​ IGPL Python package, ​ Urllib and ​ Artificial Intelligence with ​ NSFW (Not Safe For Work) library to make them fall under the category 'suspects/predators'. Finally, the Child grooming patterns followers and statistical results that are generated are analyzed and the concerned person is classified as predator and reported to the law enforcement authorities

Tool UI


Creators: 👤

Aravindha Hariharan M
Kabilan S
Gowtham G
Giridhara Prasath G

Prerequisites 📦

1.Python 3.X with pip3 Installed
If not then, pip3 installation

apt install python3-pip

To Check pip versioon

pip3 --version

2.Geckodriver for Mozilla Firefox
If you havn't installed then, Visit the link below, Download the required file for resepective operating system and install.

https://github.com/mozilla/geckodriver/releases

User List Creation

Tool can handle N-number of user account scrapping which can be given a user.txt

You can use any kind of text editor to edit user.txt

gedit user.txt | vim user.txt

Installation 💾

Python Direct module Installation

python3 -m pip install SocialScraper

From Source Package

Open Terminal and type

git clone https://github.com/Aravindha1234u/SocialScraper

cd SocialScraper

Automatic Setup

chmod +x setup

./setup

Manual Setup

To Install required Python package

pip3 install -r requirements.txt

or

python3 -m pip install -r requirements.txt

Api Keys

We haven't included our keys for usage. Add your respective api keys to SocialScraper/social/api.py and replace the google credentials.json and client_secret.json to sample directory and facebook credentials in credentials.yaml for scrappering the accounts.

** Google API **

Get it signed in and once you get your API key, make sure that you have enabled gmail service to this.So that automatic mail can function. Gmail API

** Imagga API **

Sign up as a hacker and get the API key. Imagga API

** DeepAI **

Sign up and get the API key DeepAI API

Execution 👍

To Run SocialScraper

python3 main.py

Issues

Feel free to express any kind of bug or error in this tool by reporting it in issues, So that it can be fixed soon.

Important Message ⚠️

This tool is for research purposes only. Hence, the developers of this tool won't be responsible for any misuse of data collected using this tool. Used by many researchers and open source intelligence (OSINT) analysts.

License 📄

SocialScraper is licensed under GNU General Public License v3.0. Take a look at the License


Tool Working 🎴

Tool Working