Content Moderation

Due to interactions between large communities among different channels in Rocket Chat, there was a need for support of an optional moderation service for offensive content. The service as of now is limited to image & links moderation which means if someone sends an offensive image or link to Rocket Chat app and the app along with server is deployed and configured then the image will be blocked but not videos. The dockerised moderation service can be deployed to any server easily since all the major Cloud Providers such as AWS, GCP, Azure, IBM Cloud, etc. provides support to Docker.

Quick start for code developers

Prerequisites:

Rocket.Chat-Deploy
npm install -g @rocket.chat/apps-cli
Docker

Depending on the installation & machine while running docker commands you may want to use 'sudo' if you encounter any errors.

Open a Command Line and execute the following code.

git clone https://github.com/RocketChat/content-moderation.git
cd content-moderation
docker-compose up -d // This will launch a Rocket Chat Instance

Open Rocket.Chat instance ( http://127.0.0.1:3000 ) and Go through the Rocket.Chat initial setup. Remember user-name and password for future use (we will also need it in later steps).
Generate Personal Access Tokens My Account -> Personal Access Tokens -> Add (You can either ignore or not ignore 2 Factor Authentication) Copy User-ID & Token for future use.
From Rocket Chat open Administration -> General -> Apps and make sure the following options are enabled:

Enable App development mode
Enable the App Framework

For Rocket.Chat Content-Moderation-App installation follow steps mentioned here

After deployment let's configure Content Moderation App so that app can help in posting images to the hosted moderation-service to make predictions and block offensive images/links.
In our case:

Administration -> Apps -> Content Moderation.
'Rocket Chat host URL': http://rocket-chat:3000 & 'Content Moderation App Host URL': http://moderation-api:5000/predict in Content Moderation App's Setting.
Now, Let's deploy our service!!
Edit docker-compose-server.yml in your local content-moderation directory. & change the following parameters:
a. RC_UUID
b. RC_TOKEN
We copied them in previous step.

cd .. # Make sure you're in moderation directory
docker-compose -f docker-compose-server.yml up -d

Everything is configured now. We can now test the app!!.

Try posting an offensive image in one of the channels & it should get blocked!

To see the logs generated:

docker logs moderation_rocketchat_1
docker logs moderation_api_1

Note

The Machine Learning model currently recognises only JPEGs and PNGs.

Bits about our Machine Learning Models

We have two PyTorch models(resnet18 & VGG16) and two fastai models(resnet18 & resnet50). We are currently using PyTorch's resnet18 architecture.
All Jupyter Notebooks have Google Colab Link where contributers can contribute by training our ML models on more datasets, optimising hyperparameters, etc..
You can also train the Machine Learning with your own data. To know more read here.

Contribute towards the expansion of the Project:

As of now we have only one Machine Learning model that is capable of classifying the offensive content with an accuracy of ~92%. To expand the service for different medias like Gifs, Videos, all the other media that requires analysing the media frame by frame for classification :