/tae-gpt2-bot

Redditor imitator based on GPT-2 language model

Primary language: Python. License: MIT.

tae reddit bot and web experiment

About

This was a social web experiment that took place in several Turkish meme subreddits between April 18 and May 5, 2020.

The bot imitates users of r/tamamahbapengelli (and some other similar subs) by posting comments generated by a fine-tuned GPT-2 Transformer language model.

What I did

I scraped over 3000 comments from the r/tae subreddit, filtered them, and created a dataset to fine-tune the GPT-2 355M model using this example Colab Notebook.
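The filtering and dataset step could look roughly like the sketch below. The filter rules (dropping deleted comments, links, very short or very long text, and duplicates), the function names, and the length limits are all assumptions for illustration, not the rules actually used in this repo. The `<|endoftext|>` delimiter is GPT-2's standard end-of-text token, which is commonly used to separate training samples.

```python
import re

URL_RE = re.compile(r"https?://\S+")


def clean_comments(comments, min_chars=10, max_chars=500):
    """Return de-duplicated comments that pass simple quality filters.

    All thresholds here are hypothetical defaults.
    """
    seen = set()
    kept = []
    for text in comments:
        text = " ".join(text.split())  # collapse whitespace and newlines
        if text in ("[deleted]", "[removed]"):
            continue  # placeholder left by Reddit, not real content
        if URL_RE.search(text):
            continue  # skip comments that contain links
        if not (min_chars <= len(text) <= max_chars):
            continue  # skip very short or very long comments
        key = text.casefold()
        if key in seen:
            continue  # skip case-insensitive duplicates
        seen.add(key)
        kept.append(text)
    return kept


def build_dataset(comments, delimiter="<|endoftext|>"):
    """Join comments into one training text, one delimited record each."""
    return "".join(f"{c}\n{delimiter}\n" for c in comments)
```

The resulting string can be written to a single text file and uploaded to the notebook as the fine-tuning corpus.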

I spent 2 days training and fine-tuning the model to make it generate better comments (this wasn't very effective).

I finished writing the bot and made it comment under posts in as human-like a way as possible, combining GPT-2-generated text with human-like bot behaviour (sleeping at night, not commenting on a new post immediately, etc.).
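A minimal sketch of the two behaviours mentioned above (sleeping at night and waiting before commenting on a new post) is shown here. The sleep window, the delay range, and all function names are hypothetical; the actual bot may use different values and logic.

```python
import random
from datetime import datetime, timedelta


def is_awake(now, sleep_start_hour=1, sleep_end_hour=9):
    """Return True outside the (assumed) nightly sleep window."""
    return not (sleep_start_hour <= now.hour < sleep_end_hour)


def pick_delay(rng, min_minutes=5, max_minutes=45):
    """Pick a random wait so replies do not appear instantly."""
    return timedelta(minutes=rng.uniform(min_minutes, max_minutes))


def should_comment(post_created, now, delay):
    """Comment only when awake and the post is older than the delay."""
    return is_awake(now) and (now - post_created) >= delay


if __name__ == "__main__":
    rng = random.Random()
    created = datetime.now() - timedelta(minutes=20)
    print(should_comment(created, datetime.now(), pick_delay(rng)))
```

Drawing a fresh delay per post, rather than using a fixed one, keeps the reply timing from forming an obvious pattern.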

Notes

The scraped and generated data contain many Turkish and English swear words (including racist ones). Be cautious if you want to use them.