/clickbait-generator

LSTM Clickbait Headline Generator

Primary LanguageJupyter Notebook

Clickbait Generator

This word-level LSTM model generates clickbait headlines like the following:

  • we know your zodiac sign based on your zodiac sign
  • the 17 most important canadian celebrity moments of 2015
  • here's how to make a vampire
  • can you guess your favorite '90s movie based on your favorite kitten
  • are you more a canadian or taylor swift or oprah

Running

pipenv shell
jupyter notebook clickbait.ipynb

Data

This model uses the collection of 17,000 clickbait headlines scraped from the following esteemed publications:

  • BuzzFeed
  • Upworthy
  • ViralNova
  • Thatscoop
  • Scoopwhoop
  • ViralStories

Taken from the paper "Stop Clickbait: Detecting and Preventing Clickbaits in Online News Media"
Data Source

Model

Word Embeddings

This model trains its own 10-dimensional embeddings.

Architecture

The model's current architecture is a two-layer LSTM with 256 units and a 20% dropout rate.

Further Work

  • Get more data
  • Replace all this with a transformer

Sources

Inspired by Lars Eidnes' blog post
"Stop Clickbait: Detecting and Preventing Clickbaits in Online News Media" link
Excellent RNN intro by Andrej Karpathy link