/dm-markov

Markov chain text generator built from Daily Mail articles

Primary Language: Python

Main Files

Filename        Description
corpus-bo.py    Scrapes the Barack Obama speech homepage for all links, crawls each of them in turn, and saves the output in a simple text format
corpus-dm.py    Scrapes the Daily Mail website homepage for all article links, crawls each of them in turn, and saves the output in a simple text format
ddiag.py        Utility tool for inspecting dictionary files
dictionary.py   Creates a dictionary file from the saved corpus documents
markov.py       Generates Markov chains from a specified dictionary
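
The dictionary and generation steps can be illustrated with a minimal sketch. This is not the code in dictionary.py or markov.py; the function names (build_dictionary, generate), the in-memory dictionary layout, and the order and seed parameters are all assumptions made for illustration. It only shows the general technique: record which words follow each state, then walk those records at random.

```python
import random
from collections import defaultdict


def build_dictionary(text, order=1):
    """Map each run of `order` words to the list of words seen right after it.

    Hypothetical layout: the real dictionary file format may differ.
    """
    words = text.split()
    table = defaultdict(list)
    for i in range(len(words) - order):
        state = tuple(words[i:i + order])
        table[state].append(words[i + order])
    return dict(table)


def generate(table, length=20, seed=None):
    """Walk the table from a random starting state, returning at most `length` words."""
    if not table:
        return ""
    rng = random.Random(seed)
    # Sort the keys so a fixed seed always picks the same starting state.
    state = rng.choice(sorted(table))
    output = list(state)
    while len(output) < length:
        followers = table.get(state)
        if not followers:  # dead end: no word was ever seen after this state
            break
        nxt = rng.choice(followers)
        output.append(nxt)
        # Slide the window: drop the oldest word, append the new one.
        state = state[1:] + (nxt,)
    return " ".join(output[:length])


if __name__ == "__main__":
    corpus = "the cat sat on the mat and the dog sat on the cat"
    print(generate(build_dictionary(corpus, order=1), length=12))
```

Storing repeated followers in a list (rather than a set) means frequent word pairs are picked proportionally more often, which is what gives the output its resemblance to the source text. A higher order produces more coherent but less varied output.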

Dependencies:

  • Python 3.2+
  • BeautifulSoup4