/get_wangyin_blog

A program collection to download WangYin's blog articles and convert them into markdown format.

Primary LanguagePythonEclipse Public License 1.0EPL-1.0

This project contains some programs that download WangYin's blogging posts from his websites here and here, and convert them into text files in the markdown format.

Distributed under GPLv3.


The Clojure part for "http://www.yinwang.org/"

Before running please make sure you have "Java/Leiningen/node.js" installed. The versions I'm using are:

java: 11.0.7-open
leiningen: 2.9.4
node.js: v12.18.3

The "node.js" part of this project is used to convert the HTML page content into markdown format text. Before running, install the npm packages first:

npm install

Now you're good to go:

lein run

The Python part for "https://yinwang1.wordpress.com/"

Setup the "node.js" environment as above, and make sure you have Python3 installed. Install the Python libraries first:

pip install -r requirements.txt

Run the program:

./wywp.py