A web-crawler framework with Haskell
Useless Framework provides:
- an independent HTML module with:
- DOM tree structure to store HTML document;
- HTML parser, translate html-format string to DOM tree;
- Tag selector, to easily select tags from a tree, like what you do in css;
- a monad transformer that supports Open-Close Table algorithm, to help task scheduling
For usage, please refer to Wiki