A Python client library for the Apache Nutch that makes Nutch 1.x capabilities available using the Nutch REST Server.
See (https://wiki.apache.org/nutch/NutchTutorial) for installing Nutch 1.x and alternatively operating it via the command line.
This Python client library for Nutch is installable via Setuptools, Pip and Easy Install.
pip install nutch
python setup.py build
python setup.py install
See the wiki for instructions on how to use Nutch-Python and its API.
When you install Nutch-Python you also get a new command
line client tool, nutch-python
installed in your /path/to/python/bin
directory.
The options and help for the command line tool can be seen by typing
nutch-python
without any arguments.
Send them to Chris A. Mattmann.
- Brian D. Wilson, JPL
- Chris A. Mattmann, JPL
- Aron Ahmadia, Continuum Analytics