Update 6/6/13
After several weeks hacking with and blogging about Haystax, it's time for a new approach. Following the Unix philosophy, this repo of Haystax will focus on what it does best, selecting XPaths through a visual interface.
As part of our julython push, we will experiment using Haytax-defined XPaths in other scrapers that we will develop in other repos. We'll link up those repos when they're ready.
Our goal remains to provide a GUI scraper for non-programmers.
The rest of this README is about the Hackasaurus X-Ray Goggles. That, obviously, should change.
These are Web X-Ray Goggles, which provide a simple, easy way for non-technical people to inspect Web pages and learn about how they are put together.
To play with the current version of the goggles, check out https://secure.toolness.com/webxray/.
The build system for Web X-Ray Goggles was inspired by jQuery. In fact, Web X-Ray Goggles are, in some sense, just a custom build of jQuery. The src
directory contains files that will ultimately be concatenated or "compiled" into a single JavaScript file, and the static-files
directory contains standalone files used by the goggles.
To develop the goggles, you'll need:
- git
- python
First, check out the webxray
repository:
git clone https://github.com/toolness/webxray.git
To start a web server that dynamically serves the Web X-Ray Goggles bookmarklet and unit tests, run:
python go.py serve
Alternatively, if you want to compile the static-files/webxray.js
file, run:
python go.py compile
If you have any bugs to report, please file them in Lighthouse.
Localizing the goggles is easy! Just translate webxray.pot on localize.mozilla.org.
Note that localizations are activated automatically at runtime by examining
navigator.language
. This can be manipulated in Firefox by changing
the Languages preference.
Currently, the goggles build on JQuery 1.5. To update the repository's files to use a different version of JQuery, check out the webxray-stable branch of JQuery and run the export_to_webxray.py
script.
All files that are part of this project are covered by the following license, except where explicitly noted.
Version: MPL 1.1/GPL 2.0/LGPL 2.1
The contents of this file are subject to the Mozilla Public License Version
1.1 (the "License"); you may not use this file except in compliance with
the License. You may obtain a copy of the License at
http://www.mozilla.org/MPL/
Software distributed under the License is distributed on an "AS IS" basis,
WITHOUT WARRANTY OF ANY KIND, either express or implied. See the License
for the specific language governing rights and limitations under the
License.
The Original Code is webxray.
The Initial Developer of the Original Code is the Mozilla Foundation.
Portions created by the Initial Developer are Copyright (C) 2010
the Initial Developer. All Rights Reserved.
Contributor(s):
Alternatively, the contents of this file may be used under the terms of
either the GNU General Public License Version 2 or later (the "GPL"), or
the GNU Lesser General Public License Version 2.1 or later (the "LGPL"),
in which case the provisions of the GPL or the LGPL are applicable instead
of those above. If you wish to allow use of your version of this file only
under the terms of either the GPL or the LGPL, and not to allow others to
use your version of this file under the terms of the MPL, indicate your
decision by deleting the provisions above and replace them with the notice
and other provisions required by the GPL or the LGPL. If you do not delete
the provisions above, a recipient may use your version of this file under
the terms of any one of the MPL, the GPL or the LGPL.