## How to Install
### Requirements

- Python 2.7
- pip
- Django
- Apache Tika

### Instructions

Install the Python requirements:

    pip install -r requirements.txt

### How to Run the Application

1. Run Solr. Change to the directory where you cloned the project, then start Solr:

       cd Solr/solr-5.3.1/
       ./bin/solr start

2. Clone and build the lucene-geo-gazetteer repo:

       git clone https://github.com/chrismattmann/lucene-geo-gazetteer.git
       cd lucene-geo-gazetteer
       mvn install assembly:assembly

   Add `lucene-geo-gazetteer/src/lucene-geo-gazetteer/src/main/bin` to your `PATH` environment variable, then make sure it is working:

       lucene-geo-gazetteer --help
       usage: lucene-geo-gazetteer
        -b,--build <gazetteer file>           The Path to the Geonames
                                              allCountries.txt
        -h,--help                             Print this message.
        -i,--index <directoryPath>            The path to the Lucene index
                                              directory to either create or read
        -s,--search <set of location names>   Location names to search the
                                              Gazetteer for

3. Build a gazetteer from the Geonames.org dataset (1.2 GB):

       cd lucene-geo-gazetteer/src/lucene-geo-gazetteer
       curl -O http://download.geonames.org/export/dump/allCountries.zip
       unzip allCountries.zip
       lucene-geo-gazetteer -i geoIndex -b allCountries.txt

   Make sure it is working:

       lucene-geo-gazetteer -s Pasadena Texas
       [
         {"Texas" : [
           "Texas",
           "-91.92139",
           "18.05333"
         ]},
         {"Pasadena" : [
           "Pasadena",
           "-74.06446",
           "4.6964"
         ]}
       ]

   Now start the lucene-geo-gazetteer server:

       lucene-geo-gazetteer -server

4. Run the Tika server on port 8001, as described in https://wiki.apache.org/tika/GeoTopicParser. The port is referenced in https://github.com/MBoustani/GeoParser/blob/master/geoparser_app/views.py#L117.

5. Run the Django server:

       python manage.py runserver

6. Open http://localhost:8000/ in your browser.
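
Because the application depends on several services running at once, a quick check that each port answers can save some debugging. The following is a minimal sketch, not part of the project: ports 8000 (Django) and 8001 (Tika) come from the steps above, while 8983 is Solr's stock default and is an assumption about this setup. The gazetteer server is left out because its port is not stated here. The names `SERVICES`, `is_port_open`, and `check_services` are hypothetical.

```python
import socket

# 8000 (Django) and 8001 (Tika) are taken from the steps above.
# 8983 is Solr's stock default port: an assumption, adjust if yours differs.
SERVICES = {
    "Solr": 8983,
    "Tika": 8001,
    "Django": 8000,
}


def is_port_open(host, port, timeout=1.0):
    """Return True if a TCP connection to host:port succeeds."""
    sock = socket.socket(socket.AF_INET, socket.SOCK_STREAM)
    sock.settimeout(timeout)
    try:
        # connect_ex returns 0 on success and an error code otherwise.
        return sock.connect_ex((host, port)) == 0
    finally:
        sock.close()


def check_services(host="localhost", services=SERVICES):
    """Map each service name to True/False depending on whether its port answers."""
    return {name: is_port_open(host, port) for name, port in services.items()}


if __name__ == "__main__":
    for name, up in sorted(check_services().items()):
        print("%-6s %s" % (name, "listening" if up else "not reachable"))
```

A port that answers only shows that something is listening there, not that the service is configured correctly, so this is best treated as a first-pass check.
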
- Apache Tika
- [Lucene Geo Gazetteer](https://github.com/chrismattmann/lucene-geo-gazetteer)
- Apache Solr
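
The sample output of `lucene-geo-gazetteer -s Pasadena Texas` in step 3 is a JSON list with one object per search term. The sketch below shows one way to read it from Python. It assumes each value holds the matched name followed by longitude and latitude, in that order. This is inferred from the sample, where -91.92139 is outside the valid latitude range, and is not taken from the tool's documentation. `parse_gazetteer` is a hypothetical helper.

```python
import json

# Output copied from the "make sure it is working" check in step 3.
SAMPLE = """
[
  {"Texas" : ["Texas", "-91.92139", "18.05333"]},
  {"Pasadena" : ["Pasadena", "-74.06446", "4.6964"]}
]
"""


def parse_gazetteer(text):
    """Return {search term: (matched name, longitude, latitude)}.

    Assumes the first three fields are name, longitude, latitude,
    as the sample output suggests.
    """
    results = {}
    for entry in json.loads(text):
        for query, fields in entry.items():
            name, lon, lat = fields[0], fields[1], fields[2]
            results[query] = (name, float(lon), float(lat))
    return results


if __name__ == "__main__":
    for query, (name, lon, lat) in sorted(parse_gazetteer(SAMPLE).items()):
        print("%s -> %s (lon %s, lat %s)" % (query, name, lon, lat))
```

Converting the coordinates to floats early makes them easier to hand to mapping or indexing code, which generally expects numbers, not strings.
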