/searpl

a little php search engine with sqlite

Primary LanguagePythonMIT LicenseMIT

searpl

searpl is a small php search engine with the following features:

  • robot.txt compliant
  • sqlite, so theres no need to run some fancy database daemon
  • javascript not required
  • no cdns!
  • read-only database, nothing is written except with the shell

licensing

searpl is licensed under an MIT licence, see LICENSE for more information

setup

this guide assumes you have shell access and are comfortable using command line tools like git.

  • make sure you have php, php-sqlite, wget, sqlite3 and git installed
  • go in your htdocs, public_html or whatever and git clone this repo
  • copy the contents of create.sql and paste it into the prompt on sqlite3 db.sqlite to create the database

optionally, if you want search ranking that prioritizes smol sites:

  • build searplrank.so by running make
  • in your php.ini, make sure sqlite3.extension_dir is uncommented and set to somewhere reasonable
  • copy searplrank.so to wherever you set the extension_dir to

crawling

to crawl a site, do ./urls.sh https://example.com

to recursively crawl, change the recursion limit with -l

./urls.sh -l5 https://example.com