Automatically crawl your website and add search-engine capability.
Go to file
2023-04-20 10:46:20 -04:00
orcinus Delete all included repos for reupload 2023-04-20 10:20:38 -04:00
.gitattributes Initial commit 2023-04-11 22:02:16 -04:00
.gitignore Daily updates, big flow change in crawler.php 2023-04-18 17:20:27 -04:00
example.html ARIA search updates 2023-04-12 21:08:53 -04:00
example.php ARIA search updates 2023-04-12 21:08:53 -04:00
LICENSE Initial commit 2023-04-11 22:02:16 -04:00
README.md Update README.md 2023-04-20 10:46:20 -04:00

Orcinus Site Search

The Orcinus Site Search PHP script is an all-in-one website crawler and search engine that extracts searchable content from XML, HTML and PDF files from a single, or multiple websites. It replaces 3rd party, remote search solutions such as Google etc.

Orcinus will crawl your website content on a schedule, or at your command via the admin UI or even by CLI/crontab. Crawler log output conveniently informs you of missing pages, links that redirect, and other errors that you, as a webmaster can fix to keep your user experience tight. Customize your search results by blocking URLs, unlisting pages, or raising/lowering their search priority. You have complete control over the appearance of your search results with a convenient templating system.

Orcinus can generate a sitemap XML or XML.GZ file of your pages after every crawl, suitable for uploading to Google analytics. It can also export a JavaScript version of the entire search engine that works with offline mirrors, such as those generated by HTTrack.

Requires:

  • PHP >= 7.2.x
  • MySQL / MariaDB

3rd Party Libraries Included:

Optional Libraries: