Don't assume that other data from a PDF is the same as the content. Bypasses some still-unfixed PDFParser encoding issues. Also exit the crawler script if we are in debug mode and there is a crawl already running.