How to 'safely' access pdftotext from the http process
Posted: Thu Sep 07, 2023 8:35 pm
pdftotext is essential to Sphider. There is a CLI via admin/spider.php, but I'm not yet familiar with all the commands I would need to replicate the web interface (e.g., to initiate reindexing).
From the standard web interface to the Sphider/admin/ portal, I can initiate indexing, but it seems to fail silently because pdftotext is 'beyond' the access of the http process. By "fail silently", I mean that it appears to move on to the next document, fail to index it, and move on again...
Am I missing a critical step?
Is there a 'safe' setting in the PHP configuration that grants it execute permission without exposing too much access to the world?
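To narrow this down, one rough check is whether the binary is even visible and executable for the account the web server runs as. The Python sketch below is only a diagnostic under assumptions: it is not part of Sphider, `find_tool` is a hypothetical helper name, and the fallback directories are guesses at where pdftotext might be installed. It would need to be run as the web server's user to say anything about the http process.

```python
import os
import shutil


def find_tool(name, extra_dirs=()):
    """Return (path, is_executable) for `name` as seen by the current user.

    `path` is None when the tool is neither on PATH nor in `extra_dirs`.
    """
    # First look on the PATH of the current process.
    path = shutil.which(name)
    if path is None:
        # Fall back to explicitly listed directories (assumed locations).
        for directory in extra_dirs:
            candidate = os.path.join(directory, name)
            if os.path.isfile(candidate):
                path = candidate
                break
    if path is None:
        return None, False
    # os.access reports whether the current user may execute the file.
    return path, os.access(path, os.X_OK)


if __name__ == "__main__":
    # The directories here are assumptions, not known install locations.
    path, ok = find_tool("pdftotext", extra_dirs=("/usr/bin", "/usr/local/bin"))
    if path is None:
        print("pdftotext not found for this user")
    elif ok:
        print("pdftotext is executable at", path)
    else:
        print("pdftotext found at", path, "but not executable by this user")
```

If this reports the tool as executable for the web server's account, the problem is more likely in the PHP configuration than in file permissions.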