Search found 302 matches

by captquirk
Tue Oct 17, 2023 12:21 am
Forum: Announcements
Topic: Sphider 5.4.0, SphiderLite 2.5.0 released
Replies: 0
Views: 5126

Sphider 5.4.0, SphiderLite 2.5.0 released

15 October 2023 - Processing of robots.txt files has been improved. Robots.txt is now case sensitive and consideration is given to "allow" directives. All common text files have been integrated into Sphider. The user may assign a default language to a web site, but Sphider will also try to...
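
To illustrate what case-sensitive matching and "allow" directives mean in practice, here is a minimal sketch using Python's standard-library robots.txt parser. It is not Sphider's own (PHP) code, and the sample rules and example.com URLs are made up for the demonstration. Note that Python's parser applies the first matching rule, so the Allow line is listed before the broader Disallow.

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt content, invented for this example.
ROBOTS_TXT = """\
User-agent: *
Allow: /Private/public.html
Disallow: /Private/
"""

def build_parser(robots_txt):
    """Parse robots.txt text that is already in memory (no network access)."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser

def allowed(parser, url, agent="*"):
    """Return True if the given agent may fetch the URL under the parsed rules."""
    return parser.can_fetch(agent, url)

PARSER = build_parser(ROBOTS_TXT)

if __name__ == "__main__":
    for url in (
        "http://example.com/Private/public.html",  # matched by the Allow rule
        "http://example.com/Private/secret.html",  # matched by the Disallow rule
        "http://example.com/private/secret.html",  # different case, no rule matches
    ):
        print(url, allowed(PARSER, url))
```

Because paths are compared case-sensitively, /private/ and /Private/ are treated as different locations.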
by captquirk
Mon Oct 09, 2023 4:03 pm
Forum: Sphider Help
Topic: unicode indexing
Replies: 18
Views: 5416

Re: unicode indexing

I am puzzled... Obviously, I am no guru and am missing something. The 'keyword' field is capable of holding Unicode characters. What version of MySQL (or possibly MariaDB) is running? MySQL, from what I read, has finally made utf8 and utf8mb4 the same instead of utf8 == utf8mb3. If the DB is MariaDB...
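
As background on the utf8mb3 versus utf8mb4 distinction: utf8mb3 stores at most three bytes per character, which covers only the Basic Multilingual Plane, while utf8mb4 allows four. The sketch below is a plain Python check with a hypothetical helper name (fits_utf8mb3); it does not talk to MySQL or MariaDB at all.

```python
def utf8_length(char):
    """Number of bytes a single character occupies when encoded as UTF-8."""
    return len(char.encode("utf-8"))

def fits_utf8mb3(text):
    """True if every character needs at most 3 UTF-8 bytes (code point <= U+FFFF)."""
    return all(ord(ch) <= 0xFFFF for ch in text)

if __name__ == "__main__":
    for sample in ("nièvre", "日本語", "😀"):
        sizes = [utf8_length(ch) for ch in sample]
        print(sample, sizes, fits_utf8mb3(sample))
```

Accented Latin letters and most CJK text fit in three bytes or fewer, so a failure on such text usually points somewhere other than the 3-byte limit.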
by captquirk
Sun Oct 08, 2023 4:21 pm
Forum: Sphider Help
Topic: unicode indexing
Replies: 18
Views: 5416

Re: unicode indexing

"0 rows affected" is expected as there was no operation affecting actual data. Just a table creation which has no data. Can you run a "show create table keywords;" again just to confirm the table does have utf8mb4 varchar? The fact you can manually enter a unicode character into ...
by captquirk
Sat Oct 07, 2023 7:42 pm
Forum: Sphider MODS
Topic: Cause spidering to pause after indexing a specified number of pages
Replies: 2
Views: 55952

Re: Cause spidering to pause after indexing a specified number of pages

That is a suggestion I will keep in mind. Thanks!
by captquirk
Sat Oct 07, 2023 7:41 pm
Forum: Sphider MODS
Topic: Suggestion: phrase search
Replies: 2
Views: 3487

Re: Suggestion: phrase search

It doesn't.
The full text of the page is stored in the links table. When a phrase search is performed, Sphider looks for that string in the full text and not a chain of individual keywords.
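
A rough sketch of the difference, in Python rather than Sphider's PHP: the LINKS dictionary below is a made-up stand-in for the links table (URL mapped to full page text), and both function names are hypothetical. It only shows the idea of matching a contiguous string against the full text instead of chaining individual keywords.

```python
# Hypothetical in-memory stand-in for the links table: url -> full page text.
LINKS = {
    "http://example.com/a": "The quick brown fox jumps over the lazy dog",
    "http://example.com/b": "A brown dog and a quick fox",
}

def keyword_search(words, links=LINKS):
    """Match pages that contain every word, in any order and position."""
    wanted = [w.lower() for w in words]
    return sorted(
        url for url, text in links.items()
        if all(w in text.lower().split() for w in wanted)
    )

def phrase_search(phrase, links=LINKS):
    """Match pages whose full text contains the phrase as one contiguous string."""
    needle = phrase.lower()
    return sorted(url for url, text in links.items() if needle in text.lower())

if __name__ == "__main__":
    print(keyword_search(["quick", "fox"]))   # both pages contain both words
    print(phrase_search("quick brown fox"))   # only the page with the exact run
```

Both pages match the keywords "quick" and "fox", but only one contains the exact phrase, which is why a keyword chain alone cannot answer a phrase query.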
by captquirk
Fri Oct 06, 2023 8:32 pm
Forum: Sphider Help
Topic: unicode indexing
Replies: 18
Views: 5416

Re: unicode indexing

Not all tables have varchar() fields, and of those that do, only TWO are critical for this situation: keywords and links. Go into the site setting and "Clear site", then go into the Clean tables tab and clean keywords. Your links and keywords tables should now be emp...
by captquirk
Fri Oct 06, 2023 8:27 pm
Forum: Sphider Help
Topic: Caractères mal interprétés
Replies: 1
Views: 2312

Re: Caractères mal interprétés

Using Google translate, I get: Good morning, I'm trying to use Sphider which seems very good to me. Only, I have a problem with accented characters (French). In my tables, for example, the word nièvre appears like this (with an è after ni). But in the Sphider_keywords table, the word appears like th...
by captquirk
Fri Sep 29, 2023 4:40 pm
Forum: Sphider Help
Topic: unicode indexing
Replies: 18
Views: 5416

Re: unicode indexing

Your code looks perfect! We have already established your tables are utf8mb4. Is there some way the fields in the table are NOT utf8mb4??? This does not seem likely, but I suppose anything is possible. Go into mysql, connect to the database you are using, and run a 'show create table' for links and ...
by captquirk
Thu Sep 28, 2023 3:32 pm
Forum: Sphider Help
Topic: unicode indexing
Replies: 18
Views: 5416

Re: unicode indexing

I sent you a PM with instructions.
by captquirk
Wed Sep 27, 2023 5:17 pm
Forum: Sphider Help
Topic: unicode indexing
Replies: 18
Views: 5416

Re: unicode indexing

I have to think you should be able to override whatever your host has as a default... I would like to see a portion of source code for a page containing Unicode characters... Even better would be to see the site itself. If you are not comfortable sharing that publicly, you may do so by private messa...