Help with site search/Sphider

They have: 68 posts

Joined: Jun 2006

These questions are about Sphider, a PHP search engine that can be found here: http://www.cs.ioc.ee/~ando/sphider/index.php

Server type: Windows 2003, latest PHP and MySQL, and Apache 2.2

Here are the questions:

I was able to install the pdftotext converter, but when I looked at the doctotext converter, the site said it does not work on Windows. Is there a DOC converter that does work on Windows, and is there a PPT converter as well?

Also, I have links on my pages that use JavaScript, so the spider doesn't follow them. Is there a way to embed hidden links so that the linked pages are indexed?

Finally, some of my PDF documents contain fewer than 10 words, so they are excluded. Is there a way to change this so that a document is indexed no matter how few words it has?

Thanks for your help.

waffles

They have: 54 posts

Joined: Jun 2006

I don't know about PDFs (because I really don't like them). For the links, you could make a div containing the "hidden" links and use z-index to place it underneath the real content. Although in my experience it really doesn't matter whether the page is linked to or not.

Go into the 'Settings' tab after you log into the admin panel. There's an option in the third block down that allows you to change the number of words needed. If 0 works use that, otherwise use 1.

waffles Radio Coming to a set of speakers near you September 2006

They have: 68 posts

Joined: Jun 2006

I wound up creating a sitemap to fix the indexing problem. But I am still having trouble with the PDFs. I know you don't like them, but this problem could apply to other file types too, like DOC or PPT files.
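For anyone with the same JavaScript-link problem, here is a minimal sketch of the sitemap idea: one page of plain `<a href>` links that a spider can follow. The function name `build_sitemap` and the URLs in `pages` are made up for illustration and are not part of Sphider.

```python
from html import escape


def build_sitemap(links):
    """Return an HTML page with one plain <a href> link per (url, title) pair."""
    items = "\n".join(
        f'    <li><a href="{escape(url, quote=True)}">{escape(title)}</a></li>'
        for url, title in links
    )
    return (
        "<html>\n<head><title>Site map</title></head>\n<body>\n"
        "  <ul>\n" + items + "\n  </ul>\n</body>\n</html>\n"
    )


# Hypothetical pages; replace these with the site's real URLs and titles.
pages = [
    ("/products.html", "Products"),
    ("/docs/manual.pdf", "Manual (PDF)"),
]
html_page = build_sitemap(pages)
```

Saving `html_page` as something like `sitemap.html` and linking to it from the home page with an ordinary link should give the spider a JavaScript-free path to every listed page.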

I installed a pdftotext converter, but when I re-index my site the PDF files are not spidered. Before I installed the converter, they were at least found during indexing and just labeled (Not text). Now there isn't even a sign of them in the log.
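One way to narrow this down is to run the converter by hand, outside Sphider, and see whether it produces any text and how many words come out. The sketch below assumes `pdftotext` is on the PATH and relies on its convention that `-` as the output file means standard output. The helper names are hypothetical, and the 10-word default only mirrors the limit mentioned earlier in this thread; how Sphider actually applies its threshold is an assumption here.

```python
import shutil
import subprocess


def count_words(text):
    """Count whitespace-separated words in the extracted text."""
    return len(text.split())


def pdf_to_text(pdf_path, converter="pdftotext"):
    """Run the converter on one PDF and return what it writes to stdout."""
    exe = shutil.which(converter)
    if exe is None:
        raise FileNotFoundError(f"{converter} not found on PATH")
    # "-" as the output file tells pdftotext to write to standard output.
    result = subprocess.run([exe, pdf_path, "-"], capture_output=True, check=True)
    return result.stdout.decode("utf-8", errors="replace")


def would_be_indexed(text, min_words=10):
    """Assumed rule: a document passes if it has at least min_words words."""
    return count_words(text) >= min_words
```

Calling `pdf_to_text` on one of the missing files and passing the result to `would_be_indexed` shows whether the converter itself fails (an exception is raised), returns nothing, or returns too few words for the current setting.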

Any ideas?
