intranet search tool (was RE: [thelist] full text search with indexing services on Win2K server)

Ken Schaefer ken.schaefer at
Mon Sep 6 07:58:06 CDT 2004

Is "cost" a factor here?

Index Server has a PDF plugin that used to be available from Adobe.
How well it works, I don't know (I've never had to use it). If you're
going to have someone write down keywords for each file, you might as
well store them in the PDF's properties, and have Index Server (or
some other search engine) index that, rather than maintaining a
separate database (which would then require the file system, and the
DB reference to the file kept in-sync)


On Mon, 06 Sep 2004 11:05:59 +0300, Burhan Khalid <thelist at> wrote:
> On Thu, 2004-09-02 at 21:47, Joel D Canfield wrote:
> > Okay, how 'bout this: on your Windows intranet, what user-friendly
> > method do *you* use for full-text searching of large volumes of
> > documents? (large meaning tens of thousands)
> My question is similar to this, but with a different slant.  The company
> has a repository (around 5 gigs) of PDF and WHM files.  Currently, the
> files are haphazardly categorized by publisher and subject matter (ie,
> References\ORielly\CSharp) -- this is on a file system level. There are
> separate directories for each publisher, etc.
> We wanted to come up with an intranet solution so that employees can
> search this library based on keywords and then find out what documents
> they should download/checkout (or what book to ask for from the
> reference library).
> The files were on a Unix/Linux based server (now we have moved them to a
> Windows 2003 Server).  My initial idea was to find a search engine
> solution (like mngoSearch) and hope that it would index the files
> appropriately.
> Another idea that was presented at a staff meeting was to have someone
> write down keywords for each file and then store this information in a
> database. The end user then has the option of searching for keywords, or
> viewing a list of all files sorted by category/subject.
> I'm really looking for an automated system that can take care of
> indexing the content and provide a user friendly interface for searching
> the library.  Is there such a thing out there? I've never had to deal
> with such large amounts of documents -- so I'm not sure what would be
> the best solution.  The platforms that are available are Windows Server
> 2003, FreeBSD and Linux.
> Can anyone offer some recommendations?
> >
> > > I can't find info about this. Win2K indexing services seems to only
> > > search its own abstract of the document (the column
> > > and not really the text of the documents.
> > >
> > > We've got thousands of docs, mostly RTF but lots of other MS and plain
> > > text formats. I've used Site Server in the past and been very
> > > happy with
> > > the speed of indexing and reliability of searches. Indexing service
> > > doesn't seem to be doing it. Are my expectations unrealistic?
> > > Should it
> > > be performing full text searches?
> --
> News! - conference for web professionals.
> 17-19 September 2004 in Toronto, Canada.
> Details at
> * * Please support the community that supports you.  * *
> For unsubscribe and other options, including the Tip Harvester
> and archives of thelist go to:
> Workers of the Web, evolt !

More information about the thelist mailing list