[thelist] Scanning for many strings in many texts

manuel.gonzalez.noriega at gmail.com
Thu Oct 13 14:10:23 CDT 2005


On 13/10/05, Judah McAuley <judah at wiredotter.com> wrote:

> Full-text indices (offered by MySQL and MSSQL, to name two I've used)
> are quite good options, especially if the info is already stored in a
> database. If the texts currently reside in separate files you might want
> to investigate a search engine like HtDig to index the content and then
> you can just query HtDig and retrieve the file(s) you need.

Judah, thanks. As I said, I'm familiar, up to a point, with full-text
search, inverted indexes and (basic) search techniques.

This situation, matching sets of documents against multiple predefined
searches instead of running a single search at a time, is however a tad
different, and I don't know whether there are special patterns available
or whether it's just a matter of doing successive individual searches.
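
To make the question concrete, here is a minimal sketch of one known
technique for this kind of problem, Aho-Corasick multi-pattern matching,
which scans each text once for all strings at the same time. It is only an
illustration under assumptions: nothing in this thread says it is the right
fit, the function names (build_automaton, find_matches, match_all) are
hypothetical, and it matches literal strings only, with no stemming or
ranking.

```python
from collections import deque


def build_automaton(patterns):
    """Build a trie with failure links over all search strings (hypothetical helper)."""
    goto = [{}]      # goto[state] maps a character to the next state
    fail = [0]       # fail[state] is the state to fall back to on a mismatch
    out = [set()]    # out[state] holds the patterns recognised at that state
    for pat in patterns:
        if not pat:
            continue  # an empty string would match everywhere, so skip it
        state = 0
        for ch in pat:
            if ch not in goto[state]:
                goto.append({})
                fail.append(0)
                out.append(set())
                goto[state][ch] = len(goto) - 1
            state = goto[state][ch]
        out[state].add(pat)

    # Breadth-first pass: shallower states are finished before deeper ones.
    queue = deque(goto[0].values())
    while queue:
        state = queue.popleft()
        for ch, nxt in goto[state].items():
            queue.append(nxt)
            f = fail[state]
            while f and ch not in goto[f]:
                f = fail[f]
            fail[nxt] = goto[f].get(ch, 0)
            # Inherit patterns that end at the fallback state.
            out[nxt] |= out[fail[nxt]]
    return goto, fail, out


def find_matches(text, automaton):
    """Return the set of patterns that occur in text, in one pass over it."""
    goto, fail, out = automaton
    state = 0
    found = set()
    for ch in text:
        while state and ch not in goto[state]:
            state = fail[state]
        state = goto[state].get(ch, 0)
        found |= out[state]
    return found


def match_all(texts, patterns):
    """Map each text name to the patterns found in its body."""
    automaton = build_automaton(patterns)
    return {name: find_matches(body, automaton) for name, body in texts.items()}
```

The automaton is built once from the predefined strings and reused for
every text, so the cost per text is roughly its length plus the number of
matches. Successive individual searches would instead pay for one pass per
string per text.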

--
Manuel
sometimes :) sometimes :(
but always working hard for Simplelógica: appearance,
experience and communication on the web.
http://simplelogica.net # (+34) 985 22 12 65

Oh! And writing at Logicola: http://logicola.simplelogica.net

