[thelist] multilingual sql databases and searching

Steve Cook steve.cook at evitbe.com
Fri Jan 18 06:31:49 CST 2002


Hi Garry, nobody else has answered what is obviously a pretty specialised
question, so I thought I would at least chuck in my observations.

Case-insensitive searching should take care of itself in SQL Server 2000,
as long as your columns use a case-insensitive collation (which is the
usual default). If you're storing large text entries (more than a few
thousand characters) then definitely look at the "text" datatype, or its
Unicode counterpart "ntext" for copy in languages such as Russian or
Japanese, and look up "Full-text Querying SQL Server Data" in Books Online
(BOL) to see a whole bunch of options for searching within large text
entries.

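The diacritic searching question is harder, and I guess that you may have
to build your own solution. I searched for the word "diacritic" in BOL and
got a single entry, which referred to search result ordering and said
nothing about diacritic-insensitive searching. It may be worth searching
for "accent" as well, since SQL Server 2000 collations can be marked
accent-sensitive or accent-insensitive.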
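
If you do end up rolling your own, here is a rough sketch of the idea in
Python rather than in SQL Server itself. It is only an illustration, not a
tested solution for your data: search_key() and find() are names I made up,
and it assumes you can normalise both the stored copy and the search term
in application code (for example, by storing the key in an extra column).

```python
import unicodedata


def search_key(text: str) -> str:
    """Return a case- and diacritic-insensitive key (hypothetical helper)."""
    # NFKD splits an accented letter into its base letter plus combining marks.
    decomposed = unicodedata.normalize("NFKD", text)
    # Drop the combining marks, keeping only the base characters.
    stripped = "".join(ch for ch in decomposed if not unicodedata.combining(ch))
    # casefold() is a stronger lower() intended for caseless matching.
    return stripped.casefold()


def find(term: str, rows: list[str]) -> list[str]:
    """Return the rows whose key contains the key of the search term."""
    key = search_key(term)
    return [row for row in rows if key in search_key(row)]


# Made-up sample data, purely for illustration.
rows = ["Fresh FooD daily", "fóöd for thought", "crème brûlée", "nothing here"]
matches = find("food", rows)
```

Be careful with this outside Western European text: in some scripts the
marks are part of a different letter (Russian "й" would be reduced to "и"),
so you would probably want to apply the stripping per language.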

Your final question strikes me as a little odd. You could search several
databases by using code to query one after another, returning a record set
from each and combining them within your code. However, I would question
why you want to do this. If you are creating the data from scratch, you
are better off creating a single database for your needs. Others here may
be able to answer this question better; it's not something I have come
across as a requirement before.
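
For what it's worth, the "one after another" approach could look roughly
like the sketch below. It uses Python's built-in sqlite3 module with
in-memory databases purely as a stand-in for your real servers; the
"articles" table, its columns and the make_db() and search_all() helpers
are all invented for the example.

```python
import sqlite3


def make_db(rows):
    """Build an in-memory stand-in database with a hypothetical table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE articles (lang TEXT, body TEXT)")
    conn.executemany("INSERT INTO articles VALUES (?, ?)", rows)
    return conn


def search_all(connections, term):
    """Run the same query against each database in turn and merge the rows."""
    results = []
    for name, conn in connections.items():
        cursor = conn.execute(
            "SELECT lang, body FROM articles WHERE body LIKE ? ORDER BY body",
            (f"%{term}%",),
        )
        # Tag each row with the database it came from.
        results.extend((name, lang, body) for lang, body in cursor)
    return results


# Two made-up databases; SQLite's LIKE ignores case for ASCII letters.
connections = {
    "db_a": make_db([("en", "Good food here"), ("fr", "Bonjour")]),
    "db_b": make_db([("en", "FOOD market"), ("en", "No match")]),
}
hits = search_all(connections, "food")
```

The merged list keeps track of which database each row came from, which
you would probably need anyway when displaying the results.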

Hope that is of *some* help at least. Sounds like you have an interesting
project on your hands.

.steve



----------------------------------
   WapWarp - http://wapwarp.com
 Wap-Dev - http://www.wap-dev.net
 Cookstour - http://cookstour.org
----------------------------------

> -----Original Message-----
> From: garry wiseman [mailto:garryw_wiseman_99 at hotmail.com]
> Sent: 17 January 2002 15:57
> To: thelist at lists.evolt.org
> Subject: [thelist] multilingual sql databases and searching
> 
> 
> Hi,
> 
> I wish to store large amounts of copy in a database, either MS SQL 2000
> or Oracle, that copy being in several different languages, e.g. French,
> Russian, Japanese. How can I do this in the above two databases? How do I
> set up the tables, and what string types should I use?
> 
> I would then like to do two kinds of searching on this data:
> (1) Case-insensitive: so I search on "food", and the search will find
> "FooD", "fOOd", etc.
> (2) Diacritic-insensitive: so I search on "food", and the search will
> find all instances of data with "food" in, regardless of accents or other
> such characters.
> 
> Can anyone help me with how to set up such searches? Particularly the
> diacritic-insensitive searching?
> Finally - supposing I had 2-3 such databases, how could I search all 2-3
> simultaneously?
> 
> If anyone can help with any of these, that'd be fantastic.
> 
> Many thanks,
> Garry.
> 




