All our databases were installed using the default collation (Latin1_General_CI_AS
).
We plan to change the collation to allow clients to search the database with accent insensitivity.
Questions:
What are the negatives (if any) of having an accent insensitive database?
Are there any performance overheads for an accent insensitive database?
Why is the default for SQL Server collation accent sensitive; why would anyone want accent sensitive by default?
Seriously, changing database collations is a royal pain. See this HOWTO from codeproject, and then think hard before you do it! This is the EASY way!
Firstly, you can permit searches of the database with accent insensitivity simply by specifying that as part of the search, you don't necessarily have to change the collation.
Simple as. However, this will hurt the indexes.
An alternative is to supply a calculated field which you can index separately.
The example above puts it in the table, but you could just as well create an indexed view.
Then, for accent-insensitive searches, use the view
TableName_AI
.To answer your specific questions:
In an accent insensitive database, accent sensitive searches will be slower.
Yes, but not so you would notice
It just is. Something has to be the default: If you don't like it don't use the default!
Think of it this way: "Hard" and "Herd" are not the same word. That one vowel difference is enough - even though they sound similar.
An accent difference (a vs. á) is somewhere between a case difference (A vs. a), and a letter difference (a vs e). You have to draw the line somewhere.
An accent affects the sound of the word and can make it have a different meaning, though I struggle to think of examples. I guess it makes more sense to someone who has words in their database in a language which makes use of accents.