By default, when one tells SQL Server (currently using 2008) to Full-Text index a column, it treats characters such as “@” and “.” as work-breakers, similarly to ” “.
I’d like to restrict the work-breaking characters to just be ” “, so that “joe.bloggs@somewhere.com” is treated as a single word.
It appears that one can choose a “Language for Word Breaker” against the indexed column – perhaps I need to set up a custom language?
Does anyone know how I can do this?
In order to make your word breaker fly with SQL Server you have to disable signature verification and add your COM CLSID to the registry. For more info check out this post: http://blogs.msdn.com/shajan/default.aspx
It helped me a lot! However I never managed to create my own language so I simply hijacked an already existing one.