.NET 3.5, C#
I have a web app with a ‘search’ feature. Some of the fields that are searchable are first-class columns in the table, but some of them are in fact nested fields inside an XML data type.
Previously, I built a system for dynamically constructing the SQL for my search. I had a nice class hierarchy that built SQL expressions and conditional statements. The only problem was it was not safe from SQL injection attacks.
I was reading Rob Conery’s excellent article which pointed out that multiple queries can combined into a single TSQL query for the server if the IQueryable result is never enumerated. This got me to thinking that my dynamic search construction was much too complicated – I just needed to combine multiple LINQ expressions.
For example (contrived):
Author: ID (int), LastName (varchar(32)), FirstName (varchar(32)) context.Author.Where(xx => xx.LastName == 'Smith').Where(xx => xx.FirstName == 'John')
Results in the following query:
SELECT [t0].[ID], [t0].[LastName], [t0].[FirstName] FROM [dbo].[Author] AS [t0] WHERE ([t0].[LastName] = Smith) AND ([t0].[FirstName] = John)
I realized this might be the perfect solution for a simple dynamic query generation that’s safe from SQL injection – I’d just loop over my IQueryable result and execute additional conditionals expressions to get my final single-execution expression.
However, I can’t find any support for evaluation of XML data. In TSQL, to get a value from an XML node, we would do something like
XMLField.value('(*:Root/*:CreatedAt)[1]', 'datetime') = getdate()
But I can’t find the LINQ to SQL equivalent of creating this evaluation. Does one exist? I know I can evaluate all non-XML conditions DB side, and then do my XML evaluations code side, but my data are large enough that A) that’s a lot of network traffic to drag on performance and B) I’ll get out-of-memory exceptions if I can’t evaluate the XML first DB side to exclude certain result sets.
Ideas? Suggestions?
Bonus question – If XML evaluation is in fact possible DB side, what about FLWOR support?
Now that is an interesting question.
Right now, you cannot instruct SQL Server to perform XML functions directly from Linq. However, you can get Linq to use user defined functions… so, you could setup a udf to process the xml, get the right data, etc, and then use that in your Linq expresion. This will execute on the server and should do what you want. There’s an important limitation, though: The XML path you’re looking for (the first parameter to
xmlColumn.valueor similar) has to be built into the function because it has to be a string literal, it can’t be built from an input parameter (for instance). So you can use UDFs for getting fields you know about when writing the UDF, but not as a general-purpose way to get data from XML columns.Check out the Supporting User Defined Functions (UDFs) section of Scott Gutherie’s excellent Blog series on Linq to SQL for more info on implementation.
Hope this helps.