The implementing-result-paging-in-hibernate-getting-total-number-of-rows question trigger another question for me, about some implementation concern:
Now you know you have to reuse part of the HQL query to do the count, how to reuse efficiently?
The differences between the two HQL queries are:
- the selection is
count(?), instead of the pojo or property (or list of) - the fetches should not happen, so some tables should not be joined
- the
order byshould disappear
Is there other differences?
Do you have coding best-practices to achieve this reuse efficiently (concerns: effort, clarity, performance)?
Example for a simple HQL query:
select a from A a join fetch a.b b where a.id=66 order by a.name
select count(a.id) from A a where a.id=66
UPDATED
I received answers on:
- using Criteria (but we use HQL mostly)
- manipulating the String query (but everybody agrees it seems complicated and not very safe)
- wrapping the query, relying on database optimization (but there is a feeling that this is not safe)
I was hoping someone would give options along another path, more related to String concatenation.
Could we build both HQL queries using common parts?
Nice question. Here’s what I’ve done in the past (many things you’ve mentioned already):
select count(*)select count(*) from ().fetch all propertiesfetchfrom joins if you’re parsing HQL as string. If you’re truly parsing the query with ANTLR you can removeleft joinentirely; it’s rather messy to check all possible references.order bygroup by/having.The above applies to HQL, naturally. For Criteria queries you’re quite limited with what you can do because it doesn’t lend itself to manipulation easily. If you’re using some sort of a wrapper layer on top of Criteria, you will end up with equivalent of (limited) subset of ANTLR parsing results and could apply most of the above in that case.
Since you’d normally hold on to offset of your current page and the total count, I usually run the actual query with given limit / offset first and only run the
count(*)query if number of results returns is more or equal to limit AND offset is zero (in all other cases I’ve either run thecount(*)before or I’ve got all the results back anyway). This is an optimistic approach with regards to concurrent modifications, of course.Update (on hand-assembling HQL)
I don’t particularly like that approach. When mapped as named query, HQL has the advantage of build-time error checking (well, run-time technically, because SessionFactory has to be built although that’s usually done during integration testing anyway). When generated at runtime it fails at runtime 🙂 Doing performance optimizations isn’t exactly easy either.
Same reasoning applies to Criteria, of course, but it’s a bit harder to screw up due to well-defined API as opposed to string concatenation. Building two HQL queries in parallel (paged one and “global count” one) also leads to code duplication (and potentially more bugs) or forces you to write some kind of wrapper layer on top to do it for you. Both ways are far from ideal. And if you need to do this from client code (as in over API), the problem gets even worse.
I’ve actually pondered quite a bit on this issue. Search API from Hibernate-Generic-DAO seems like a reasonable compromise; there are more details in my answer to the above linked question.