besides using Hive, is it a good idea in order to execute ad hoc

Question


Asked: June 1, 2026 (2026-06-01T06:39:57+00:00)


Besides using Hive, is it a good idea for executing ad hoc queries on large-scale log data on HDFS, for SQL programmers?

Is there any similar open-source implementation?


1 Answer

Editorial Team · Answer 1 · 2026-06-01T06:39:58+00:00

Technically it should not be that complicated to implement. One conceptual problem I see is that the performance behavior of NoSQL engines is fundamentally different from what the MySQL engine expects from its storage engines. Specifically, they have good random access but are not that efficient at full or range scans. The question is whether it will be possible to translate all these costs to the optimizer. This applies to any RDBMS engine. In fact, many of them have a concept of pluggable storage engines, with different levels of flexibility and documentation.
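To make the cost point concrete, here is a minimal sketch of how an optimizer might pick between point lookups and a full scan from per-engine cost figures. The `EngineCosts` class, the `cheapest_plan` function, and all the numbers are made up for illustration; this is not how MySQL's optimizer is actually implemented.

```python
from dataclasses import dataclass


@dataclass
class EngineCosts:
    """Hypothetical per-engine cost figures, in arbitrary units."""
    random_read: float  # cost of fetching one row by key
    scan_row: float     # cost of reading one row during a sequential scan


def cheapest_plan(costs: EngineCosts, table_rows: int, matching_rows: int):
    """Return (plan_name, estimated_cost) for the cheaper access path."""
    lookup_cost = matching_rows * costs.random_read
    scan_cost = table_rows * costs.scan_row
    if lookup_cost <= scan_cost:
        return ("point_lookups", lookup_cost)
    return ("full_scan", scan_cost)


# Assumed figures: a disk-backed engine with cheap scans and costly seeks,
# and a NoSQL-style engine with cheap key lookups and costly scans.
disk_like = EngineCosts(random_read=100.0, scan_row=1.0)
nosql_like = EngineCosts(random_read=2.0, scan_row=5.0)

print(cheapest_plan(disk_like, 1_000_000, 50_000))
print(cheapest_plan(nosql_like, 1_000_000, 50_000))
```

The same query gets a different plan depending on the figures the engine reports, which is why those costs have to reach the optimizer in a form it understands.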

I think that for such an integration to be efficient, we need to be able to push predicates down to the NoSQL engines for full and range scans. I am not 100% sure that MySQL supports this at the level of the storage engine interface.
Another serious problem I see with this approach is that MySQL does not have parallel query, and therefore cannot be very good at processing big data.
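As a rough illustration of the predicate push-down mentioned above, the sketch below compares filtering in the SQL layer with filtering inside the store. `ToyStore` and both query functions are hypothetical names for this example only; they do not correspond to the MySQL storage engine interface or to any real NoSQL client.

```python
from typing import Callable, Iterator, Optional

Row = dict


class ToyStore:
    """Stand-in for a storage engine; counts rows handed to the SQL layer."""

    def __init__(self, rows):
        self.rows = list(rows)
        self.rows_returned = 0

    def scan(self, predicate: Optional[Callable[[Row], bool]] = None) -> Iterator[Row]:
        # With a predicate, filtering happens inside the store, so only
        # matching rows cross the engine boundary.
        for row in self.rows:
            if predicate is None or predicate(row):
                self.rows_returned += 1
                yield row


def query_without_pushdown(store: ToyStore, predicate) -> list:
    # The store returns every row; the SQL layer filters afterwards.
    return [row for row in store.scan() if predicate(row)]


def query_with_pushdown(store: ToyStore, predicate) -> list:
    # The predicate is handed to the store, which filters during the scan.
    return list(store.scan(predicate))


# Made-up log data: one row in a hundred is an error.
logs = [{"id": i, "status": 500 if i % 100 == 0 else 200} for i in range(1000)]


def is_error(row: Row) -> bool:
    return row["status"] == 500


plain, pushed = ToyStore(logs), ToyStore(logs)
assert query_without_pushdown(plain, is_error) == query_with_pushdown(pushed, is_error)
print(plain.rows_returned, pushed.rows_returned)
```

Both queries return the same rows, but without push-down every row crosses the engine boundary, which is the overhead that matters for scans over large log data.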
