I currently use Power BI Desktop to connect to Apache Hive through a HortonWorks Hive ODBC connection. I pass SQL-like statements to Hive in Power BI to process the statements on the server and then have Hive return the results.
My issue is that returning the data to Power BI is extremely slow. For instance, it takes up to an hour to return a "table" of about 151M records in Power BI. When I query Hive from a database tool such as DBeaver, I can get around this by running the queries through the Tez engine, with the statement below:
set hive.execution.engine=tez;
More on the Tez engine here.
Running these statements through the Tez engine takes about 1/100th of the time. (BTW: DBeaver connects to Hive through JDBC drivers, which Power BI does not appear to support yet.)
Is there a way to force Power BI to run queries through the Tez engine?
It can be done in the DSN config. Go to Advanced in your ODBC config, open Server Side Properties, click Add, and put in hive.execution.engine as the key and tez as the value. Then click OK on each dialog to save. Worked for me: it took an hour-long query down to 12 minutes. Not perfect, but way more palatable.
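If you would rather not edit the DSN itself, the same property can probably be passed in the ODBC connection string, which Power BI's ODBC connector accepts under its advanced options. Below is a minimal Python sketch that only builds such a string. It assumes your driver honors the `SSP_` prefix for server-side properties, as the Simba-based Hive ODBC drivers document; check your driver's install guide before relying on it. The DSN name `HiveProd` and the helper function are hypothetical.

```python
def build_hive_odbc_connection_string(dsn, server_side_properties):
    """Build an ODBC connection string that carries Hive server-side properties.

    Assumption: the driver treats any key prefixed with "SSP_" as a
    server-side property to apply to the Hive session when it connects.
    """
    parts = [f"DSN={dsn}"]
    for key, value in server_side_properties.items():
        # Prefix each Hive property so the driver forwards it to the server.
        parts.append(f"SSP_{key}={value}")
    return ";".join(parts) + ";"


# "HiveProd" is a placeholder; use the name of your own DSN.
conn_str = build_hive_odbc_connection_string(
    "HiveProd", {"hive.execution.engine": "tez"}
)
print(conn_str)
```

The resulting string can be pasted into the connection string field of the ODBC connector. If the driver ignores the prefix, the DSN approach described above is the one that has been confirmed to work in this thread.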
Hi,
This is not a solution to your question.
However, I am trying to connect to Tez through Hive using DBeaver. Since you were able to connect, could you please help?
Thanks in advance!
Manas