I currently use Power BI Desktop to connect to Apache Hive through a Hortonworks Hive ODBC connection. I pass SQL-like statements to Hive from Power BI so that they are processed on the server and Hive returns the results.
My issue is that returning the data to Power BI is extremely slow. For instance, it takes up to an hour to return a "table" of about 151M records in Power BI. When I use a database management system to query Hive -- like DBeaver, for instance -- I can get around this by running the queries through the Tez engine, with the statement below:
set hive.execution.engine=tez;
More on the Tez engine here.
Running these statements through the Tez engine takes about 1/100th of the time. (BTW: DBeaver connects to Hive through JDBC drivers, which it looks like Power BI does not yet support.)
Is there a way to force Power BI to run queries through the Tez engine?
It can be done in the DSN config. Go to Advanced in your ODBC config, open Server Side Properties, click Add, and put in hive.execution.engine as the key and tez as the value. Then OK, OK, OK. Worked for me: it took an hour-long query down to 12 minutes. Not perfect, but way more palatable.
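For anyone who uses a DSN-less connection string instead of the DSN dialog, here is a rough sketch of the same idea. It assumes the Hortonworks driver follows the Simba convention of prefixing server-side properties with `SSP_` in the connection string; I have not confirmed that for every driver version, so check your driver's documentation. The helper name `build_hive_odbc_connection_string` and the DSN name `MyHiveDSN` are made up for illustration.

```python
def build_hive_odbc_connection_string(dsn, server_side_properties):
    """Build an ODBC connection string that carries Hive server-side properties.

    Assumption: the driver accepts server-side properties as SSP_<key>=<value>
    pairs (the Simba-style convention). Verify against your driver's docs.
    """
    parts = [f"DSN={dsn}"]
    for key, value in server_side_properties.items():
        # Each property becomes one SSP_-prefixed key/value pair.
        parts.append(f"SSP_{key}={value}")
    return ";".join(parts)


# "MyHiveDSN" is a placeholder DSN name, not one from this thread.
conn_str = build_hive_odbc_connection_string(
    "MyHiveDSN",
    {"hive.execution.engine": "tez"},
)
print(conn_str)
```

The resulting string is the connection-string counterpart of the key/value pair entered under Server Side Properties, so the engine setting travels with the connection rather than with each query.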
Hi,
This is not a solution to your question, but I am trying to connect to Hive with the Tez engine using DBeaver. Could you please help, since you were able to connect?
Thanks in advance!
Manas