Explore and share Fabric Notebooks to boost Power BI insights in the new community notebooks gallery.
Check it out now!Microsoft is giving away 50,000 FREE Microsoft Certification exam vouchers. Get Fabric certified for FREE! Learn more
Hi. Can someone guide me how to connect PBI Desktop to APACHE SPARK installed on a local windows machine? What should be the server details I should pass?
I have seen guidance for databricks & hdinsight but I need guidance for connection to local machine installed spark here. Any help is appreciable.
Hi @Anonymous ,
You could try the following steps.
1. Add below properties to /home/user/spark/conf/hive-site.xml and /home/user/apachehive/conf/hive-site.xml
<property> <name> hive.server2.transport.mode </name> <value> http </value> </property> <property> <name> hive.server2.thrift.http.port </name> <value> 10001 </value> </property> <property> <name> hive.server2.http.endpoint </name> <value> cliservice </value> </property>
2. Start hive metastore => start hive => start spark thriftserver
The commands are available below
cd /home/user/apachehive/bin/ //get into the directory of hive
./hive --service metastore& //start metastore
./hive //to start hive
cd /home/user/spark/sbin/ //get into the directory of spark
./start-thriftserver //to start spark thrift server
3. Open the powerbi and click on get data after that go to the spark and click on connect button.
4. Now in the server put http://hostipaddres:10001/cliservice
Select HTTP in Protocol
In Data Connectivity mode check on DirectQuery (means dont want to import data just directly work on it ) or Import (means import the data and then work on it )
5. Sign in your account and then choose the tables which you want and load it.
Reference this: Connect PowerBI with Spark
Best Regards,
Xue Ding
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.
Any suggestions here pls?