Power BI is turning 10! Tune in for a special live episode on July 24 with behind-the-scenes stories, product evolution highlights, and a sneak peek at what’s in store for the future.
Save the dateEnhance your career with this limited time 50% discount on Fabric and Power BI exams. Ends August 31st. Request your voucher.
Hi,
I've got an issue in a Synapse Spark cluster whereby a 3rd party tool generates data and saves to the clusters local tmp directory. What I would like to do is use Spark to read the parquet files that are saved locally, problem is I don't seem to be able to do that with syntax in a Notebook:
spark.read.parquet("File:///tmp/Files/tableone/*.parquet")
I get "file not found" even though the error message actually shows the parquet file, so not sure why it returns "file not found".
Any help appreciated.
HI @AndyDDC,
After further test with this, I think this should related to the path. Currently the notebook will try to find the file from the notebook run time environment. (obviously, your file should not exist in it)
Perhaps you can try to move the file to file server or share this file/folder and allow to access remotely. Then you can use correspond file path and use your notebook to access and read data remotely.
Regards,
Xiaoxin Sheng
Hi @AndyDDC,
I think this issue should be related to the tmp folder, normally it used to cache some temporary data and they may been cleanup to release resources.
I'd like to suggest you check the folder to confirm if this parquet file stay in the folder and you have enough permission during the notebook operations.(if files have been occupancy with other processing, they may be lockup and you can't read and edit it)
You can also try to move the file to a general folder instead of the specific folders and try again with your operations.
Regards,
Xiaoxin Sheng
This is your chance to engage directly with the engineering team behind Fabric and Power BI. Share your experiences and shape the future.
User | Count |
---|---|
5 | |
4 | |
2 | |
2 | |
2 |