Has anyone been able to successfully install the spark-xml library on the spark pool in Fabric?
I added this in the workspace settings:
After a few minutes, it was 'successful', but the notebook still complains:
Am I missing something? Thanks in advance Fabric community! 🙂
@PRADEEPCH-MSFT Can you let us know if there is any feedback from the product team as to why this does not work and if there is a planned ETA for when this will work? This works perfectly fine in Synapse notebooks.
I opened a support ticket for this, but they never got it working and said it was not supported now. They didn't give any timeline or any good reason as to why it doesn't work.
As a workaround, I am reading the XML with scala, which does work with this package. Then just doing the rest of what I need to do in Pysypark.
Thanks @Joshrodgers123, that's a great suggestion.
With a little help from ChatGPT I was able to rewrite parts of my notebook to use Scala, and then drop back into Pyspark to do the rest of the work.
It's a shame that Microsoft haven't got this working properly, especially as it works in Synapse Analytics, but this workaround will do for now.
@versydney - As per the repro, I was experiencing the same issue from my end and reached out to the internal team to get more information related to your query and will get back to you as soon as we have an update.
Has there been any update from Microsoft on this?
I'm running into the same problem as @versydney .
Spark-XML package uploaded via Library Management on the Fabric Workspace:
In my notebook, I have the following code:
And this returns the error:
This is the third iteration of this particular notebook, having first written it a couple of years ago to work on Databricks, then last year it was converted to a Synapse Spark Pool notebook and now it's been converted to a Microsoft Fabric notebook.
In Databricks and Synapse, simply importing the spark-xml package was enough to get it working, but for some reason the same approach isn't working in Fabric.
Check out the November 2023 Fabric update to learn about new features.
130+ sessions, 130+ speakers, Product managers, MVPs, and experts. All about Power BI and Fabric. Attend online or watch the recordings.
Ask your questions about Synapse here!