Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Enhance your career with this limited time 50% discount on Fabric and Power BI exams. Ends August 31st. Request your voucher.

Reply
sunilmaghanuru
Frequent Visitor

Cloudfiles error

Py4JJavaError: An error occurred while calling o4364.load. : org.apache.spark.SparkClassNotFoundException: [DATA_SOURCE_NOT_FOUND] Failed to find the data source: cloudFiles. Please find packages at `https://spark.apache.org/third-party-projects.html`.

1 REPLY 1
Anonymous
Not applicable

Hi @sunilmaghanuru ,

 

Please follow the steps below:

1. The error message indicates a mismatch between Python versions in the worker and driver environments. Ensure that both environments use the same minor Python version.Check the environment variables PYSPARK_PYTHON and PYSPARK_DRIVER_PYTHON to ensure they are correctly set.

2.The error may also be caused by Java heap space limitations. Consider adjusting the driver memory configuration in your Spark session.

3.Ensure that your Spark version supports cloudFiles. Compatibility can sometimes be an issue. If you're using Spark with a build that doesn't include `cloudFiles` by default, you might need to include the appropriate package when starting your Spark session. 

4.Some data sources require specific configuration settings. Review the documentation to ensure you have configured everything correctly for `cloudFiles`.

 

Best Regards,

Neeko Tang

If this post  helps, then please consider Accept it as the solution  to help the other members find it more quickly. 

Helpful resources

Announcements
Fabric July 2025 Monthly Update Carousel

Fabric Monthly Update - July 2025

Check out the July 2025 Fabric update to learn about new features.

August 2025 community update carousel

Fabric Community Update - August 2025

Find out what's new and trending in the Fabric community.

Top Kudoed Authors