Hello,
I'm encountering an out-of-memory error when running a Python script in Power BI to manipulate a dataset. I want to truncate the dataset to its first 5 rows using a Python script step in the Power Query Editor. Here's the script I am using:
import pandas as pd
# 'dataset' holds the input data for this script
# Keep only the first 5 rows
dataset = dataset.head(5)
However, when I execute the script, I receive the following error message:
DataSource.Error: ADO.NET: Python script error.
pandas.errors.ParserError: Error tokenizing data. C error: out of memory
Details:
DataSourceKind=Python
DataSourcePath=Python
Message=Python script error.
ErrorCode=-2147467259
ExceptionType=Microsoft.PowerBI.Scripting.Python.Exceptions.PythonScriptRuntimeException
The dataset is not particularly large, so it's puzzling why this memory issue arises. I've tried to ensure that my system has enough available memory and even filtered the dataset in Power Query before handing it off to Python, but to no avail.
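One way to sanity-check the size claim outside Power BI is to measure the table's in-memory footprint in a standalone Python session. This is only a minimal sketch: the DataFrame `df` below is a hypothetical stand-in, not the actual data, and it assumes the real table can be loaded into pandas some other way.

```python
import pandas as pd

# Hypothetical stand-in for the real table; replace with the actual data.
df = pd.DataFrame({"id": range(1000), "label": ["row"] * 1000})

# deep=True also counts the memory held by string (object) columns.
size_mb = df.memory_usage(deep=True).sum() / 1024**2
print(f"{len(df)} rows, {size_mb:.2f} MiB in memory")

# The same truncation as in the Power BI script step.
top5 = df.head(5)
print(top5)
```

If the footprint is small here and `head(5)` works, the truncation itself is unlikely to be the problem.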
Has anyone faced a similar issue or can offer insights into what might be going wrong? Any advice on how to troubleshoot this error would be greatly appreciated.
Thank you in advance for your help!
Do you see the same issue when keeping the top 5 rows in Power Query?
Hi @HamidBee ,
I really don't have experience with this, but could you check the format of the retrieved rows? It looks like it is basically a parsing error.
A quick internet search on this error suggests it is a common error related to formatting.
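As a rough way to check the formatting outside Power BI, one could export the table to CSV and read just a few rows with pandas. This is only a sketch under assumptions: `sample_csv` is a made-up stand-in for such an export, and switching to `engine="python"` is a diagnostic idea, not a confirmed fix. The "C error" prefix in the message suggests the failure comes from pandas' default C parser, so trying the other engine may help narrow it down.

```python
import io
import pandas as pd

# Hypothetical sample standing in for a CSV export of the table.
# One field contains a comma inside quotes, a common source of parsing trouble.
sample_csv = io.StringIO(
    'id,comment\n'
    '1,"plain text"\n'
    '2,"text with, a comma"\n'
    '3,"more text"\n'
    '4,"even more text"\n'
    '5,"last of five"\n'
    '6,"never read"\n'
)

# engine="python" is slower than the default C parser but is a different
# code path; nrows=5 stops reading after five data rows.
preview = pd.read_csv(sample_csv, engine="python", nrows=5)
print(preview.shape)
print(preview.dtypes)
```

If a small preview like this loads cleanly but the full file does not, the problem is probably in a later row (for example an unclosed quote) rather than in the size of the data.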
Hope it helps.