Hi Experts,
Is there a way to do all the data cleaning in Power Query and then import the resulting table into R or Python? Please let me know.
Regards
Afzal khan
I checked, and it works.
For example, you can load a file with Python, transform it in Power Query, and then pass the result back to Python.
I'm not sure how useful such a workflow is, but it does work.
let
    Source = Python.Execute("import pandas as pd#(lf)df1 = pd.read_parquet(r'C:\Users\MyPC\Dropbox\Documents\DataScience\Pandas\pandas\chicago.parquet')"),
    df2 = Source{[Name="df1"]}[Value],
    #"Changed Type" = Table.TransformColumnTypes(df2,{{"Employee Annual Salary", type text}}),
    #"Replaced Value" = Table.ReplaceValue(#"Changed Type","$","",Replacer.ReplaceText,{"Employee Annual Salary"}),
    #"Filtered Rows" = Table.SelectRows(#"Replaced Value", each [Employee Annual Salary] <> null and [Employee Annual Salary] <> ""),
    #"Run Python script" = Python.Execute("# 'dataset' holds the input data for this script#(lf)dataset.to_parquet(r'C:\Users\MyPC\Dropbox\Documents\DataScience\Pandas\pandas\pbd.parquet')#(lf)",[dataset=#"Filtered Rows"])
in
    #"Run Python script"
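The Python scripts in the query above are passed to Python.Execute as M text literals, which is why each line break is written as #(lf). If you write these by hand, a small helper can do the escaping for you. This is only a sketch: the function name to_m_string_literal is made up for illustration, the file name in the example is shortened, and it covers just the escapes I am sure of (doubled quotes, #(lf), #(tab), and #(#) for a literal # in front of a parenthesis).

```python
def to_m_string_literal(script: str) -> str:
    """Encode Python source as a Power Query M text literal.

    Hypothetical helper for illustration; not part of Power BI or pandas.
    """
    # Normalise line endings so every break becomes a single line feed.
    text = script.replace("\r\n", "\n").replace("\r", "\n")
    # A literal "#(" would start an M escape sequence, so escape its "#".
    text = text.replace("#(", "#(#)(")
    # M doubles a quote character to include it inside a text literal.
    text = text.replace('"', '""')
    # Tabs and line feeds become M character escapes.
    text = text.replace("\t", "#(tab)").replace("\n", "#(lf)")
    return f'"{text}"'


if __name__ == "__main__":
    # Example script; the parquet file name here is a placeholder.
    script = "import pandas as pd\ndf1 = pd.read_parquet(r'chicago.parquet')"
    # Print an expression that can be pasted into the Advanced Editor.
    print(f"Source = Python.Execute({to_m_string_literal(script)}),")
```

Once the query has run, the cleaned table is just the parquet file written by the last step, so any later Python session can load it with pd.read_parquet using the same path.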