Earn a 50% discount on the DP-600 certification exam by completing the Fabric 30 Days to Learn It challenge.
Hi everyone,
I am using a folder as the "source data" for a power bi report. This folder is supposed to contains multiple files, all using the same columns, so that PowerBI can merge them properly.
However, users are making the following mistakes:
- entering two records with the same primary key in the same file
- creating records for which there's no dimension (ex: Sales for Customer XYZ, when there's no customer XYZ in the corresponding Dimension table.
Is there a way to verify files before import, to make sure no errors exist in the data, a get a list of errors to address? Specifically
- Identify duplicate primary keys.
- missing dimensions
- Identify wrong datatype (ex: text where there should be date)
- files with mismatching columns
Thanks in advance
Solved! Go to Solution.
Hi @Datahungry,
Based on my research, I am afraid you could not achieve these features currently, you could summit an idea in ideas forum:
https://ideas.powerbi.com/forums/265200-power-bi-ideas
Or you could use the query editor in Power BI Desktop to clean your data before you import your data to the data model:
https://docs.microsoft.com/en-us/power-bi/guided-learning/gettingdata?tutorial-step=4
Regards,
Daniel He
Hi @Datahungry,
Could you please tell me if your problem has been solved? If it is, could you please mark the helpful replies as Answered to close this topic?
Regards,
Daniel He
Hi @Datahungry,
Based on my research, I am afraid you could not achieve these features currently, you could summit an idea in ideas forum:
https://ideas.powerbi.com/forums/265200-power-bi-ideas
Or you could use the query editor in Power BI Desktop to clean your data before you import your data to the data model:
https://docs.microsoft.com/en-us/power-bi/guided-learning/gettingdata?tutorial-step=4
Regards,
Daniel He
Might be able to do something like that in R or Python perhaps. Are you thinking a pre-process or as part of the query?
Right now I have no solution, so either would be helpful
Is it possible to accomplish both?
User | Count |
---|---|
98 | |
90 | |
78 | |
72 | |
65 |
User | Count |
---|---|
114 | |
97 | |
95 | |
68 | |
65 |