Don't miss your chance to take the Fabric Data Engineer (DP-600) exam for FREE! Find out how by attending the DP-600 session on April 23rd (pacific time), live or on-demand.
Learn moreNext up in the FabCon + SQLCon recap series: The roadmap for Microsoft SQL and Maximizing Developer experiences in Fabric. All sessions are available on-demand after the live show. Register now
Hello
I need some help with cleaning my data. I have a lot of duplicates in my table. I want to keep the record which contains the most data.
This is how my table looks like:
Records with the same 'key intervention' are duplicates. In the perfect scenario the records have no 'null' in 'time call', 'time arrival' and 'time end'. I want to keep the records in blue and remove the other records.
Help is really appreciated!
Solved! Go to Solution.
Hi @boa ,
Here are the steps you can follow:
1. Go to Power Query – Transform – Group By – Advanced – Set as follows.
2. Select the new column – click the expand flag -- check [time call], [time arrival], [time end].
3. Add Column – Custom Column – Enter the following code.
if [Count] = [time call] then 1 else 0
4. Select [Custom] – select only 1.
5. Check [Count], [Custom] – Remove Columns.
6. Result:
Best Regards,
Liu Yang
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly
Hi @boa ,
Here are the steps you can follow:
1. Go to Power Query – Transform – Group By – Advanced – Set as follows.
2. Select the new column – click the expand flag -- check [time call], [time arrival], [time end].
3. Add Column – Custom Column – Enter the following code.
if [Count] = [time call] then 1 else 0
4. Select [Custom] – select only 1.
5. Check [Count], [Custom] – Remove Columns.
6. Result:
Best Regards,
Liu Yang
If this post helps, then please consider Accept it as the solution to help the other members find it more quickly
Thank you for your help and excellent explanation! 🙏
go to transform data and goto power query editor and there select time arrival column click on down arrow and deselect null values, then you will get.
Hello sudhav
If I deselect "null" values in the 'time arrival' column (=remove empty?) I remove also records that I want to keep... If the arrival time is empty and the end time is not empty/or empty, I want to keep it as well
If you have recently started exploring Fabric, we'd love to hear how it's going. Your feedback can help with product improvements.
A new Power BI DataViz World Championship is coming this June! Don't miss out on submitting your entry.
Experience the highlights from FabCon & SQLCon, available live and on-demand starting April 14th.
| User | Count |
|---|---|
| 47 | |
| 44 | |
| 40 | |
| 20 | |
| 15 |
| User | Count |
|---|---|
| 70 | |
| 68 | |
| 32 | |
| 27 | |
| 25 |