Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Compete to become Power BI Data Viz World Champion! First round ends August 18th. Get started.

Reply
SachiP
New Member

How is your team handling fuzzy duplicates, outliers, and missing values in Fabric?

We’re building a tool to help automate the worst parts of real-world data cleaning — especially for teams working in Fabric and Power BI.
Common headaches we hear from data teams:

  • Fuzzy duplicates across merged sources (different spellings, casing, etc.)

  • Outliers that skew dashboards and break models

  • Missing values that kill calculated columns or ML prep

We’ve built patterns to automate:

  • Dynamic outlier detection (beyond simple Z-scores)

  • Smart missing value imputation (context-aware)

  • Fuzzy matching + deduplication across joins

👉Curious: How is your team currently solving these?
Is it mostly manual, or are you using any automated tools?

Would love to hear what’s working — or what’s still painful.

@FabricPlatformForums

1 ACCEPTED SOLUTION
v-pgoloju
Community Support
Community Support

Hi @SachiP,

 

As per my knowledge, I've been using a manual process to handle fuzzy duplicates, outliers, and missing values in Power BI

 

Thanks & Regards,

Prasanna Kumar

View solution in original post

2 REPLIES 2
SachiP
New Member

great Thanks! How much time does it take for you to clean a dataset?

v-pgoloju
Community Support
Community Support

Hi @SachiP,

 

As per my knowledge, I've been using a manual process to handle fuzzy duplicates, outliers, and missing values in Power BI

 

Thanks & Regards,

Prasanna Kumar

Helpful resources

Announcements
July PBI25 Carousel

Power BI Monthly Update - July 2025

Check out the July 2025 Power BI update to learn about new features.

August 2025 community update carousel

Fabric Community Update - August 2025

Find out what's new and trending in the Fabric community.

Top Solution Authors
Top Kudoed Authors