Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Get Fabric Certified for FREE during Fabric Data Days. Don't miss your chance! Request now

Reply
SachiP
Regular Visitor

How is your team handling fuzzy duplicates, outliers, and missing values in Fabric?

We’re building a tool to help automate the worst parts of real-world data cleaning — especially for teams working in Fabric and Power BI.
Common headaches we hear from data teams:

  • Fuzzy duplicates across merged sources (different spellings, casing, etc.)

  • Outliers that skew dashboards and break models

  • Missing values that kill calculated columns or ML prep

We’ve built patterns to automate:

  • Dynamic outlier detection (beyond simple Z-scores)

  • Smart missing value imputation (context-aware)

  • Fuzzy matching + deduplication across joins

👉Curious: How is your team currently solving these?
Is it mostly manual, or are you using any automated tools?

Would love to hear what’s working — or what’s still painful.

@FabricPlatformForums

1 ACCEPTED SOLUTION
v-pgoloju
Community Support
Community Support

Hi @SachiP,

 

As per my knowledge, I've been using a manual process to handle fuzzy duplicates, outliers, and missing values in Power BI

 

Thanks & Regards,

Prasanna Kumar

View solution in original post

2 REPLIES 2
SachiP
Regular Visitor

great Thanks! How much time does it take for you to clean a dataset?

v-pgoloju
Community Support
Community Support

Hi @SachiP,

 

As per my knowledge, I've been using a manual process to handle fuzzy duplicates, outliers, and missing values in Power BI

 

Thanks & Regards,

Prasanna Kumar

Helpful resources

Announcements
Fabric Data Days Carousel

Fabric Data Days

Advance your Data & AI career with 50 days of live learning, contests, hands-on challenges, study groups & certifications and more!

October Power BI Update Carousel

Power BI Monthly Update - October 2025

Check out the October 2025 Power BI update to learn about new features.

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.