SachiP
Regular Visitor

How is your team handling fuzzy duplicates, outliers, and missing values in Fabric?

We’re building a tool to help automate the worst parts of real-world data cleaning — especially for teams working in Fabric and Power BI.
Common headaches we hear from data teams:

  • Fuzzy duplicates across merged sources (different spellings, casing, etc.)

  • Outliers that skew dashboards and break models

  • Missing values that kill calculated columns or ML prep

We’ve built patterns to automate:

  • Dynamic outlier detection (beyond simple Z-scores)

  • Smart missing value imputation (context-aware)

  • Fuzzy matching + deduplication across joins
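For readers who want something concrete, here is a minimal sketch of what these three patterns could look like in a Fabric notebook. It is illustrative only and is not the tool described in this post: the function names, thresholds, and sample data are all hypothetical, and it uses only the Python standard library. The outlier check uses the median-based modified z-score (with the commonly cited 0.6745 scaling constant and 3.5 cutoff) as one example of going beyond a plain Z-score.

```python
from difflib import SequenceMatcher
from statistics import median


def normalize(name):
    """Lowercase and collapse whitespace so casing/spacing differences vanish."""
    return " ".join(name.lower().split())


def fuzzy_dedupe(names, threshold=0.85):
    """Keep the first spelling seen; drop later names that are near-matches.

    `threshold` is an assumed similarity cutoff (0..1), not a recommended value.
    """
    kept = []
    for name in names:
        key = normalize(name)
        is_dup = any(
            SequenceMatcher(None, key, normalize(k)).ratio() >= threshold
            for k in kept
        )
        if not is_dup:
            kept.append(name)
    return kept


def mad_outliers(values, cutoff=3.5):
    """Flag outliers with the modified z-score (median and MAD).

    Unlike mean/stdev, the median and MAD are not dragged by the outliers.
    """
    med = median(values)
    mad = median(abs(v - med) for v in values)
    if mad == 0:  # more than half the values are identical; nothing to scale by
        return [False] * len(values)
    return [abs(0.6745 * (v - med) / mad) > cutoff for v in values]


def impute_by_group(rows, group_key, value_key):
    """Fill None values with the group's median, else the overall median.

    Assumes at least one non-missing value exists overall.
    """
    overall = median(r[value_key] for r in rows if r[value_key] is not None)
    by_group = {}
    for r in rows:
        if r[value_key] is not None:
            by_group.setdefault(r[group_key], []).append(r[value_key])
    filled = []
    for r in rows:
        if r[value_key] is None:
            group_vals = by_group.get(r[group_key])
            fill = median(group_vals) if group_vals else overall
            r = {**r, value_key: fill}  # copy so the input rows stay untouched
        filled.append(r)
    return filled


if __name__ == "__main__":
    # Hypothetical sample data, for illustration only.
    print(fuzzy_dedupe(["Contoso Ltd", "contoso  ltd", "Contoso Ltd.", "Fabrikam Inc"]))
    print(mad_outliers([10, 12, 11, 13, 12, 95]))
    print(impute_by_group(
        [{"region": "East", "sales": 100},
         {"region": "East", "sales": None},
         {"region": "East", "sales": 120}],
        "region", "sales",
    ))
```

The pairwise comparison in `fuzzy_dedupe` is quadratic in the number of names, so on large tables it would need blocking (for example, only comparing names that share a first letter or postcode) or a dedicated matching library.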

👉Curious: How is your team currently solving these?
Is it mostly manual, or are you using any automated tools?

Would love to hear what’s working — or what’s still painful.

@FabricPlatformForums

1 ACCEPTED SOLUTION
v-pgoloju
Community Support

Hi @SachiP,

So far, I've been handling fuzzy duplicates, outliers, and missing values manually in Power BI.

Thanks & Regards,

Prasanna Kumar


2 REPLIES
SachiP
Regular Visitor

Great, thanks! How long does it usually take you to clean a dataset?

