Advance your Data & AI career with 50 days of live learning, dataviz contests, hands-on challenges, study groups & certifications and more!
Get registeredGet Fabric Certified for FREE during Fabric Data Days. Don't miss your chance! Request now
We’re building a tool to help automate the worst parts of real-world data cleaning — especially for teams working in Fabric and Power BI.
Common headaches we hear from data teams:
Fuzzy duplicates across merged sources (different spellings, casing, etc.)
Outliers that skew dashboards and break models
Missing values that kill calculated columns or ML prep
We’ve built patterns to automate:
Dynamic outlier detection (beyond simple Z-scores)
Smart missing value imputation (context-aware)
Fuzzy matching + deduplication across joins
👉Curious: How is your team currently solving these?
Is it mostly manual, or are you using any automated tools?
Would love to hear what’s working — or what’s still painful.
@FabricPlatformForums
Solved! Go to Solution.
Hi @SachiP,
As per my knowledge, I've been using a manual process to handle fuzzy duplicates, outliers, and missing values in Power BI
Thanks & Regards,
Prasanna Kumar
great Thanks! How much time does it take for you to clean a dataset?
Hi @SachiP,
As per my knowledge, I've been using a manual process to handle fuzzy duplicates, outliers, and missing values in Power BI
Thanks & Regards,
Prasanna Kumar
Advance your Data & AI career with 50 days of live learning, contests, hands-on challenges, study groups & certifications and more!
Check out the October 2025 Power BI update to learn about new features.
| User | Count |
|---|---|
| 8 | |
| 7 | |
| 7 | |
| 4 | |
| 3 |