Explore and share Fabric Notebooks to boost Power BI insights in the new community notebooks gallery.
Check it out now!Microsoft is giving away 50,000 FREE Microsoft Certification exam vouchers. Get Fabric certified for FREE! Learn more
Hi, I'm working on an idea and it will involve combining around 1,000 excel files each with about 5,000 rows. I will then perform fuzzy match in just one column to look up some keywords which will potentially yield more tna 1 potential match (based on .40 threshold). What would be the best way to approach this considering the volume of the data? thanks!
I'm not exactly sure on how you will automate getting 1000 excel files. I guess that PBI will be very slow in getting so many different files. I would consolidate first in a single excel and load that.
Anyway 5 million rows is large but not extremely large for PBI. You can apply your formulas on a fuzzy match with a calculated column.
the files are saved in a SP library, and PBI will be used to combine them and then perform the fuzzy match. How can I apply fuzzy match using calculated columns?