Don't miss your chance to take the Fabric Data Engineer (DP-700) exam on us!
Learn moreNext up in the FabCon + SQLCon recap series: The roadmap for Microsoft SQL and Maximizing Developer experiences in Fabric. All sessions are available on-demand after the live show. Register now
Dear Community,
After having read countless pages on the subject and been in dialogue with MS Support I still fail to get a good answer on what is the "best" setup.
We are running;
Current solution;
Considering that the amount of data is really not that large, I am concerned that we are facing these issues already.
Any ideas on what could be improved from an architectural and technical point of view?
Dataflows are good at raw storage, similar to Parquet blobls or Hadoop. They suck at anything requires computation, including incremental refresh. If your data source has an ok spooling performance there is no need for dataflows at all. (And don't get me started with Direct Query against dataflows).
If you must, feed your dataset from the raw dataflow and do the computations on the dataset side. Implement incremental refresh in your dataset, and do the partition management yourself.
If you have recently started exploring Fabric, we'd love to hear how it's going. Your feedback can help with product improvements.
A new Power BI DataViz World Championship is coming this June! Don't miss out on submitting your entry.
Share feedback directly with Fabric product managers, participate in targeted research studies and influence the Fabric roadmap.