March 31 - April 2, 2025, in Las Vegas, Nevada. Use code MSCUST for a $150 discount! Early bird discount ends December 31.
Register NowBe one of the first to start using Fabric Databases. View on-demand sessions with database experts and the Microsoft product team to learn just how easy it is to get started. Watch now
Hello everyone!
I have access to a Data lake (Databricks) that stores multiple hundred million records.
Unfortunately, all of these tables/columns/values are stored as a string.
Now, since I have to define the types for every column, would you do that in the dataflow or in the dataset?
Especially from a performance and refresh time point of view.
Unfortunately, it is not possible to define the column types in the data lake.
Hi @joshua1990
There are differences between dataflow and dataset in many aspects. Which one to choose depends on your needs. They can be used to solve various problems. Each implementation has it’s advantages and disadvantages.
Here's an article for your reference,
https://ssbipolar.com/2019/10/14/power-bite-dataflows-vs-datasets/
Best Regards,
Community Support Team _Tang
If this post helps, please consider Accept it as the solution to help the other members find it more quickly.
March 31 - April 2, 2025, in Las Vegas, Nevada. Use code MSCUST for a $150 discount!
Arun Ulag shares exciting details about the Microsoft Fabric Conference 2025, which will be held in Las Vegas, NV.
User | Count |
---|---|
25 | |
21 | |
11 | |
10 | |
9 |
User | Count |
---|---|
48 | |
30 | |
18 | |
17 | |
15 |