dhorrall
Helper I

How to handle schema drift?

In legacy Data Factory there were options to explicitly allow schema drift. I do not see that in Fabric. Am I missing it?

For example,

  1. I was just doing some random testing, loading historical blob text files into a 'table' in Fabric to kick the tires
  2. I had created some intermediary Parquet files from the text files to get practice with that
  3. I then attempted to load those Parquet files to a 'table' and got the error: 'Source column is not defined in delta meta data'
  4. Obviously this is because the files have columns that evolved over time

I don't see a straightforward way to handle this.
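For anyone hitting the same error, here is a minimal PySpark sketch of one workaround in a Fabric notebook, assuming the files sit under the default Lakehouse (the paths and table name below are placeholders, not anything from this thread): mergeSchema on the read unions columns across Parquet files with different schemas, and mergeSchema on the write lets the Delta table pick up columns it has not seen before instead of failing.

```python
# Sketch only: assumes a Fabric notebook where `spark` is the ambient
# SparkSession and "Files/..." resolves to the default Lakehouse.
df = (spark.read
          .option("mergeSchema", "true")   # union columns across Parquet files
          .parquet("Files/staging/history/"))

(df.write
    .format("delta")
    .mode("append")
    .option("mergeSchema", "true")         # let the table schema evolve instead of
    .saveAsTable("historical_data"))       # raising "Source column is not defined..."
```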

8 REPLIES
eldpbi
Frequent Visitor

Consider this scenario: copying multiple CSV files from OneLake to a Data Warehouse within a Fabric capacity. The CSVs have different schemas, and the tables in the Data Warehouse have to be auto-created (which requires schema drift to be on). Since the schemas are different, one cannot provide them in the "Mapping" section of the "Copy" activity, but the current pipelines in Fabric still ask for that.

For more context: GetMetadata gets all the child items in the OneLake folder, and ForEach runs the CopyData activity on each child item.



@ajarora @GraceGu @haha 
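As a rough workaround for this scenario (not the Copy activity itself), a notebook can enumerate the folder and auto-create one Delta table per CSV, inferring each file's schema instead of supplying a mapping. Note this writes Lakehouse tables rather than Warehouse tables; the folder path and table-naming rule below are assumptions.

```python
import os

# Assumes the default Lakehouse is mounted at /lakehouse/default in the notebook.
folder = "/lakehouse/default/Files/incoming"
for name in os.listdir(folder):
    if not name.endswith(".csv"):
        continue
    df = (spark.read
              .option("header", "true")
              .option("inferSchema", "true")    # per-file schema, no mapping needed
              .csv(f"Files/incoming/{name}"))
    table = name[:-4].lower().replace("-", "_")  # hypothetical naming rule
    df.write.format("delta").mode("overwrite").saveAsTable(table)
```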



Jreed_7474
New Member

Any update on whether schema drift will be added in the future? Especially since it looks like that functionality exists in ADF.

GraceGu
Microsoft Employee

I suppose the ask is for the explicit schema mapping in the copy activity that exists in ADF today. Editing the mapping for the Lakehouse destination will be coming in 1-2 months. @dhorrall, what destination are you looking for?

dhorrall
Helper I

Probably all of the above. The current 'Data Factory' has a checkbox to handle this; I see nothing like it in Fabric. That was the basis of my question.

ajarora
Microsoft Employee

What you are referring to is possible through Azure Data Factory Mapping Data Flows, but those are not available in Fabric. Perhaps you want to try out Fabric Dataflows and see whether they cover your scenario as-is?

In terms of what the copy activity allows: if your destination table already exists and the data you are writing is missing a column, that column will be defaulted to null (the default value) when writing to the destination. If there is a new column, or if a column is not type-castable to the destination type, the row is treated as a bad row, and you can either skip writing it (and log it to temporary storage to be processed later) or fail the operation (the default).
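Those semantics can also be reproduced in a notebook when the copy activity's defaults don't fit. A rough sketch with placeholder table and path names: missing columns are filled with typed nulls, and extra source columns either fail the run (the default described above) or are dropped to "skip" them.

```python
from pyspark.sql import functions as F

target = spark.table("dest_table")                 # existing destination table
src = spark.read.parquet("Files/staging/batch/")   # incoming data (placeholder path)

extra = set(src.columns) - set(target.columns)
if extra:
    # Fail by default, like the copy activity; or drop the columns to "skip":
    # src = src.drop(*extra)
    raise ValueError(f"Source has columns not in destination: {extra}")

# Columns missing from the source become typed nulls, mirroring the
# null-default behaviour described above.
aligned = src.select([
    F.col(c) if c in src.columns
    else F.lit(None).cast(target.schema[c].dataType).alias(c)
    for c in target.columns
])
aligned.write.format("delta").mode("append").saveAsTable("dest_table")
```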

Anonymous
Not applicable

Is the ADF Mapping Data Flow coming to Fabric?
We have the same kind of requirement with JSON files as the source, evolving with new attributes; we need schema drift to be available.

Any answer on this? We have the same requirements and need pipelines to be able to handle schema drift as they do under ADF.
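Until there is a built-in option, one possible workaround for evolving JSON (paths and the table name below are assumptions): Spark's JSON reader infers a single schema across all files in a folder, so attributes that appear only in newer files become nullable columns, and writing with mergeSchema then evolves the Delta table to match.

```python
# Sketch only: `spark` is the notebook's SparkSession; paths are placeholders.
df = spark.read.json("Files/incoming/json/")   # schema inferred across all files

(df.write
    .format("delta")
    .mode("append")
    .option("mergeSchema", "true")             # accept newly appeared attributes
    .saveAsTable("json_landing"))
```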

ajarora
Microsoft Employee

How do you expect the schema variation to have taken effect?

There are several possible situations: a column added, a column dropped, or a column type changed.
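For what it's worth, against a Delta table those three cases behave differently, as this rough sketch illustrates (the table, path, and column names are made up): an added column needs mergeSchema, a dropped column simply yields nulls for new rows on append, and a type change has to be cast back explicitly before writing.

```python
from pyspark.sql import functions as F

df = spark.read.parquet("Files/staging/latest/")  # placeholder source path

# Column type changed upstream: cast back to the destination type up front,
# since Delta will not silently change an existing column's type.
df = df.withColumn("amount", F.col("amount").cast("decimal(18,2)"))

(df.write
    .format("delta")
    .mode("append")
    .option("mergeSchema", "true")   # added columns are appended to the table;
    .saveAsTable("target_table"))    # dropped columns appear as null in new rows
```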
