March 31 - April 2, 2025, in Las Vegas, Nevada. Use code MSCUST for a $150 discount! Early bird discount ends December 31.
Not sure if this is the right forum, but here is the issue.
We are loading data to a Lakehouse using Gen2 dataflows (for now they just point at existing Gen1 dataflows and then do the Lakehouse insert - we will rectify this later on).
Over time it is typical for columns to be added, removed and/or updated in a dataflow. With a datamart these changes are reflected automatically in the schema; however, with a Lakehouse, when adding a new column to the dataflow I can see no way to bring that into the Lakehouse.
What do I need to do here - the only options I can see are:
1: Import it as a new table, but that seems very clunky as you would need to update queries / stored procedures on your SQL endpoint to cater for this.
2: Delete the existing table in the Lakehouse and then add a new one with the same name.
Am I missing something?
It seems to be possible to add columns to a Lakehouse table now by using a notebook.
I am able to use the following type of command in a Notebook:
%%sql
ALTER TABLE tableName
ADD COLUMN columnName dataType
And the table will also get updated in the SQL Analytics Endpoint and the Direct Lake semantic model, something which was a problem before.
Ref. this thread:
https://community.fabric.microsoft.com/t5/General-Discussion/SQL-ALTER-command/m-p/3748079#M4861
However, I get an error if I try to rename or remove (drop) a column.
Maybe this is a solution for renaming columns, dropping columns and changing column type in Lakehouse tables:
https://community.fabric.microsoft.com/t5/General-Discussion/Dropping-and-recreating-lakehouse-table...
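For what it's worth, on Delta tables (which back Lakehouse tables), RENAME COLUMN and DROP COLUMN generally fail until column mapping is enabled on the table. A hedged sketch of that, assuming a Spark notebook against the Lakehouse - tableName, oldName, newName and columnName are all placeholders:

```sql
-- Enable name-based column mapping first (a Delta requirement for rename/drop)
ALTER TABLE tableName SET TBLPROPERTIES (
  'delta.columnMapping.mode' = 'name',
  'delta.minReaderVersion' = '2',
  'delta.minWriterVersion' = '5'
);

-- After that, these statements should be accepted
ALTER TABLE tableName RENAME COLUMN oldName TO newName;
ALTER TABLE tableName DROP COLUMN columnName;
```

Note that enabling column mapping upgrades the table's Delta protocol version, and downstream readers such as the SQL Analytics Endpoint may not support tables with column mapping enabled - worth testing on a throwaway table first.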
There is a 3rd option which worked for me:
3. Rename the original table, for example rename "Table" to "Table1".
Then go to your Dataflow and set up the destination of your Dataflow again (of course, create a new table called "Table").
So where we ended up with this is moving to using a warehouse for almost everything - and creating our own tables in it.
This allows you to alter them and create primary keys (not enforced).
Lakehouse is good for super unstructured data but if you have structured data then a warehouse is a much better option.
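As a sketch of what that looks like in a Fabric Warehouse - the table and column names here are illustrative, and Fabric requires key constraints to be NONCLUSTERED and NOT ENFORCED:

```sql
CREATE TABLE dbo.DimCustomer (
    CustomerKey  INT NOT NULL,
    CustomerName VARCHAR(100)
);

-- Fabric Warehouse accepts primary keys only as metadata (not enforced)
ALTER TABLE dbo.DimCustomer
ADD CONSTRAINT PK_DimCustomer PRIMARY KEY NONCLUSTERED (CustomerKey) NOT ENFORCED;
```

The NOT ENFORCED constraint is informational (useful to the optimizer and to modeling tools), so uniqueness still has to be guaranteed by your load process.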
I have used a Python notebook to add a column to an existing table and that works just fine. One can use a Spark DataFrame or a pyspark.pandas DataFrame to get the desired outcome.
Would really appreciate any additional insight or links to resources that could be provided on this subject.
could you share how to do this in python?
I was wondering the same. 🙂
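A minimal sketch of the notebook approach described above, assuming a Fabric notebook with the Lakehouse attached (so the spark session exists) - tableName and newColumn are placeholders:

```python
from pyspark.sql import functions as F

# Read the existing Lakehouse (Delta) table
df = spark.read.table("tableName")

# Add the new column with a default value; adjust the cast to the data type you need
df = df.withColumn("newColumn", F.lit(None).cast("string"))

# Overwrite the table, allowing the schema to change
(df.write
   .mode("overwrite")
   .option("overwriteSchema", "true")
   .saveAsTable("tableName"))
```

Be aware that overwrite rewrites the whole table; for large tables the %%sql ALTER TABLE ... ADD COLUMN approach shown earlier in the thread is cheaper since it is metadata-only.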