Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Join us at FabCon Vienna from September 15-18, 2025, for the ultimate Fabric, Power BI, SQL, and AI community-led learning event. Save €200 with code FABCOMM. Get registered

Reply
smpa01
Super User
Super User

Dataflow data

Using sempy one can query the data from a dataset directly into notebook.
Is there any way to similarly query/read the data directly from dataflow (gen1/gen2 df) in a notebook?
I am not sure if sempy can query the data from a df.
Is there any method available whatsoever (sempy, API, other workaround)?
Thank you in advance.

Did I answer your question? Mark my post as a solution!
Proud to be a Super User!
My custom visualization projects
Plotting Live Sound: Viz1
Beautiful News:Viz1, Viz2, Viz3
Visual Capitalist: Working Hrs
1 ACCEPTED SOLUTION
v-vpabbu
Community Support
Community Support

Hi @smpa01,

 

Thank you for reaching out to Microsoft Fabric Community Forum.

 

SemPy is designed primarily for querying Power BI datasets. However, it is important to note that it does not have built-in capabilities for querying dataflows. For querying dataflows, you may need to consider alternative tools or methods, as suggested by the super users below.

 

Regards,

Vinay Pabbu

View solution in original post

10 REPLIES 10
v-vpabbu
Community Support
Community Support

Hi @smpa01,

 As we haven’t heard back from you, we wanted to kindly follow up to check if the solution provided for the issue worked? or Let us know if you need any further assistance?
If our response addressed, please mark it as Accept as solution and click Yes if you found it helpful.

 

Regards,

Vinay Pabbu

Hi @smpa01,

 

we wanted to kindly follow up to check if the solution provided for the issue worked? or Let us know if you need any further assistance?
If our response addressed, please mark it as Accept as solution and click Yes if you found it helpful.

 

Regards,

Vinay Pabbu

 

Hi @smpa01,

 

May I ask if you have gotten this issue resolved?

If it is solved, please mark the helpful reply or share your solution and accept it as solution, it will be helpful for other members of the community who have similar problems as yours to solve it faster.

 

Regards,

Vinay Pabbu

v-vpabbu
Community Support
Community Support

Hi @smpa01,

 

Thank you for reaching out to Microsoft Fabric Community Forum.

 

SemPy is designed primarily for querying Power BI datasets. However, it is important to note that it does not have built-in capabilities for querying dataflows. For querying dataflows, you may need to consider alternative tools or methods, as suggested by the super users below.

 

Regards,

Vinay Pabbu

frithjof_v
Super User
Super User

If you're using Dataflow Gen2, why not write the outputs to a Lakehouse and the Notebook can query the Lakehouse 

DF Gen2 is yet to be cleared by IT + DF gen2 has issue with port 1433 . There is always something.

Did I answer your question? Mark my post as a solution!
Proud to be a Super User!
My custom visualization projects
Plotting Live Sound: Viz1
Beautiful News:Viz1, Viz2, Viz3
Visual Capitalist: Working Hrs

Concur.  That 1433 issue is an unvoluntary showstopper for us too. Out Cyber Security team does not take kindly to Microsoft's suggestion to "just open this port for these couple endpoints".

lbendlin
Super User
Super User

Dataflows Gen1 are stored as Azure Blobs.  It is technically possible, but very, very convoluted, to access them directly. Not worth the effort in my opinion. 

https://community.powerbi.com/t5/Service/Creating-CSV-file-in-Sharepoint-from-Power-BI-Dataflows/m-p...

 

Not really sure about Dataflows Gen2 - they give you the option to specify alternative destinations, so they may not even store anyting themselves.

@lbendlin Thanks. But the method described here is not straightforward.

 

I ideally want to be able to read the data by making an API call or so in a notebook.

The fact that,

a. there is a way to read the data in notebook directly from some sources (databricks,ssas,smartsheet etc as API access is secured)

and

b. there would always be data residing in sources (sharepoint, sql, others - can only be read through dataflow as direct API access is restricted by the data owners)

 

if MS can provide access to dataflow data (for #b) using API (last time I checked it is currently not possible, PBI access token does not have Dataflow as audience), it would be so much more easier to utilize notebook as the single source to read data from different sources and build downstream applications.

 

The alternative is to scrape dataflow altogether and build ssas db (to utilize #a in notebook) but the caveat is dataflow usually has more memory attributed than dataset.

 

I don't to how to solve this problem. The ecosystem is following

 

previously power query (limited opportunites, not optimized), notebook ( query/ transform/ wrangle/ ML from different sources) -> downstream storage (Lakehouse,KQL, DW) ->downstream applications (BI, others)

 

 

Did I answer your question? Mark my post as a solution!
Proud to be a Super User!
My custom visualization projects
Plotting Live Sound: Viz1
Beautiful News:Viz1, Viz2, Viz3
Visual Capitalist: Working Hrs

Occam's Razor - challenge the need for the dataflow if you can do all this with Data Factory, Pipelines and notebooks.

 

Even in legacy Power BI dataflows are very often a solution looking for a problem.

Helpful resources

Announcements
Join our Fabric User Panel

Join our Fabric User Panel

This is your chance to engage directly with the engineering team behind Fabric and Power BI. Share your experiences and shape the future.

June FBC25 Carousel

Fabric Monthly Update - June 2025

Check out the June 2025 Fabric update to learn about new features.

June 2025 community update carousel

Fabric Community Update - June 2025

Find out what's new and trending in the Fabric community.