Anonymous

Question on data flows in Power BI service

Hi All,

I need to create a visual that uses several database sources (SQL Server, Oracle, and Impala). For some of the sources, I need to retrieve data based on another table. At a very high level, I’ll pull records from TableA. TableB will be populated using separate logic, and only with rows whose customerid is also in TableA. I would do this in a single query, but the data isn’t in the same database. I did some googling, and it looks like the best way to approach this is through a dataflow, but before I proceed, I wanted to make sure that I can create a dataflow and still publish my visual. If you have any suggestions on how to accomplish this in Power BI, please let me know. Lastly, as I understand it, a dataflow will eliminate calls to the database every time the visual is viewed; a call will be made to the database only when the data is refreshed. Is that correct?

Thanks in advance!

1 ACCEPTED SOLUTION
ichavarria
Solution Specialist

Hi @Anonymous,

Yes, you can create a dataflow in Power BI to combine data from multiple sources, including SQL Server, Oracle, and Impala. In your case, you can create two entities, one for each table, and relate them on the customer ID column, for example by merging the two queries so that TableB keeps only the customers that also appear in TableA. You can then connect your report to the dataflow and build your visualizations on it.
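
To make the filtering logic concrete, here is a minimal sketch in plain Python. It is not dataflow code: it only mimics what an inner-join style merge on the customer ID would do. The function name `filter_by_customer`, the sample rows, and every column other than `customerid` are made up for illustration.

```python
# Illustrative only: plain Python standing in for a merge step in a dataflow.
# The sample rows and all columns except customerid are hypothetical.

def filter_by_customer(table_a, table_b, key="customerid"):
    """Keep the rows of table_b whose key value also appears in table_a."""
    known_ids = {row[key] for row in table_a}
    return [row for row in table_b if row[key] in known_ids]


# Pretend these rows were loaded from two different databases.
table_a = [
    {"customerid": 1, "name": "Ada"},
    {"customerid": 2, "name": "Grace"},
]
table_b = [
    {"customerid": 1, "order_total": 120.0},
    {"customerid": 3, "order_total": 75.5},
    {"customerid": 2, "order_total": 40.0},
]

filtered_b = filter_by_customer(table_a, table_b)
print(filtered_b)
```

Only the TableB rows for customers 1 and 2 survive, because customer 3 never appears in TableA. In an actual dataflow the same result would come from a merge in the query editor, not from a script like this.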

Before publishing your visual, you need to ensure that the dataflow is scheduled to refresh at the appropriate intervals, so your data is always up to date. You can schedule the refresh in the Power BI service.

Regarding your question about dataflows: yes, you are correct. A dataflow stores the data it loads, and the source databases are queried only when the dataflow itself is refreshed, whether on a schedule or on demand. Viewing a report, or refreshing a dataset built on the dataflow, reads the stored data instead of calling the source databases. This reduces the number of calls to the data sources and improves performance.
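
As a rough mental model of that behaviour, the toy Python class below counts how often the "database" is actually hit. It is only a sketch under simplifying assumptions: `CachedDataflow`, `fake_database_query`, and the sample rows are hypothetical names for illustration and are not part of any Power BI API.

```python
# Toy model of "query the source on refresh, serve stored data on view".
# All names here are hypothetical; this is not how Power BI is implemented.

class CachedDataflow:
    def __init__(self, fetch_from_source):
        self._fetch = fetch_from_source  # callable that queries the database
        self._rows = []                  # data stored by the last refresh
        self.source_calls = 0            # how many times the source was hit

    def refresh(self):
        """Query the source and replace the stored copy."""
        self.source_calls += 1
        self._rows = list(self._fetch())

    def read(self):
        """Return the stored copy without touching the source."""
        return list(self._rows)


def fake_database_query():
    # Stand-in for a SQL Server / Oracle / Impala query.
    return [{"customerid": 1}, {"customerid": 2}]


flow = CachedDataflow(fake_database_query)
flow.refresh()              # scheduled or manual refresh: one source call
for _ in range(5):          # five report views: no further source calls
    rows = flow.read()
print(flow.source_calls)    # prints 1
```

Five reads after a single refresh still leave the call counter at 1, which is the effect described above: the number of report views does not drive the load on the source databases, only the refresh schedule does.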

Best regards, 

Isaac Chavarria

If this post helps, then please consider Accepting it as the solution and giving Kudos to help the other members find it more quickly.
