DaxPadawan
Helper III

Using Amazon Redshift as a data source

I'm a little confused by the many options I've seen online for using Amazon Redshift as a data source.

 

The two options I am considering at the moment:

1. Use a Dataflow to connect to Redshift. This was the solution initially proposed by the client. However, as I understand it, Dataflows are used for data transformation. The tables we have on Redshift are intended to be "final" tables; they're essentially ready to be visualized straight away.

 

2. Just connect straight to Redshift on Desktop, publish the report, and then schedule refreshes from there. 

 

Our data is fairly small (less than 5 MB) and is not expected to grow particularly large. Can someone help make the two alternatives clearer? What are the pros and cons?

2 ACCEPTED SOLUTIONS
GilbertQ
Super User

Hi @DaxPadawan 

 

If all the data is ready, I would use option 2. I always say keep it as simple as possible.






v-yueyunzh-msft
Community Support

Hi @DaxPadawan,

Here are some answers based on my own understanding. You currently have two options and would like a recommendation on which one to choose.


My understanding is that the choice depends on how complex the data transformation (ETL) is once you connect to the Redshift data source. The more complex the ETL, the more transformation work each dataset refresh has to handle, and the more time and resources the refresh consumes when you connect directly. If the refresh would exceed the time limit (2 hours for shared capacity, 5 hours for dedicated capacity), it is highly recommended that you use a dataflow as a middle layer to run the ETL first. After the dataflow refresh succeeds, the dataset that points to it is refreshed. If the transformation is not complicated, it is recommended that you connect directly to Redshift from the dataset (option 2).
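As a rough illustration only, the rule of thumb above can be written as a small Python sketch. The function name `recommend_connection` and the idea of passing in an estimated refresh duration are hypothetical, introduced here for illustration; this is not a Power BI API. The only figures used are the 2-hour and 5-hour limits quoted above.

```python
# Refresh time limits (in hours) as quoted in the post above.
SHARED_LIMIT_HOURS = 2.0
DEDICATED_LIMIT_HOURS = 5.0


def recommend_connection(estimated_refresh_hours: float,
                         dedicated_capacity: bool = False) -> str:
    """Hypothetical helper: suggest 'dataflow' or 'direct'.

    estimated_refresh_hours is your own estimate of how long a direct
    dataset refresh (including all transformations) would take.
    """
    limit = DEDICATED_LIMIT_HOURS if dedicated_capacity else SHARED_LIMIT_HOURS
    if estimated_refresh_hours > limit:
        # The refresh would time out, so move the ETL into a dataflow.
        return "dataflow"
    # The refresh fits within the limit, so keep it simple and connect directly.
    return "direct"


# A small set of "final" tables that refreshes in a few minutes.
print(recommend_connection(0.1))
```

For a small dataset whose tables are already final, as described in the question, the estimate is far below either limit, so the sketch lands on the direct connection.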

 

For more information, see:
Connect to data created by Power Platform dataflows in Power BI Desktop - Power BI | Microsoft Learn

 

Thank you for your time and for sharing, and thank you for your support of Power BI!

 

Best Regards,

Aniya Zhang


