Join us at FabCon Atlanta from March 16 - 20, 2026, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.
Register now!The Power BI Data Visualization World Championships is back! It's time to submit your entry. Live now!
Hi everyone,
I’ve previously worked with Azure + Power BI architecture, but now I need to design a solution using AWS + Power BI instead.
Can anyone share:
Recommended architecture patterns
Best practices
Documentation or guides
Or any real-world experience using AWS services as data sources for Power BI?
Solved! Go to Solution.
Hi @AnkitaaMishra ,
It is always exciting to bridge the gap between two clouds! @wardy912 shared some excellent official documentation, but coming from an Azure background, there are a few architectural "gotchas" you should be aware of when designing for AWS.
Here are the key patterns I have seen in the field:
1. The Connectivity Challenge (VPC vs. Public) Unlike Azure SQL where "Allow Azure Services" is a simple checkbox, AWS resources (Redshift, RDS, Athena) are often locked inside a private VPC.
The Architecture: You will likely need to deploy an On-Premises Data Gateway onto an EC2 instance inside that same VPC (or peered VPC). Power BI Service treats private AWS resources just like "On-Prem" data.
Performance: Make sure that EC2 instance has enough RAM if you are doing heavy mashups in Power Query, as the data has to pass through that node.
2. Amazon Athena (The Serverless Trap) Athena is fantastic for querying S3 data directly, similar to Serverless SQL Pool in Synapse.
Cost Warning: If you use DirectQuery with Athena, every visual interaction scans data on S3 and you get billed by AWS for data scanned. It can get expensive quickly.
Best Practice: Prefer Import Mode for Athena unless you need real-time data, or enforce strict partition filtering in your SQL queries to minimize scanning costs.
3. Amazon Redshift This is the closest experience to Azure Synapse. The native Redshift connector is mature and supports SSO (Single Sign-On) via Microsoft Entra ID (formerly Azure AD) if configured correctly on the AWS side. This is great for Row-Level Security (RLS).
In summary, plan your Gateway Strategy first. That is usually the piece that surprises Azure architects the most when moving to AWS data sources.
Hope this helps with your design!
If my response resolved your query, kindly mark it as the Accepted Solution to assist others. Additionally, I would be grateful for a 'Kudos' if you found my response helpful.
This response was assisted by AI for translation and formatting purposes.
Hi @AnkitaaMishra ,
It is always exciting to bridge the gap between two clouds! @wardy912 shared some excellent official documentation, but coming from an Azure background, there are a few architectural "gotchas" you should be aware of when designing for AWS.
Here are the key patterns I have seen in the field:
1. The Connectivity Challenge (VPC vs. Public) Unlike Azure SQL where "Allow Azure Services" is a simple checkbox, AWS resources (Redshift, RDS, Athena) are often locked inside a private VPC.
The Architecture: You will likely need to deploy an On-Premises Data Gateway onto an EC2 instance inside that same VPC (or peered VPC). Power BI Service treats private AWS resources just like "On-Prem" data.
Performance: Make sure that EC2 instance has enough RAM if you are doing heavy mashups in Power Query, as the data has to pass through that node.
2. Amazon Athena (The Serverless Trap) Athena is fantastic for querying S3 data directly, similar to Serverless SQL Pool in Synapse.
Cost Warning: If you use DirectQuery with Athena, every visual interaction scans data on S3 and you get billed by AWS for data scanned. It can get expensive quickly.
Best Practice: Prefer Import Mode for Athena unless you need real-time data, or enforce strict partition filtering in your SQL queries to minimize scanning costs.
3. Amazon Redshift This is the closest experience to Azure Synapse. The native Redshift connector is mature and supports SSO (Single Sign-On) via Microsoft Entra ID (formerly Azure AD) if configured correctly on the AWS side. This is great for Row-Level Security (RLS).
In summary, plan your Gateway Strategy first. That is usually the piece that surprises Azure architects the most when moving to AWS data sources.
Hope this helps with your design!
If my response resolved your query, kindly mark it as the Accepted Solution to assist others. Additionally, I would be grateful for a 'Kudos' if you found my response helpful.
This response was assisted by AI for translation and formatting purposes.
Working with Power BI on AWS is becoming more common, especially for teams that are moving toward multi-cloud setups. Even though Power BI naturally integrates with Azure, AWS still provides solid, reliable ways to store, process, and deliver data. The main goal is to design an architecture that fits your data size, refresh needs, cost limits, and security requirements.
A common pattern is to use AWS as your main data layer and then connect Power BI to clean and ready-to-use datasets. Popular choices include Amazon Redshift for large analytics workloads, Amazon Athena + S3 for flexible and low-cost querying, and RDS/Aurora when you want a managed relational database. Power BI already has a built-in connector for Redshift, and you can connect to Athena using the AWS ODBC driver.
For best results, teams usually store raw and processed data in Amazon S3, use AWS Glue for ETL/cleaning, and query the data using Athena or Redshift Spectrum. Security is important, so it’s recommended to use IAM roles, VPC connections, encryption (KMS), and avoid exposing databases publicly. Inside Power BI, features like incremental refresh, view-based modeling, and good data modeling practices help keep refresh times fast.
From real-world experience, Redshift + Power BI Import mode gives the fastest dashboard performance. Meanwhile, Athena + S3 is very cost-efficient, especially for teams working with large, semi-structured, or infrequent datasets. Some organizations also use AWS DMS or AWS Glue to move data into a more Power BI-friendly layer when they need heavy DirectQuery workloads.
Here are some helpful links to get started:
AWS + Power BI documentation
Redshift + Power BI:
https://docs.aws.amazon.com/redshift/latest/mgmt/power-bi-integration.html
Athena ODBC driver (for Power BI):
https://docs.aws.amazon.com/athena/latest/ug/connect-with-odbc.html
AWS Analytics Reference Architecture:
https://docs.aws.amazon.com/whitepapers/latest/aws-analytics-reference-architecture
General AWS data guidance
AWS Glue: https://docs.aws.amazon.com/glue/latest/dg/what-is-glue.html
S3 data lake best practices: https://docs.aws.amazon.com/prescriptive-guidance/latest/patterns/data-lake-on-aws.html
Here are some useful docs:
Use the Amazon Athena Power BI connector - Amazon Athena
Power Query Amazon Athena connector - Power Query | Microsoft Learn
Power Query Amazon Redshift connector - Power Query | Microsoft Learn
Building a Data Platform on AWS: Essential Design Considerations for Power BI - DEV Community
--------------------------------
I hope this helps, please give kudos and mark as solved if it does!
Connect with me on LinkedIn.
Subscribe to my YouTube channel for Fabric/Power Platform related content!
The Power BI Data Visualization World Championships is back! It's time to submit your entry.
| User | Count |
|---|---|
| 50 | |
| 43 | |
| 36 | |
| 33 | |
| 30 |
| User | Count |
|---|---|
| 138 | |
| 125 | |
| 60 | |
| 59 | |
| 56 |