Anonymous
Not applicable

Wrong encoding when reading from Azure Synapse View on top of Parquet files

Hi, 

 

I have a view in Azure Synapse on top of my parquet files, I've created it like this:

CREATE VIEW [dbo].[v_tmp] AS (
    SELECT * FROM
    OPENROWSET(
        BULK 'https://***.dfs.core.windows.net/dwh/tmp/*.parquet',
        FORMAT = 'PARQUET'
    ) AS [result])
When I query this view in Synapse, everything goes as expected. 
But when I read from it in Power BI, the UTF-8 text is decoded with the wrong encoding. E.g. "Houzé" becomes "HouzÃ©".
I already tried:
  • Setting the database in Synapse to UTF-8, as described here.
  • Converting the imported data column to UTF-8 using textEncoding.

 

Does anyone here have other ideas I could try to overcome this issue?

 

Thanks a lot!

1 ACCEPTED SOLUTION
Anonymous
Not applicable

After all, I found the solution: you should indeed change the collation of the database using the query below. But I also needed to recreate the views on top of it. Probably something was cached. I now have correctly encoded data in Power BI.

ALTER DATABASE datawarehouse 
    COLLATE Latin1_General_100_BIN2_UTF8;
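
The full sequence described in this post could look roughly like the sketch below. It reuses the database name, view name, and (elided) storage URL from the thread. The collation check against sys.databases and the GO batch separators are additions for illustration, not part of the original post, and whether the view really has to be dropped and recreated may depend on the environment.

```sql
-- Sketch only: names and the elided URL are copied from the thread.

-- 1. Check the current default collation of the database (added for illustration).
SELECT name, collation_name
FROM sys.databases
WHERE name = 'datawarehouse';
GO

-- 2. Switch the database default to a UTF-8 collation, as in the post.
ALTER DATABASE datawarehouse
    COLLATE Latin1_General_100_BIN2_UTF8;
GO

-- 3. Drop and recreate the view; the author reports this was needed
--    before Power BI showed the correct characters.
DROP VIEW [dbo].[v_tmp];
GO

CREATE VIEW [dbo].[v_tmp] AS
SELECT *
FROM OPENROWSET(
    BULK 'https://***.dfs.core.windows.net/dwh/tmp/*.parquet',
    FORMAT = 'PARQUET'
) AS [result];
GO
```

After recreating the view, refreshing the dataset in Power BI should show whether values such as "Houzé" now come through intact.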
 
Thanks for all of your input!


5 REPLIES
v-deddai1-msft
Community Support

Hi @Anonymous ,

 

Would you please try to get the data directly from 'https://***.dfs.core.windows.net/dwh/tmp/*.parquet' instead of using the view from Azure Synapse?

 

If this post helps, then please consider accepting it as the solution to help other members find it more quickly.

 

Best Regards,

Dedmon Dai

Anonymous
Not applicable

Thanks for your answer!

This does indeed work, but it is not what I want, because I would like to keep my Power BI clean and use the serverless pools to connect to.

lbendlin
Super User

What happens when you use the query as your data source (rather than the view)?

Anonymous
Not applicable

Hi, 

 

When I query the parquet file directly from within Power BI, the encoding is correct. So the error must be somewhere in Synapse.

 

Any idea what else I can test?

 

Thanks a lot for your input!
