Anonymous
Not applicable

How to extract data from a new PDF file each week and append that data to a dataset?

Hi everyone, 

 

I want to automate a price update that I make weekly. Every week I receive 5 PDFs (with a similar layout each week) containing that week's latest prices, and at the moment I type the latest prices into a document by hand. Is there any way I can upload the PDFs, extract the data, and append that data to a dataset instead?

 

Thank you in advance.

 

Sincerely, Signe

1 ACCEPTED SOLUTION
v-yanjiang-msft
Community Support

Hi @Anonymous ,

Based on your description, this should be possible. Here is an example that illustrates the steps.

1. Connect to the PDF in Power BI Desktop through Get Data > PDF.

2. Once connected, the PDF content is presented as a table.

3. You can now reference values in the table. For example, to get the Sales value of Category D, use an expression like this:

Table.SelectRows(#"Page001",each [Category]="D")[Sales]{0}

Result: the expression returns the Sales value for Category D (4 in this sample).

I have attached my sample below for your reference.
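
For readers without the attachment, the three steps above can be written as one query. This is only a minimal sketch: the file path is hypothetical, the "Page001" Id is taken from the example and may differ for your PDF, and the header-promotion step assumes the first extracted row holds the column names (skip it if Power BI has already promoted them).

```m
let
    // Hypothetical path; replace it with the location of your own PDF.
    Source = Pdf.Tables(File.Contents("C:\Prices\sample.pdf")),
    // Pdf.Tables returns a navigation table with Id, Name, Kind and Data columns.
    // "Page001" is the Id used in the example above; check the Navigator for yours.
    Page001 = Source{[Id = "Page001"]}[Data],
    // Assumption: the first row contains the headers (Category, Sales).
    Promoted = Table.PromoteHeaders(Page001, [PromoteAllScalars = true]),
    // Keep the row for Category D and return the first Sales value found.
    SalesD = Table.SelectRows(Promoted, each [Category] = "D")[Sales]{0}
in
    SalesD
```

The trailing {0} picks the first matching row, so the expression raises an error if no row has Category "D".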

 

Best Regards,
Community Support Team _ kalyj

If this post helps, please consider accepting it as the solution to help other members find it more quickly.


3 REPLIES
Anonymous
Not applicable

Dear @v-yanjiang-msft 

Thank you for the answer, and sorry for the late reply; I had not seen the reply e-mails.

I think your solution is correct given how I worded my question, but I need a bit more help to solve my problem.

Building on your example, I don't just need the 4. I need the entire Sales column from Page001 transposed into a single row in the query table, with a date attached to that row, like this:

Date        A  B  C  D
2022-12-01  1  2  3  4
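
One possible way to get that shape is Table.Pivot, sketched below under some assumptions: the inline #table is only a stand-in for the Page001 table from the PDF, and the date is hard-coded because the thread does not say where it should come from.

```m
let
    // Stand-in for the table extracted from the PDF (Category A-D, Sales 1-4).
    Page001 = #table(
        type table [Category = text, Sales = number],
        {{"A", 1}, {"B", 2}, {"C", 3}, {"D", 4}}
    ),
    // Turn each Category value into its own column holding that category's Sales.
    Pivoted = Table.Pivot(Page001, List.Distinct(Page001[Category]), "Category", "Sales"),
    // Attach the week's date; hard-coded here purely as an assumption.
    WithDate = Table.AddColumn(Pivoted, "Date", each #date(2022, 12, 1), type date),
    // Move Date to the front so the columns read Date, A, B, C, D.
    Result = Table.ReorderColumns(WithDate, {"Date", "A", "B", "C", "D"})
in
    Result
```

Because the stand-in table has only the Category and Sales columns, the pivot collapses it into a single row. If the real table has extra columns, they would need to be removed first or the result will contain more than one row.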


I then need to automate the process: I will receive a new PDF each week with new 'sales' data, and each week's data should become a new row in the query table, like this:

Date        A  B  C  D
2022-12-01  1  2  3  4
2022-12-08 1.523.255


Can you help me with this?
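
The thread has no reply to this follow-up, so what follows is only a sketch of one common approach: save every weekly PDF into one folder and let a single query turn each file into a row. The folder path, the "Page001" Id, the Category/Sales column names and the use of the file's modified date as the price date are all assumptions to adapt.

```m
let
    // Hypothetical folder that receives the weekly PDFs.
    Files = Folder.Files("C:\Prices\Weekly"),
    PdfFiles = Table.SelectRows(Files, each Text.Lower([Extension]) = ".pdf"),
    // Convert one PDF into a single row: one column per category plus a Date column.
    ToRow = (content as binary, fileDate as date) as table =>
        let
            // Assumption: the prices sit in the table with Id "Page001" in every file.
            Page = Pdf.Tables(content){[Id = "Page001"]}[Data],
            Promoted = Table.PromoteHeaders(Page, [PromoteAllScalars = true]),
            Slim = Table.SelectColumns(Promoted, {"Category", "Sales"}),
            Pivoted = Table.Pivot(Slim, List.Distinct(Slim[Category]), "Category", "Sales")
        in
            Table.AddColumn(Pivoted, "Date", each fileDate, type date),
    // Assumption: the file's modified date stands in for the week's date.
    Rows = Table.AddColumn(PdfFiles, "Row", each ToRow([Content], Date.From([Date modified]))),
    // Stack the one-row tables and order them by date.
    Combined = Table.Combine(Rows[Row]),
    Sorted = Table.Sort(Combined, {{"Date", Order.Ascending}})
in
    Sorted
```

With this setup, dropping a new PDF into the folder and refreshing the dataset adds that file's row. Since five PDFs arrive each week, rows sharing a date may need to be merged or tagged with the file name, depending on how the prices are split across the files.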

v-yanjiang-msft
Community Support

Hi @Anonymous ,

Has your problem been solved? If so, would you mind accepting the helpful replies as solutions so that we can close the thread? Other people with the same requirement will then be able to find the solution more quickly. Thank you!

Best Regards,
Community Support Team _ kalyj
