Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

Get Fabric Certified for FREE during Fabric Data Days. Don't miss your chance! Learn more

Reply
Anonymous
Not applicable

How to extract data from a new pdf file (weekly) and append that data to a dataset?

Hi everyone, 

 

I want to automate a price update that I make weekly. Every week I received 5 pdfs (similar look every week) with this week latest prices and now I manually type the latest prices into a document. Is there anyway that I can upload the pdf, extract the data and append that data to a dataset instead?

 

Thank you in advance.

 

Sincerely, Signe

1 ACCEPTED SOLUTION
v-yanjiang-msft
Community Support
Community Support

Hi @Anonymous ,

According to your description, I think you can do it. I give an example to illustrate the steps.

1.Connect the PDF in Power BI Desktop through Get Data>PDF.

vkalyjmsft_0-1668062824860.png

2.It is presented in table format when connected.

vkalyjmsft_1-1668062874448.png

vkalyjmsft_2-1668063436306.png

3.Now you can reference values in the table. For example you want to get the Sales of Category D, create code like this:

Table.SelectRows(#"Page001",each [Category]="D")[Sales]{0}

Result:

vkalyjmsft_3-1668063691553.png

I attach my sample below for your reference.

 

Best Regards,
Community Support Team _ kalyj

If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.

View solution in original post

3 REPLIES 3
Anonymous
Not applicable

Dear @v-yanjiang-msft 

Thank you for the answer and sorry for the late reply, I had not seen the reply e-mails. 

I think your solution is correct given the formulation of my question, but I need bit more help in order to solve my problem. 

So building on to your example, I don't just need the 4. I need the entire sales column from Page001 to be transposed into a row in the Query table and then with a date attached to the same row. So like this:

Date ABCD
2022-12-01 1234


And then I need to automate the process as I will receive a new pdf each week with new 'sales' data and will then need a new row in the query table with that week's data and so on. So like this:

Date ABCD
2022-12-01 1234
2022-12-08 1.523.255


Can you help me with this?

v-yanjiang-msft
Community Support
Community Support

Hi @Anonymous ,

Is your problem solved? If so, would you mind accept the helpful replies as solutions? Then we are able to close the thread. More people who have the same requirment will find the solution quickly and benefit here, thank you!

Best Regards,
Community Support Team _ kalyj

v-yanjiang-msft
Community Support
Community Support

Hi @Anonymous ,

According to your description, I think you can do it. I give an example to illustrate the steps.

1.Connect the PDF in Power BI Desktop through Get Data>PDF.

vkalyjmsft_0-1668062824860.png

2.It is presented in table format when connected.

vkalyjmsft_1-1668062874448.png

vkalyjmsft_2-1668063436306.png

3.Now you can reference values in the table. For example you want to get the Sales of Category D, create code like this:

Table.SelectRows(#"Page001",each [Category]="D")[Sales]{0}

Result:

vkalyjmsft_3-1668063691553.png

I attach my sample below for your reference.

 

Best Regards,
Community Support Team _ kalyj

If this post helps, then please consider Accept it as the solution to help the other members find it more quickly.

Helpful resources

Announcements
Fabric Data Days Carousel

Fabric Data Days

Advance your Data & AI career with 50 days of live learning, contests, hands-on challenges, study groups & certifications and more!

October Power BI Update Carousel

Power BI Monthly Update - October 2025

Check out the October 2025 Power BI update to learn about new features.

FabCon Atlanta 2026 carousel

FabCon Atlanta 2026

Join us at FabCon Atlanta, March 16-20, for the ultimate Fabric, Power BI, AI and SQL community-led event. Save $200 with code FABCOMM.

Top Kudoed Authors