Skip to main content
cancel
Showing results for 
Search instead for 
Did you mean: 

To celebrate FabCon Vienna, we are offering 50% off select exams. Ends October 3rd. Request your discount now.

Reply
soumyaiyer
Helper III
Helper III

Scraping of Data from Multiple Web Pages with no page # in the URL

Hello,

 

I am trying to extract all the pages of data from the URL (https://apps2.xxxxxxxx.net)

 

I am able to bring in the first 25 with web connector and extract HTML but I need all the pages. The URL does not change as you move through the pages on the site.

 

The advanced editor requires page number, but the URL I have does not show page numbers. The webpage has only the arrow option to view the next page.

 

Screenshot 2025-04-29 151928.pngAny help would be appreciated.

Thanks

 

2 ACCEPTED SOLUTIONS
MarkLaf
Super User
Super User

Hit F12 in your browser, this brings up web dev tools in most browsers. You can then inspect the html and other components of the page for clues as to how to grab the underlying data.

 

Your best bet, I'd guess, would be to look at the network traffic and look out for http api calls when you click the Next button. You may have to inspect the headers and body in the case of POSTs.

 

There is a chance that it's just impossible with straight querying, depending on how it's built. You may have to look into rpa with Power Automate or some other product.

View solution in original post

I don't believe so. In case the restrictions are due to security/whatever and not cost or complexity, you could run it off of a virtual machine: https://learn.microsoft.com/en-us/power-automate/desktop-flows/hosted-rpa-overview

 

It may be worth asking in the Power Automate forums if that's the route you are thinking of going. You're more likely to find folks with the experience to speak to the potential options/considerations.

View solution in original post

4 REPLIES 4
soumyaiyer
Helper III
Helper III

Below is what I see in the query advanced editor, this helps only to view the first 1-25

 

let
Source = Web.BrowserContents("https://apps2.xxxx.net/customerservice/campaign/?tag=xxx_inquiries"),
#"Extracted Table From Html" = Html.Table(Source, {{"Column1", "DIV.gridless > TABLE > * > TR > :nth-child(1)"}, {"Column2", "DIV.gridless > TABLE > * > TR > :nth-child(2)"}, {"Column3", "DIV.gridless > TABLE > * > TR > :nth-child(3)"}, {"Column4", "DIV.gridless > TABLE > * > TR > :nth-child(4)"}, {"Column5", "DIV.gridless > TABLE > * > TR > :nth-child(5)"}, {"Column6", "DIV.gridless > TABLE > * > TR > :nth-child(6)"}, {"Column7", "DIV.gridless > TABLE > * > TR > :nth-child(7)"}}, [RowSelector="DIV.gridless > TABLE > * > TR"]),
#"Promoted Headers" = Table.PromoteHeaders(#"Extracted Table From Html", [PromoteAllScalars=true]),
#"Changed Type" = Table.TransformColumnTypes(#"Promoted Headers",{{"", type text}, {"ID", Int64.Type}, {"Request Type", type text}, {"Customers", type text}, {"Accounts", Int64.Type}, {"Agent", type text}, {"Queued Since", type datetime}}),
#"Removed Columns" = Table.RemoveColumns(#"Changed Type",{""}),
#"Changed Type1" = Table.TransformColumnTypes(#"Removed Columns",{{"ID", type text}})
in
#"Changed Type1"

MarkLaf
Super User
Super User

Hit F12 in your browser, this brings up web dev tools in most browsers. You can then inspect the html and other components of the page for clues as to how to grab the underlying data.

 

Your best bet, I'd guess, would be to look at the network traffic and look out for http api calls when you click the Next button. You may have to inspect the headers and body in the case of POSTs.

 

There is a chance that it's just impossible with straight querying, depending on how it's built. You may have to look into rpa with Power Automate or some other product.

Tried the Power Automate route the  company has restrictions in providing access to the desktop version . Can I web scrap in the Power Automate cloud version?

I don't believe so. In case the restrictions are due to security/whatever and not cost or complexity, you could run it off of a virtual machine: https://learn.microsoft.com/en-us/power-automate/desktop-flows/hosted-rpa-overview

 

It may be worth asking in the Power Automate forums if that's the route you are thinking of going. You're more likely to find folks with the experience to speak to the potential options/considerations.

Helpful resources

Announcements
September Power BI Update Carousel

Power BI Monthly Update - September 2025

Check out the September 2025 Power BI update to learn about new features.

August 2025 community update carousel

Fabric Community Update - August 2025

Find out what's new and trending in the Fabric community.

Top Solution Authors
Top Kudoed Authors