The ultimate Microsoft Fabric, Power BI, Azure AI, and SQL learning event: Join us in Stockholm, September 24-27, 2024.
Save €200 with code MSCUST on top of early bird pricing!
Find everything you need to get certified on Fabric—skills challenges, live sessions, exam prep, role guidance, and more. Get started
Hi, I am new to power query and I'm trying to figure out how I can limit the data that I'm extracting from the source because I only need a couple of the columns. For example, I'm running a web query to an api and it returns the list of records and each record has a timestamp, loglevel, hostid, text and then 22 custom fields that is extracted from the text field. I'm just interested in grabbing the data from a couple of the custom fields. Currently it downloads all the data for record and then I have it remove the extra columns. The issue that I'm facing is because all of the data is downloaded it takes a very long time to complete. Is there a way to filter out via the source query instead of downloading all the data and removing the other columns?
Thank you very much for your time and assistance.
Hey pezwi,
I have found a way to directly pull the needed column. I use a OData.feed source. you need to change your table name and the name of the columns you want to add. Then end the command with "as table". See my example code below.
let
Source = OData.Feed("your link", null, [Implementation="2.0"]),
Table = Table.SelectColumns(Source{[Name="Your table name", Signature="table"]}[Data], {"Column 1", "Column 2", "Column 3", "Column 4", "Column 5"}) as table
in
Table
In this way you avoid the extra steps to remove the columns. However, I don't know if this is the fastest way to load data. I have to run some test and see if there is any difference between the run times.
Hi, @DnsLeu.
I tested your suggestion, and it worked very well to bring only the necessary columns. It is the equivalent of the query's "$select". However, what would the other options look like, such as "$filter" or "$orderby"?
it it possible to add all of those, but they need to be added *before* the column selection. the code that i wrote above serves as the *core* of your data source. so you will need to have that as the origin. I will give an example:
let
// Define source
Source = OData.Feed("your link", null, [Implementation="2.0"]),
// What to select
Table =
// Sort by
Table.Sort(
// Select rows
Table.SelectRows(
// Select columns
Table.SelectColumns(Source{[Name="Your table name", Signature="table"]}[Data],
{"Column 1", "Column 2", "Column 3", "Column 4", "Column 5"}
),
each [Column 1] = "X" and [Column 2] = "Y"
),
{{"Column 1", Order.Ascending}}
)
as table
in
Table
I highlighted the source in bold. All of these table.(select) functions require a table as the first argument. this table is given by the highlighted part in the code. this is what i mean as the *origin*. In this way, you select the source with the columns you want, and then apply a cascade of filtering, transformations, etc, without having to repeat the source for each table. function. If you want to add another transformation, you would add it before the Table.Sort. Might not be the most logical way to do it, but it works just fine in my case.
Hi @pezwi ,
You could add "Query" parameter in Web.Contents() as a filter. Here is a blog for your reference.
@pezwi - Well, if this was a SQL source you could use a View or write a SQL statement but since it is a Web API you are going to be limited to the functionality of that Web API. For example, I believe ODATA allows you to specify columns/fields but not sure about just any old Web API.
I'll have to look into the Odata option. I'm not finding a lot of examples for that.
Currently I'm pulling the data into a list which is capturing all the records. It would be great if I could do loop through the records but I'm having trouble figuring that out.
Source = List.Generate(
() => getPage(""),
each _[scrollToken] <> null,
each getPage(_[scrollToken]),
each [scrollToken = _[scrollToken], records = _[records]]
),
Here is an article I wrote last year that may help you get started with using OData. It includes examples of adding $select and other ways to let you limit the initial amount of data returned.
https://powerpivotpro.com/2019/02/some-odata-tips/
If this works for you, please mark it as the solution. Kudos are appreciated too. Please let me know if not.
Regards,
Pat
To learn more about Power BI, follow me on Twitter or subscribe on YouTube.
@pezwi , Right click on Table and Advance Query and see if there is a list of columns for selection. if yes, then try to remove some of them and try
Join the community in Stockholm for expert Microsoft Fabric learning including a very exciting keynote from Arun Ulag, Corporate Vice President, Azure Data.
Check out the August 2024 Power BI update to learn about new features.
User | Count |
---|---|
122 | |
89 | |
75 | |
55 | |
45 |
User | Count |
---|---|
135 | |
121 | |
77 | |
65 | |
64 |