<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Re: Seeking partition strategy in Data Engineering</title>
    <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4090093#M3496</link>
    <description>&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/fabric/data-engineering/delta-optimization-and-v-order?tabs=sparksql" target="_blank" rel="noopener"&gt;Delta Lake table optimization and V-Order - Microsoft Fabric | Microsoft Learn&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://docs.delta.io/latest/best-practices.html" target="_blank" rel="noopener"&gt;Best practices — Delta Lake Documentation&amp;nbsp;&amp;nbsp;&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;DIV id="choose-the-right-partition-column" class=""&gt;
&lt;H2&gt;Choose the right partition column&lt;/H2&gt;
&lt;P&gt;You can partition a Delta table by a column. The most commonly used partition column is&lt;SPAN&gt;&amp;nbsp;&lt;CODE class=""&gt;&lt;SPAN class=""&gt;date. Follow these two rules of thumb for deciding on what column to partition by:&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL class=""&gt;
&lt;LI&gt;
&lt;P&gt;If the cardinality of a column will be very high, do not use that column for partitioning. For example, if you partition by a column&lt;SPAN&gt;&amp;nbsp;&lt;CODE class=""&gt;&lt;SPAN class=""&gt;userId&lt;SPAN&gt;&amp;nbsp;and if there can be 1M distinct user IDs, then that is a bad partitioning strategy.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;P&gt;Amount of data in each partition: You can partition by a column if you expect data in that partition to be at least 1 GB.&lt;/P&gt;
&lt;DIV id="compact-files" class=""&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;CODE class=""&gt;&lt;/CODE&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;CODE class=""&gt;&lt;/CODE&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/DIV&gt;</description>
    <pubDate>Thu, 08 Aug 2024 21:53:46 GMT</pubDate>
    <dc:creator>lbendlin</dc:creator>
    <dc:date>2024-08-08T21:53:46Z</dc:date>
    <item>
      <title>Seeking partition strategy</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4089978#M3493</link>
      <description>&lt;P&gt;I am in early phase of ingestion-building and delta table creation. I know that having partition even increases the performance. Therefore, I am seeking a partitioning strategy on the delta tables.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Should I do it on the column with highest/lowest cardinality?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Highest cardinality - more partitions. But at what cost?&lt;/P&gt;
&lt;P&gt;Lowest cardinality - lesser partitions&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Also, I am ingesting data incrementally to managed delta tables. Hence, how does the code handle the new partitions from new data coming everyday?&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Thu, 08 Aug 2024 19:40:33 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4089978#M3493</guid>
      <dc:creator>smpa01</dc:creator>
      <dc:date>2024-08-08T19:40:33Z</dc:date>
    </item>
    <item>
      <title>Re: Seeking partition strategy</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4090027#M3494</link>
      <description>&lt;P&gt;I found this article quite helpful&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;A href="https://www.linkedin.com/pulse/50-shades-direct-lake-everything-you-need-know-new-power-nikola-ilic-we2if/" target="_blank"&gt;https://www.linkedin.com/pulse/50-shades-direct-lake-everything-you-need-know-new-power-nikola-ilic-we2if/&lt;/A&gt;&lt;/SPAN&gt;&lt;/P&gt;</description>
      <pubDate>Thu, 08 Aug 2024 20:37:48 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4090027#M3494</guid>
      <dc:creator>lbendlin</dc:creator>
      <dc:date>2024-08-08T20:37:48Z</dc:date>
    </item>
    <item>
      <title>Re: Seeking partition strategy</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4090085#M3495</link>
      <description>&lt;P&gt;This article is not relatable to this question.&lt;/P&gt;</description>
      <pubDate>Thu, 08 Aug 2024 21:50:03 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4090085#M3495</guid>
      <dc:creator>smpa01</dc:creator>
      <dc:date>2024-08-08T21:50:03Z</dc:date>
    </item>
    <item>
      <title>Re: Seeking partition strategy</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4090093#M3496</link>
      <description>&lt;P&gt;&lt;A href="https://learn.microsoft.com/en-us/fabric/data-engineering/delta-optimization-and-v-order?tabs=sparksql" target="_blank" rel="noopener"&gt;Delta Lake table optimization and V-Order - Microsoft Fabric | Microsoft Learn&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&lt;A href="https://docs.delta.io/latest/best-practices.html" target="_blank" rel="noopener"&gt;Best practices — Delta Lake Documentation&amp;nbsp;&amp;nbsp;&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;DIV id="choose-the-right-partition-column" class=""&gt;
&lt;H2&gt;Choose the right partition column&lt;/H2&gt;
&lt;P&gt;You can partition a Delta table by a column. The most commonly used partition column is&lt;SPAN&gt;&amp;nbsp;&lt;CODE class=""&gt;&lt;SPAN class=""&gt;date. Follow these two rules of thumb for deciding on what column to partition by:&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;UL class=""&gt;
&lt;LI&gt;
&lt;P&gt;If the cardinality of a column will be very high, do not use that column for partitioning. For example, if you partition by a column&lt;SPAN&gt;&amp;nbsp;&lt;CODE class=""&gt;&lt;SPAN class=""&gt;userId&lt;SPAN&gt;&amp;nbsp;and if there can be 1M distinct user IDs, then that is a bad partitioning strategy.&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/CODE&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;P&gt;Amount of data in each partition: You can partition by a column if you expect data in that partition to be at least 1 GB.&lt;/P&gt;
&lt;DIV id="compact-files" class=""&gt;&amp;nbsp;&lt;/DIV&gt;
&lt;/LI&gt;
&lt;LI&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;CODE class=""&gt;&lt;/CODE&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/LI&gt;
&lt;/UL&gt;
&lt;P&gt;&lt;SPAN&gt;&lt;CODE class=""&gt;&lt;/CODE&gt;&lt;/SPAN&gt;&lt;/P&gt;
&lt;/DIV&gt;</description>
      <pubDate>Thu, 08 Aug 2024 21:53:46 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4090093#M3496</guid>
      <dc:creator>lbendlin</dc:creator>
      <dc:date>2024-08-08T21:53:46Z</dc:date>
    </item>
    <item>
      <title>Re: Seeking partition strategy</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4090468#M3499</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/24978"&gt;@smpa01&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for the reply from lbendlin&amp;nbsp;.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Partitioning by columns with a low &lt;SPAN&gt;cardinality&amp;nbsp;&lt;/SPAN&gt;is often recommended.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;High &lt;SPAN&gt;cardinality&amp;nbsp;&lt;/SPAN&gt;: more partitions result in better parallelism but at the cost of increased overhead of managing many small files and the performance degradation of too many small partitions.&lt;/P&gt;
&lt;P&gt;&lt;BR /&gt;&lt;FONT&gt;Low &lt;SPAN&gt;cardinality&amp;nbsp;&lt;/SPAN&gt;: fewer partitions are easier to manage.&lt;/FONT&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Here are some tests I've done on partitions that you can refer to. Here's my data:&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="vhuijieymsft_0-1723170928572.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1147679i3A7ACD4AB0A210CA/image-size/medium?v=v2&amp;amp;px=400" role="button" title="vhuijieymsft_0-1723170928572.png" alt="vhuijieymsft_0-1723170928572.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Use the following code to add a new cell; save the DataFrame and partition the data by Year and Month:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;orders_df.write.partitionBy("Year", "Month").mode("overwrite").parquet("Files/partitioned_data")
print ("Transformed data saved!")&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;STRONG&gt;&amp;nbsp;&lt;/STRONG&gt;&lt;/P&gt;
&lt;P&gt;Check the Files folder to see if the partition folder was created successfully.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="vhuijieymsft_1-1723170928575.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1147678i562F1124B1E7F796/image-size/medium?v=v2&amp;amp;px=400" role="button" title="vhuijieymsft_1-1723170928575.png" alt="vhuijieymsft_1-1723170928575.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Use the following code to add a new cell to load a new data frame from the orders.parquet file:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;orders_2021_df = spark.read.format("parquet").load("Files/partitioned_data/Year=2021/Month=*")
display(orders_2021_df)&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="vhuijieymsft_2-1723170969378.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1147682iFD43D8492A84F985/image-size/medium?v=v2&amp;amp;px=400" role="button" title="vhuijieymsft_2-1723170969378.png" alt="vhuijieymsft_2-1723170969378.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Note that the partitioned columns specified in the paths (Year and Month) are not included in the DataFrame.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;When you append new data to a Delta table, Delta Lake automatically creates a new partition based on the specified partition column. If the partition already exists, the data is appended to the existing partition.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;More information on partitioning can be found in the following documentation:&lt;/P&gt;
&lt;P&gt;&lt;A href="https://delta.io/blog/2023-01-18-add-remove-partition-delta-lake/" target="_blank" rel="noopener"&gt;Adding and Deleting Partitions in Delta Lake tables | Delta Lake&lt;/A&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you have any other questions please feel free to contact me.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Best Regards,&lt;BR /&gt;Yang&lt;BR /&gt;Community Support Team&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If there is any post&amp;nbsp;&lt;STRONG&gt;&lt;EM&gt;helps&lt;/EM&gt;&lt;/STRONG&gt;, then please consider&amp;nbsp;&lt;STRONG&gt;&lt;EM&gt;Accept it as the solution&lt;/EM&gt;&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp;to help the other members find it more quickly.&lt;BR /&gt;If I misunderstand your needs or you still have problems on it, please feel free to let us know.&amp;nbsp;&lt;STRONG&gt;&lt;EM&gt;Thanks a lot!&lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Fri, 09 Aug 2024 02:43:23 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4090468#M3499</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-08-09T02:43:23Z</dc:date>
    </item>
    <item>
      <title>Re: Seeking partition strategy</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4092512#M3521</link>
      <description>&lt;P&gt;&amp;nbsp; thanks I have managed to create partitons on maged delta tables using both SQL and Delta table API.&amp;nbsp;&lt;/P&gt;
&lt;P&gt;But I see that you are using Dataframe API. Can you use partition by in this API when saveAsTable to Delta to create partitions within table (as SQL / Delta table API would do)&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;DIV style="color: #cccccc; background-color: #1f1f1f; font-family: Consolas, 'Courier New', monospace; font-weight: normal; font-size: 14px; line-height: 19px; white-space: pre;"&gt;
&lt;DIV&gt;&lt;SPAN&gt;&lt;SPAN&gt;df&lt;SPAN&gt;&lt;SPAN&gt;.write.&lt;SPAN class=""&gt;partitionBy&lt;SPAN class=""&gt;('emp_id&lt;SPAN class=""&gt;'&lt;SPAN class=""&gt;)&lt;SPAN class=""&gt;.&lt;SPAN class=""&gt;mode&lt;SPAN class=""&gt;('append&lt;SPAN class=""&gt;'&lt;SPAN class=""&gt;).&lt;SPAN&gt;format(&lt;SPAN&gt;'delta'&lt;SPAN&gt;).saveAsTable(&lt;SPAN&gt;'people'&lt;SPAN&gt;)&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/SPAN&gt;&lt;/DIV&gt;
&lt;/DIV&gt;
&lt;P&gt;@Anonymous&lt;/a&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Also, do you know how can I verify the partitions on managed delta tables? I tried the following it did not work&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;//create table with partition
    spark.sql(f"""
    CREATE TABLE IF NOT EXISTS {table_name} (
        {query_string}
    )
    USING DELTA
    PARTITIONED BY ({partition_definition})
    LOCATION '{table_location}'

//verification
spark.sql("SHOW PARTITIONS StagingLakehouse.tbl").show()

//the above shows following
AnalysisException: Table spark_catalog.StagingLakehouse.tbl does not support partition management.;&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;</description>
      <pubDate>Sat, 10 Aug 2024 00:40:29 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4092512#M3521</guid>
      <dc:creator>smpa01</dc:creator>
      <dc:date>2024-08-10T00:40:29Z</dc:date>
    </item>
    <item>
      <title>Re: Seeking partition strategy</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4093033#M3524</link>
      <description>&lt;P&gt;Here is some discussion about partitioning, I found it interesting: &lt;A href="https://www.reddit.com/r/MicrosoftFabric/s/FF0ZV8JzzI" target="_blank" rel="noopener"&gt;https://www.reddit.com/r/MicrosoftFabric/s/FF0ZV8JzzI&lt;/A&gt;&lt;/P&gt;</description>
      <pubDate>Sat, 10 Aug 2024 07:31:03 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4093033#M3524</guid>
      <dc:creator>frithjof_v</dc:creator>
      <dc:date>2024-08-10T07:31:03Z</dc:date>
    </item>
    <item>
      <title>Re: Seeking partition strategy</title>
      <link>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4094596#M3555</link>
      <description>&lt;P&gt;Hi &lt;a href="https://community.fabric.microsoft.com/t5/user/viewprofilepage/user-id/24978"&gt;@smpa01&lt;/a&gt;&amp;nbsp;,&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Thanks for the reply from frithjof_v&amp;nbsp;.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;To verify partitions on a managed delta table, there are the following methods:&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;1. Delta Lake stores partitioned data in a nested catalog structure. You can navigate to where the table is stored and examine the catalog structure to view the partitions.&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="vhuijieymsft_0-1723443439953.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1149062i49797C6D9F3C0F7F/image-size/medium?v=v2&amp;amp;px=400" role="button" title="vhuijieymsft_0-1723443439953.png" alt="vhuijieymsft_0-1723443439953.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;2. Check for the existence of partitions by using a SQL query.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;SELECT DISTINCT emp_id FROM people&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;3. Use the Delta Lake API to check for partitions.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;LI-CODE lang="markup"&gt;from delta.tables import DeltaTable

delta_table = DeltaTable.forName(spark, "people")
delta_table.toDF().select("emp_id ").distinct().show()&lt;/LI-CODE&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;&lt;span class="lia-inline-image-display-wrapper lia-image-align-inline" image-alt="vhuijieymsft_1-1723443439958.png" style="width: 400px;"&gt;&lt;img src="https://community.fabric.microsoft.com/t5/image/serverpage/image-id/1149063i3844655E52BE904E/image-size/medium?v=v2&amp;amp;px=400" role="button" title="vhuijieymsft_1-1723443439958.png" alt="vhuijieymsft_1-1723443439958.png" /&gt;&lt;/span&gt;&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If you have any other questions please feel free to contact me.&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;Best Regards,&lt;BR /&gt;Yang&lt;BR /&gt;Community Support Team&lt;/P&gt;
&lt;P&gt;&amp;nbsp;&lt;/P&gt;
&lt;P&gt;If there is any post&amp;nbsp;&lt;STRONG&gt;&lt;EM&gt;helps&lt;/EM&gt;&lt;/STRONG&gt;, then please consider&amp;nbsp;&lt;STRONG&gt;&lt;EM&gt;Accept it as the solution&lt;/EM&gt;&lt;/STRONG&gt;&amp;nbsp;&amp;nbsp;to help the other members find it more quickly.&lt;BR /&gt;If I misunderstand your needs or you still have problems on it, please feel free to let us know.&amp;nbsp;&lt;STRONG&gt;&lt;EM&gt;Thanks a lot!&lt;/EM&gt;&lt;/STRONG&gt;&lt;/P&gt;</description>
      <pubDate>Mon, 12 Aug 2024 06:19:30 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Data-Engineering/Seeking-partition-strategy/m-p/4094596#M3555</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2024-08-12T06:19:30Z</dc:date>
    </item>
  </channel>
</rss>

