<?xml version="1.0" encoding="UTF-8"?>
<rss xmlns:content="http://purl.org/rss/1.0/modules/content/" xmlns:dc="http://purl.org/dc/elements/1.1/" xmlns:rdf="http://www.w3.org/1999/02/22-rdf-syntax-ns#" xmlns:taxo="http://purl.org/rss/1.0/modules/taxonomy/" version="2.0">
  <channel>
    <title>topic Correct Python variables calls in Developer</title>
    <link>https://community.fabric.microsoft.com/t5/Developer/Correct-Python-variables-calls/m-p/907771#M22074</link>
    <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am trying to build a proportion table of late vendors based on orders. I wrote this basic code back in October in Jupyter Notebook, and it still works fine there. I am now trying to run it in Power BI, and I am unsure whether I need to declare the variable I am using even though I have already selected the column in Power BI. I get the following name error: LateDays is not defined. LateDays is the field I get in Power BI through OData, a service my company uses to fetch live data.&lt;/P&gt;&lt;P&gt;This is the code:&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;df = pandas.DataFrame(&lt;STRONG&gt;LateDays&lt;/STRONG&gt;)&lt;/DIV&gt;&lt;DIV&gt;df = dataset.drop_duplicates()&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;import pandas as pd&lt;/DIV&gt;&lt;DIV&gt;import numpy as np&lt;/DIV&gt;&lt;DIV&gt;import matplotlib&lt;/DIV&gt;&lt;DIV&gt;import matplotlib.pyplot as plt&lt;/DIV&gt;&lt;DIV&gt;matplotlib.style.use('seaborn')&lt;/DIV&gt;&lt;DIV&gt;import seaborn as sns&lt;/DIV&gt;&lt;DIV&gt;sns.set(font_scale = 2)&lt;/DIV&gt;&lt;DIV&gt;# Paste or type your script code here:&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;def is_late(x):&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;return x &amp;gt; 0&lt;/DIV&gt;&lt;DIV&gt;df['Late'] = df['LateDays'].apply(is_late)&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;group_by_vendor = df.groupby(['CompanyName', 'Late'])&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;delayed_by_vendor = df.groupby(['CompanyName', 'Late']).size().unstack().reset_index()&lt;/DIV&gt;&lt;DIV&gt;delayed_by_vendor['orders_count'] = (delayed_by_vendor[False] + delayed_by_vendor[True])&lt;/DIV&gt;&lt;DIV&gt;delayed_by_vendor['Proportion_Late'] = delayed_by_vendor[True] / delayed_by_vendor['orders_count']&lt;/DIV&gt;&lt;DIV&gt;delayed_by_vendor.sort_values('Proportion_Late', ascending=False)&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;Right at the first line of code I get the name error. How can I define the variable in the code when it is fetched from a live service with huge amounts of data? Thanks.&lt;/DIV&gt;&lt;/DIV&gt;</description>
    <pubDate>Wed, 22 Jan 2020 10:22:29 GMT</pubDate>
    <dc:creator>Anonymous</dc:creator>
    <dc:date>2020-01-22T10:22:29Z</dc:date>
    <item>
      <title>Correct Python variables calls</title>
      <link>https://community.fabric.microsoft.com/t5/Developer/Correct-Python-variables-calls/m-p/907771#M22074</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;I am trying to build a proportion table of late vendors based on orders. I wrote this basic code back in October in Jupyter Notebook, and it still works fine there. I am now trying to run it in Power BI, and I am unsure whether I need to declare the variable I am using even though I have already selected the column in Power BI. I get the following name error: LateDays is not defined. LateDays is the field I get in Power BI through OData, a service my company uses to fetch live data.&lt;/P&gt;&lt;P&gt;This is the code:&lt;/P&gt;&lt;DIV&gt;&lt;DIV&gt;df = pandas.DataFrame(&lt;STRONG&gt;LateDays&lt;/STRONG&gt;)&lt;/DIV&gt;&lt;DIV&gt;df = dataset.drop_duplicates()&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;import pandas as pd&lt;/DIV&gt;&lt;DIV&gt;import numpy as np&lt;/DIV&gt;&lt;DIV&gt;import matplotlib&lt;/DIV&gt;&lt;DIV&gt;import matplotlib.pyplot as plt&lt;/DIV&gt;&lt;DIV&gt;matplotlib.style.use('seaborn')&lt;/DIV&gt;&lt;DIV&gt;import seaborn as sns&lt;/DIV&gt;&lt;DIV&gt;sns.set(font_scale = 2)&lt;/DIV&gt;&lt;DIV&gt;# Paste or type your script code here:&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;def is_late(x):&lt;/DIV&gt;&lt;DIV&gt;&amp;nbsp;&amp;nbsp;&amp;nbsp;&amp;nbsp;return x &amp;gt; 0&lt;/DIV&gt;&lt;DIV&gt;df['Late'] = df['LateDays'].apply(is_late)&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;group_by_vendor = df.groupby(['CompanyName', 'Late'])&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;delayed_by_vendor = df.groupby(['CompanyName', 'Late']).size().unstack().reset_index()&lt;/DIV&gt;&lt;DIV&gt;delayed_by_vendor['orders_count'] = (delayed_by_vendor[False] + delayed_by_vendor[True])&lt;/DIV&gt;&lt;DIV&gt;delayed_by_vendor['Proportion_Late'] = delayed_by_vendor[True] / delayed_by_vendor['orders_count']&lt;/DIV&gt;&lt;DIV&gt;delayed_by_vendor.sort_values('Proportion_Late', ascending=False)&lt;/DIV&gt;&lt;BR /&gt;&lt;DIV&gt;Right at the first line of code I get the name error. How can I define the variable in the code when it is fetched from a live service with huge amounts of data? Thanks.&lt;/DIV&gt;&lt;/DIV&gt;</description>
      <pubDate>Wed, 22 Jan 2020 10:22:29 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Developer/Correct-Python-variables-calls/m-p/907771#M22074</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2020-01-22T10:22:29Z</dc:date>
    </item>
    <item>
      <title>Re: Correct Python variables calls</title>
      <link>https://community.fabric.microsoft.com/t5/Developer/Correct-Python-variables-calls/m-p/908196#M22077</link>
      <description>&lt;P&gt;Hi,&lt;/P&gt;&lt;P&gt;On my Power BI Desktop, the Python visual appears to fetch the selected data and store it in a variable named "dataset", so fields such as LateDays should be available as columns of that DataFrame rather than as standalone variables.&lt;/P&gt;&lt;P&gt;Try replacing "df" with "dataset" in your script.&lt;/P&gt;&lt;P&gt;I hope this solves your problem.&lt;/P&gt;&lt;P&gt;Thanks,&lt;/P&gt;&lt;P&gt;Ty&lt;/P&gt;</description>
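A minimal sketch of the suggestion in this reply, assuming the Python visual hands the selected fields to the script as a pandas DataFrame named `dataset`, with `LateDays` and `CompanyName` as columns. The function name `late_proportion_by_vendor`, the `late_count` column, and the sample rows are hypothetical and only there so the snippet runs outside Power BI.

```python
import pandas as pd


def late_proportion_by_vendor(dataset: pd.DataFrame) -> pd.DataFrame:
    """Return one row per vendor with its order count and share of late orders."""
    # LateDays is read as a column of `dataset`, not as a bare variable,
    # which is what avoids the NameError from the question.
    df = dataset.drop_duplicates().copy()
    df["Late"] = df["LateDays"] > 0

    # Count all orders and late orders per vendor (True sums as 1).
    result = (
        df.groupby("CompanyName")["Late"]
        .agg(orders_count="count", late_count="sum")
        .reset_index()
    )
    result["Proportion_Late"] = result["late_count"] / result["orders_count"]
    return result.sort_values("Proportion_Late", ascending=False).reset_index(drop=True)


# Hypothetical stand-in for the DataFrame Power BI is assumed to provide;
# inside a Python visual this assignment would be left out.
dataset = pd.DataFrame(
    {
        "CompanyName": ["A", "A", "A", "B", "B"],
        "LateDays": [3, 0, 5, 0, -1],
    }
)

proportions = late_proportion_by_vendor(dataset)
print(proportions)
```

Inside the visual, the stand-in assignment would be removed and the function called on the provided `dataset`. Counting per vendor with a single `groupby` also avoids the missing `True` or `False` column that `unstack()` can produce when no vendor has late (or on-time) orders. Note that `drop_duplicates()` collapses identical rows, so two genuine orders with the same vendor and the same `LateDays` would be counted once unless a unique order field is also selected.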
      <pubDate>Wed, 22 Jan 2020 16:13:01 GMT</pubDate>
      <guid>https://community.fabric.microsoft.com/t5/Developer/Correct-Python-variables-calls/m-p/908196#M22077</guid>
      <dc:creator>Anonymous</dc:creator>
      <dc:date>2020-01-22T16:13:01Z</dc:date>
    </item>
  </channel>
</rss>

