<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
<channel>
<title>Ask Ghassem - Recent questions tagged predict</title>
<link>https://ask.ghassem.com/tag/predict</link>
<description>Powered by Question2Answer</description>
<item>
<title>forecast log transformed fitted values for 2 years using ARMA model</title>
<link>https://ask.ghassem.com/1023/forecast-transformed-fitted-values-years-using-arma-model</link>
<description>Input is a stock price in exponential transformation. We are asked to forecast using ARMA results for 2 years.</description>
<category>Exploratory Data Analysis</category>
<guid isPermaLink="true">https://ask.ghassem.com/1023/forecast-transformed-fitted-values-years-using-arma-model</guid>
<pubDate>Wed, 04 May 2022 20:31:44 +0000</pubDate>
</item>
<item>
<title>How can you build dynamic pricing model with data only from rigid pricing?</title>
<link>https://ask.ghassem.com/1016/build-dynamic-pricing-model-with-data-only-from-rigid-pricing</link>
<description>I want to build a dynamic pricing model which means if product is too expansive for a client and there is a risk that we might loose a client we lower the price for them but if client doesn&amp;#039;t care that much about the price we might increase price a little.&lt;br /&gt;
&lt;br /&gt;
All the articles I&amp;#039;ve seen describe some kind of A/B testing for the pricing and then create a model.&lt;br /&gt;
&lt;br /&gt;
I want to build a model only on the existing rigid pricing data. So I have prices offered to customers and I know who bought the product and who went to other company.&lt;br /&gt;
&lt;br /&gt;
How can I do the increasing price part?</description>
<category>General</category>
<guid isPermaLink="true">https://ask.ghassem.com/1016/build-dynamic-pricing-model-with-data-only-from-rigid-pricing</guid>
<pubDate>Fri, 21 Jan 2022 06:44:31 +0000</pubDate>
</item>
<item>
<title>When dealing with categorical values, should the &#039;year&#039; column be encoded using OHE or OrdinalEncoder?</title>
<link>https://ask.ghassem.com/1012/dealing-categorical-values-should-encoded-ordinalencoder</link>
<description>It&amp;#039;s a car prices dataset, and so I&amp;#039;m assuming that the more recent the more value a car should have. The values in the &amp;#039;year&amp;#039; column simply consist of years from 1995 to 2020.&lt;br /&gt;
I am trying to predict the selling price of the car.&lt;br /&gt;
&lt;br /&gt;
I&amp;#039;m a bit new to ML, currently still doing my undergraduate so any help / tips are appreciated. Thank you.</description>
<category>Machine Learning</category>
<guid isPermaLink="true">https://ask.ghassem.com/1012/dealing-categorical-values-should-encoded-ordinalencoder</guid>
<pubDate>Sat, 18 Dec 2021 18:46:07 +0000</pubDate>
</item>
<item>
<title>How do I know which encoder to use to convert from categorical variables to numerical?</title>
<link>https://ask.ghassem.com/1006/know-which-encoder-convert-categorical-variables-numerical</link>
<description>So say I have a column with categorical data like different styles of temperature: &amp;#039;Lukewarm&amp;#039;, &amp;#039;Hot&amp;#039;, &amp;#039;Scalding&amp;#039;, &amp;#039;Cold&amp;#039;, &amp;#039;Frostbite&amp;#039;,... etc.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
I know that we can use pd.get_dummies to convert the column to numerical data within the dataframe, but I also know that there are other &amp;#039;converters&amp;#039; (not sure if that&amp;#039;s the correct terminology) that we can use, i.e. OneHotEncoder from Sk-learn (like I could use the pipeline module to make a nice pipeline and feed my dataframe through the pipeline to also get my categorical data encoded to numerical).&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
How do I know which to use? Does it matter? If it does matter, when does it matter the most (i.e. what types of problems? When there are lots of categorical variables, or few?) If anyone can give me any pointers on this type of stuff I&amp;#039;d greatly appreciate it.</description>
<category>Exploratory Data Analysis</category>
<guid isPermaLink="true">https://ask.ghassem.com/1006/know-which-encoder-convert-categorical-variables-numerical</guid>
<pubDate>Mon, 29 Nov 2021 04:09:06 +0000</pubDate>
</item>
<item>
<title>how many samples do we need to test image segmentation using synthetic data ?</title>
<link>https://ask.ghassem.com/993/many-samples-need-test-image-segmentation-using-synthetic</link>
<description>Hello,&lt;br /&gt;
&lt;br /&gt;
I trained a CNN using synthetic data to perform a segmentation task on human faces. During the test and to evaluate the prediction of this network, I used 200 examples from the database to compute precision and recall.&lt;br /&gt;
&lt;br /&gt;
Is this number sufficient, knowing that I control myself the data generator and that I build the database by randomly drawing the elements using centered Gaussian distributions.&lt;br /&gt;
&lt;br /&gt;
&lt;br /&gt;
Thank you,</description>
<category>Deep Learning</category>
<guid isPermaLink="true">https://ask.ghassem.com/993/many-samples-need-test-image-segmentation-using-synthetic</guid>
<pubDate>Mon, 21 Jun 2021 12:26:32 +0000</pubDate>
</item>
<item>
<title>Do  I need to save the standardization transformation?</title>
<link>https://ask.ghassem.com/970/do-i-need-to-save-the-standardization-transformation</link>
<description>When I standardized my data when I created my model. Do I need to save the standardization transformation when I want to predict with my model new data ?</description>
<category>Machine Learning</category>
<guid isPermaLink="true">https://ask.ghassem.com/970/do-i-need-to-save-the-standardization-transformation</guid>
<pubDate>Tue, 15 Dec 2020 13:06:48 +0000</pubDate>
</item>
<item>
<title>How to predict from unseen data?</title>
<link>https://ask.ghassem.com/954/how-to-predict-from-unseen-data</link>
<description>&lt;p&gt;Hi. I have a question about model-based predictions when data is only available after the fact.&amp;nbsp;Let me give you an example. I try to predict the result (HOME,&amp;nbsp;AWAY or a DRAW) of the match based on data like number of shots, ball possession, number of fouls, etc.&lt;/p&gt;

&lt;table border=&quot;1&quot; cellpadding=&quot;1&quot; style=&quot;width:500px&quot;&gt;
&lt;thead&gt;
&lt;tr&gt;
&lt;th scope=&quot;col&quot;&gt;TARGET&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;TEAM 1&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;TEAM 2&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;
&lt;p&gt;possesion&lt;/p&gt;

&lt;p&gt;team 1&lt;/p&gt;
&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;
&lt;p&gt;possesion&lt;/p&gt;

&lt;p&gt;team 2&lt;/p&gt;
&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;
&lt;p&gt;shots&lt;/p&gt;

&lt;p&gt;team 1&lt;/p&gt;
&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;
&lt;p&gt;shots&lt;/p&gt;

&lt;p&gt;team 2&lt;/p&gt;
&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;
&lt;p&gt;fouls&lt;/p&gt;

&lt;p&gt;team 1&lt;/p&gt;
&lt;/th&gt;
&lt;th scope=&quot;col&quot;&gt;
&lt;p&gt;fouls&lt;/p&gt;

&lt;p&gt;team 2&lt;/p&gt;
&lt;/th&gt;
&lt;/tr&gt;
&lt;/thead&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;td&gt;HOME&lt;/td&gt;
&lt;td&gt;Arsenal&lt;/td&gt;
&lt;td&gt;Chelsea&lt;/td&gt;
&lt;td&gt;60&lt;/td&gt;
&lt;td&gt;40&lt;/td&gt;
&lt;td&gt;12&lt;/td&gt;
&lt;td&gt;8&lt;/td&gt;
&lt;td&gt;5&lt;/td&gt;
&lt;td&gt;7&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;

&lt;p&gt;Let&#039;s say I&#039;m already after training the model and I want to see if I can predict the upcoming match. However, this match is only a few days away and I want to know the result of the model today.&amp;nbsp;I understand that if the match had already taken place and I had the data, I could test it on the model and get the result. The goal is for the model to predict what will happen before the match.&lt;/p&gt;

&lt;p&gt;Is it possible at all? What are my options? Should I only select pre-match variables? For example, last game form, match referee etc or should I aggregate the variables and include average possession, average shots and average number of fouls from recent matches?&lt;/p&gt;</description>
<category>Machine Learning</category>
<guid isPermaLink="true">https://ask.ghassem.com/954/how-to-predict-from-unseen-data</guid>
<pubDate>Tue, 17 Nov 2020 16:18:28 +0000</pubDate>
</item>
<item>
<title>How to model unknown yet data</title>
<link>https://ask.ghassem.com/943/how-to-model-unknown-yet-data</link>
<description>&lt;p&gt;So far, I have modeled on known historical data. What if there are variables known only after the fact?&lt;br&gt;
Let me give you an example. I want to predict the outcome of the match, win, lose or draw. I use variables from previous games such as ball possession, number of shots, corners, etc. Let&#039;s say the Chelsea-Arsenal game is approaching Saturday. How am I supposed to build a model and predict the result if this data is not yet available for my event? What to do in such cases, is it possible to forecast such data?&lt;/p&gt;

&lt;div id=&quot;gtx-trans&quot; style=&quot;position: absolute; left: -4px; top: -5.27273px;&quot;&gt;
&lt;div class=&quot;gtx-trans-icon&quot;&gt;&amp;nbsp;&lt;/div&gt;
&lt;/div&gt;</description>
<category>Machine Learning</category>
<guid isPermaLink="true">https://ask.ghassem.com/943/how-to-model-unknown-yet-data</guid>
<pubDate>Tue, 27 Oct 2020 10:39:47 +0000</pubDate>
</item>
</channel>
</rss>