<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
<channel>
<title>Ask Ghassem - Recent questions tagged natural-language-process</title>
<link>https://ask.ghassem.com/tag/natural-language-process</link>
<description>Powered by Question2Answer</description>
<item>
<title>Very short text classification when category text should be replaced by another category text?</title>
<link>https://ask.ghassem.com/980/classification-category-should-replaced-another-category</link>
<description>&lt;div style=&quot;max-width:800px&quot;&gt;
&lt;div style=&quot;color:#1A1A1B&quot;&gt;
&lt;p&gt;I need some tool to classify articles based on short category text which consists of two or three words separated by &#039;-&#039;. The RSS/XML tag content is for example:&lt;/p&gt;

&lt;blockquote&gt;
&lt;p&gt;Foreign - News&lt;/p&gt;
&lt;/blockquote&gt;

&lt;blockquote&gt;
&lt;p&gt;Football - Foreign&lt;/p&gt;
&lt;/blockquote&gt;

&lt;p&gt;I created my own categories in DB and now I need to classify categories from parsed RSS of this news source, so it fits news categories defined by me.&lt;/p&gt;

&lt;p&gt;I would, for example need all articles containing category &quot;football&quot; to be identified as a category &lt;em&gt;Sport&lt;/em&gt; but sometimes those categories XML tags contains exact match like &lt;em&gt;Foreign - News&lt;/em&gt; should belong in the DB to category defined by me as &lt;em&gt;Foreign&lt;/em&gt;.&lt;/p&gt;

&lt;p&gt;Since I used only trained decision trees frameworks from AI so for another project so far, I would like to hear advice about probably AI based approach, technique or particular framework I can use to solve this problem. I don&#039;t want to get into a dead-end street by my own poor, in the field of AI not very experienced decision.&lt;/p&gt;

&lt;p&gt;While it can be solved by many ifs and &#039;contains&#039; function, it seems to me like not a very good solution.&lt;/p&gt;

&lt;p&gt;TLDR; I need basically something like &quot;clever, flexible and universal if-elseif&quot;.&lt;/p&gt;

&lt;p&gt;NOTE: I can also use article description text, if that would be necessary but it seems to me that this former category text is &amp;nbsp;unambiguous enough for this kind of problem.&lt;/p&gt;
&lt;/div&gt;
&lt;/div&gt;</description>
<category>Artificial Intelligence</category>
<guid isPermaLink="true">https://ask.ghassem.com/980/classification-category-should-replaced-another-category</guid>
<pubDate>Thu, 11 Feb 2021 12:48:47 +0000</pubDate>
</item>
<item>
<title>Binary Classification and neutral tag</title>
<link>https://ask.ghassem.com/978/binary-classification-and-neutral-tag</link>
<description>&lt;p&gt;I am trying to create a sentiment analysis model using binary classification as loss.I have a batch of tweets that some of them are tagged as positive (labeled as 1)&amp;nbsp;and&amp;nbsp;negative (labeled as 0).I manage to gather some tweets that are tagged as neutral but there are less&amp;nbsp; tweets than positive and negative.My thinking is to tag them with 0.5 to balance the classification probability.Is this legit?&lt;/p&gt;

&lt;div id=&quot;gtx-trans&quot; style=&quot;position: absolute; left: 460px; top: 54px;&quot;&gt;
&lt;div class=&quot;gtx-trans-icon&quot;&gt;&amp;nbsp;&lt;/div&gt;
&lt;/div&gt;</description>
<category>Deep Learning</category>
<guid isPermaLink="true">https://ask.ghassem.com/978/binary-classification-and-neutral-tag</guid>
<pubDate>Sat, 30 Jan 2021 10:08:01 +0000</pubDate>
</item>
<item>
<title>Pre trainned word Embeddings and Preproceess</title>
<link>https://ask.ghassem.com/849/pre-trainned-word-embeddings-and-preproceess</link>
<description>How should i preprocess my data if i am gonna use a pretrainned word embedding like glove or word2vec?Should I use stemming or stopword removal techniques?</description>
<category>Machine Learning</category>
<guid isPermaLink="true">https://ask.ghassem.com/849/pre-trainned-word-embeddings-and-preproceess</guid>
<pubDate>Fri, 10 Apr 2020 12:08:09 +0000</pubDate>
</item>
<item>
<title>How to calculate the class probabilities and classify using Naive Bayes classifier for NLP?</title>
<link>https://ask.ghassem.com/654/calculate-class-probabilities-classify-using-classifier</link>
<description>&lt;p&gt;We want to use Naive Bayes for tagging documents. It is a classification task that we want to assign a class (tag) to each string. We currently have two tags: &lt;strong&gt;Sport &lt;/strong&gt;and&lt;strong&gt; &lt;/strong&gt;&lt;strong&gt;Not Sport&lt;/strong&gt;&lt;/p&gt;

&lt;p&gt;Which tag does the sentence&amp;nbsp;&lt;strong&gt;&lt;em&gt;A very close game&lt;/em&gt;&amp;nbsp;&lt;/strong&gt;belong to? Using Naive Bayes classifier, calculate the class probability for &lt;strong&gt;Sport&lt;/strong&gt; and &lt;strong&gt;Not sport &lt;/strong&gt;for this sentence based on&amp;nbsp;the dataset and decide about the tag.&lt;/p&gt;

&lt;table border=&quot;1px&quot; cellpadding=&quot;1px&quot; style=&quot;width:500px&quot;&gt;
&lt;tbody&gt;
&lt;tr&gt;
&lt;th&gt;&lt;strong&gt;Text&lt;/strong&gt;&lt;/th&gt;
&lt;th&gt;&lt;strong&gt;Tag&lt;/strong&gt;&lt;/th&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;“A great game”&lt;/td&gt;
&lt;td&gt;Sports&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;“The election was over”&lt;/td&gt;
&lt;td&gt;Not sports&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;“Very clean match”&lt;/td&gt;
&lt;td&gt;Sports&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;“A clean but forgettable game”&lt;/td&gt;
&lt;td&gt;Sports&lt;/td&gt;
&lt;/tr&gt;
&lt;tr&gt;
&lt;td&gt;“It was a close election”&lt;/td&gt;
&lt;td&gt;Not sports&lt;/td&gt;
&lt;/tr&gt;
&lt;/tbody&gt;
&lt;/table&gt;</description>
<category>Machine Learning</category>
<guid isPermaLink="true">https://ask.ghassem.com/654/calculate-class-probabilities-classify-using-classifier</guid>
<pubDate>Wed, 26 Jun 2019 19:43:41 +0000</pubDate>
</item>
</channel>
</rss>