<?xml version="1.0" encoding="utf-8"?>
<rss version="2.0">
<channel>
<title>Ask Ghassem - Recent activity in Big Data Tools</title>
<link>https://ask.ghassem.com/activity/data-science/big-data-tools</link>
<description>Powered by Question2Answer</description>
<item>
<title>Answer selected: Terminology clarification in Spark</title>
<link>https://ask.ghassem.com/979/terminology-clarification-in-spark?show=981#a981</link>
<description>The fact is the engine is still the same, regardless of which interface language you use. For some tasks, such as special cleaning we probably do not have SQL commands, and we have to use Scala or Python. Using Zeppelin, you can switch back and forth among languages the engine supports, however it is not a common practice. For some specific tasks, you can use pure Spark SQL or if you want to use the SQL in pyspark or scala, there are functions that can help you achieve the goal.&lt;br /&gt;
&lt;br /&gt;
I believe observing more examples will help you understand when you can use what.</description>
<category>Big Data Tools</category>
<guid isPermaLink="true">https://ask.ghassem.com/979/terminology-clarification-in-spark?show=981#a981</guid>
<pubDate>Wed, 17 Feb 2021 16:04:05 +0000</pubDate>
</item>
<item>
<title>Answered: Logstash takes more than 48 hours to ingest 10 GB CSV file. Is there a way to expedite?</title>
<link>https://ask.ghassem.com/388/logstash-takes-more-than-hours-ingest-csv-file-there-expedite?show=389#a389</link>
<description>&lt;p&gt;It depends on your system processing power and the way you configure Logstash. If you make more outputs streams it will take longer. Please take a look at &lt;a rel=&quot;nofollow&quot; href=&quot;https://www.elastic.co/blog/logstash-configuration-tuning&quot;&gt;this link&lt;/a&gt; to see how you can tune the configurations. More than that, you can see some nice tips &lt;a rel=&quot;nofollow&quot; href=&quot;https://www.elastic.co/guide/en/logstash/current/tuning-logstash.html&quot;&gt;here&lt;/a&gt;.&lt;/p&gt;</description>
<category>Big Data Tools</category>
<guid isPermaLink="true">https://ask.ghassem.com/388/logstash-takes-more-than-hours-ingest-csv-file-there-expedite?show=389#a389</guid>
<pubDate>Mon, 15 Oct 2018 02:05:16 +0000</pubDate>
</item>
</channel>
</rss>