Sublime packages you should have in Sublime Text:
Common Used Shortcuts
Most common used Jupyter keyboard shortcuts
Hotkeys when working with Terminal
My blog timeline since it was first created in November 2016.
This is the continuing post to my previous article Introduction to SparkSQL, intending to understand SparkSQL on a deeper level.
Shuffle is one of the most expensive operations that will affect the performance of the job. Even though Spark tries to avoid shuffling as possible as it can...
Introduction Today, let’s get to understand what’s really happening behind the scene after we submit a Spark job to the cluster. I promise you that there wil...
Apache Spark Modules
After spending a significant time in reading the source code in spark-core project, I can briefly draw the architecture showing the relationships and the flo...