Apache Spark Components CheatSheet

Troubled by confusing concepts such as Executors, Node, RDD, Task in spark? Invest just 2 minutes of your time to make some order in this mess!

I'll clean up these apache spark concepts for you!

Spark building blocks: executor,tasks,cache,sparkcontext,cluster manager

Executor => Multiple Tasks: is a JVM process sitting on all nodes. Executors receive tasks (jars with your code) deserialize it, and run it as a task.

Executors utilize cache so that the tasks can run faster.

Node => Multiple Executors: Each node has multiple executors.

RDD => Big DataStructure: Its main strength is that it represents data which cannot be stored on a single machine, so its data is distributed, partitioned, split across computers.

Input => RDD: Every RDD is born out of some input like a text file, hadoop files etc.

Output => RDD: The output of functions in spark can produce an RDD. So it's like one function after another each receives an input RDD and outputs an output RDD, it's functional.

RDD[Type, Type] : RDD's are typed, they are data of a certain type.

RDD => 1,2,3: RDD's are ordered.

RDD => Zzzz: RDD's are lazily evaluated. We said functional, didn't we? so you have multiple transformations on your data and only when you hit an action you need the actual data.

RDD => Partitioned: RDD's are partitioned between servers, we said it's big data so we need to partition it.

RDD => Array(thing1, thing2, thing3) : You can think of RDD's as a bunch of things.

Guys if you have any other mess and want me to cheatsheet something for you just comment below, also I would highly appreciate any comment's about this post please feedback me!

Comments

Sowmiya R6 July 2020 at 10:13
Thanks for the information. The one thing I have noticed in this website is that you were continuously updating the changes that you have been made. It is a good sign to attract more people and I appreciate it. Hope more update and news from you.
Oracle Training | Online Course | Certification in chennai | Oracle Training | Online Course | Certification in bangalore | Oracle Training | Online Course | Certification in hyderabad | Oracle Training | Online Course | Certification in pune | Oracle Training | Online Course | Certification in coimbatore

ReplyDelete
Replies
Sowmiya R6 July 2020 at 10:50
It is actually a great and helpful piece of information. I am satisfied that you simply shared this helpful information with us. Please stay us informed like this. Thanks for sharing.

Oracle Training | Online Course | Certification in chennai | Oracle Training | Online Course | Certification in bangalore | Oracle Training | Online Course | Certification in hyderabad | Oracle Training | Online Course | Certification in pune | Oracle Training | Online Course | Certification in coimbatore
ReplyDelete
Replies
Devi6 August 2020 at 03:15
Really it was an awesome article… very interesting to read…. oracle training in chennai
ReplyDelete
Replies
Sages Marketing10 September 2020 at 06:38
Damien Grant
Damien Grant
Damien Grant
Damien Grant
Damien Grant
Damien Grant
Damien Grant
Damien Grant
ReplyDelete
Replies
Sages Marketing10 September 2020 at 06:38
Damien Grant
Damien Grant
Damien Grant
Damien Grant
Damien Grant
Damien Grant
Damien Grant
Damien Grant
ReplyDelete
Replies
Devi26 May 2021 at 04:36
Infycle Technologies is the best software training center in Chennai and is widely known for its excellence in giving the best software training in Chennai. Providing quality software programming training with 100% assured placement & to build a strong career for every individual and young professionals in the software industry is the ultimate aim of Infycle Technologies. Apart from all, the students love the 100% practical training, which is the specialty of Infycle Technologies. To proceed with your career with a solid base, reach Infycle Technologies through 7502633633.Best Software Training Center in Chennai | Infycle Technologies
ReplyDelete
Replies
Hussey26 November 2021 at 03:50
Happy to read the informative blog. Thanks for sharing
IELTS Coaching Center in Chennai
best ielts coaching centre in chennai

ReplyDelete
Replies
Pavithra Devi10 February 2022 at 21:06

This post is so interactive and informative.keep update more information...
ccna Training in Tambaram
ccna course in Chennai
ReplyDelete
Replies
Links For You10 August 2022 at 18:49
Red Gate .NET Reflector Crack is a program with which users can extract the source code for Windows programs and apply the required changes.Red Gate .NET Reflector Crack
ReplyDelete
Replies
syedhaseeb11 August 2022 at 03:53
Beyond Compare Key License Keygen fully lets key's the latest stage to give you various countenances for the same data format without .Beyond Compare Crack</
ReplyDelete
Replies
Links For You26 August 2022 at 01:13
Surprise Quotes For Him our man, despite his gruff look, longs to be cherished and wanted by you, furthermore on hear that you just love him. Surprise Quotes For Him
ReplyDelete
Replies
Mudassara Shahzad22 October 2022 at 04:12

This is a very well-written piece. Keep posting great things on your page. Your blog is wonderful.
https://softkeygen.com/scrivener-crack-license-key/
ReplyDelete
Replies

Add comment

Code Code Code Blog

Search This Blog

Apache Spark Components CheatSheet

Labels

Comments

Post a Comment

Popular posts from this blog

Functional Programming in Scala for Working Class OOP Java Programmers - Part 1

Alternatives to Using UUIDs

Bellman Ford Graph Algorithm