What is the difference between Hadoop and Spark?
Spark | Hadoop |
---|---|
A data analytics engine | A big data processing framework |
Can process real-time data, such as live event streams from Twitter and Facebook, in addition to batch data | Built for batch processing of huge volumes of data |
Low-latency computing | High-latency computing |
Can process data interactively | Processes data in batch mode |
Easier to use: lets a user process data with high-level operators through abstractions | Hadoop's model is somewhat more complex: the user has to work with low-level APIs |
Performs in-memory computation and schedules the stages of a job itself, so no external job scheduler is required | An external job scheduler (for example, Apache Oozie) is required to chain MapReduce jobs into complex workflows |
A bit less secure compared to Hadoop | Highly secure |
Costlier than Hadoop, since in-memory processing needs more RAM | Less costly |
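
To make the "high-level operators versus low-level APIs" row more concrete, here is a minimal sketch of a word count written in both styles. It is plain Python, not the real Hadoop or Spark API: the functions `map_phase`, `shuffle_phase`, and `reduce_phase` are hypothetical names that only imitate the phases of the MapReduce model, and `collections.Counter` is an assumed stand-in for a high-level operator.

```python
from collections import Counter, defaultdict

# Sample input, invented for illustration.
lines = ["spark and hadoop", "hadoop runs batch jobs", "spark runs in memory"]


# Low-level style: every phase of the MapReduce model is written by hand.
def map_phase(lines):
    # Emit a (word, 1) pair for every word in every line.
    return [(word, 1) for line in lines for word in line.split()]


def shuffle_phase(pairs):
    # Group the emitted values by key, as happens between map and reduce.
    groups = defaultdict(list)
    for word, count in pairs:
        groups[word].append(count)
    return groups


def reduce_phase(groups):
    # Sum the grouped values for each word.
    return {word: sum(counts) for word, counts in groups.items()}


low_level_counts = reduce_phase(shuffle_phase(map_phase(lines)))

# High-level style: the same result from a single expression.
high_level_counts = dict(Counter(word for line in lines for word in line.split()))
```

Both variables end up holding the same word counts. The difference is how much of the plumbing the programmer has to write, which is the point the table makes about the two programming models.
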
BY Best Interview Question ON 06 Feb 2020