未加星标

Hadoop vs Cassandra 2018: Feature Wise Comparison

字体大小 | |
[数据库(综合) 所属分类 数据库(综合) | 发布者 店小二05 | 时间 2018 | 作者 红领巾 ] 0人收藏点击收藏
1. Hadoop vs Cassandra

Today, we will take a look at Hadoop vs Cassandra. There is always a question occurs that which technology is a right choice between Hadoop vs Cassandra. So, in this article, “Hadoop vs Cassandra: Feature wise Comparison” we will see the difference between Apache Hadoop and Cassandra . Although, to understand well we will start with an individual introduction of both in brief.

So, let’s start Hadoop vs Cassandra.


Hadoop vs Cassandra 2018: Feature Wise Comparison

Hadoop vs Cassandra 2018: Feature Wise Comparison

2. Difference Between Hadoop and Cassandra

We will see the Big Data Hadoop vs Cassandra difference by discussing the meaning of Hadoop and Cassandra:

a. What is Hadoop?

As we know an open source software, especially, designed to handle parallel processing is what we call Hadoop. We also use it as a data warehouse for large volume data. In other words, this is a framework which allows storing as well as processing big data in a distributed environment across clusters of computers by using simple programming models. Basically, the main aim to design it is to scale up from single servers to thousands of machines. And, especially, to make each of them offering local computation as well as storage.

b. What is Cassandra?

Whereas, it is simply a NoSQL database, for the purpose of high speed, online transactional data. Well, its best feature is that it works without a single point of failure.

Moreover, it helps to keep the updated status of the surrounding nodes in the cluster with the help of the gossip protocol. There may be a time when one node goes down, at that time the other one takes its responsibility until the failed one is not fixed. Although, when the nodes exchange the gossip, older information gets overwritten by a newer version of gossip, because all gossip messages possess a version associated with it.

Let’s Check HBase vs Cassandra

In addition, it supports unstructured data along with a flexible schema.

3. Feature Wise Comparison of Hadoop vs Cassandra

Now, let’s begin the comparison,Hadoop vs Cassandra:

a. Supported format Apache Hadoop

Hadoop handles several types of data such as structured, semi-structured, unstructured or images.

Have a look at Setup for Hadoop

Cassandra

However, rather than Images, Cassandra handles almost all structured, semi-structured, unstructured datasets. In addition, we can say Cassandra is best to perform on a semi-structured dataset.

b. Usage Apache Hadoop

Especially, we use Hadoop for batch processing of data.

Let’s discuss Hadoop Features

Cassandra

Whereas, it is mostly used for real-time processing.


Hadoop vs Cassandra 2018: Feature Wise Comparison
c. Work Apache Hadoop

Hadoop’s core is HDFS, that is a base for other analytical components especially for handling big data.

You must see the Hadoop Working Process

Cassandra

Well, it works on top HDFS .

d. CAP Parameters(consistency, availability and partition tolerance ) Apache Hadoop

It supports consistency and partition tolerance.

Cassandra

But it supports availability and partition tolerance.

Learn Hadoop from Industry Experts

e. Communication Apache Hadoop

For communication among nodes in a cluster, Hadoop uses RPC/TCP and UDP.

Cassandra

And, it uses gossip protocol, for communication between nodes. Basically, this protocol helps by broadcasting the node status to its peer nodes in the cluster .

f. Architecture Apache Hadoop

It has a master-slave architecture. Where master is Namenodeand Slave is data node.

Cassandra

Butit has a distributed architecture . Although, here is a peer to peer communication between all the nodes.

g. Data Access Mode Apache Hadoop

Basically, to read/write, it uses map-reduce .

Cassandra

Well, it uses Cassandra query language .

h. Fault tolerance Apache Hadoop

Everything goes for a tossif master node goes down. Hence, we can say, Hadoop is not good with failure.

Cassandra

But Cassandra is good with it, because when one node goes down, at that time the other one takes its responsibility until the failed one is not fixed.

i. Data Compression Apache Hadoop
Hadoop vs Cassandra 2018: Feature Wise Comparison

It compresses files 10-15 %by using best available techniques.

Cassandra

Whereas, it compresses files up to 80% even without any overhead.

j. Data Protection Apache Hadoop

Access control &Data audit, verify the appropriate user/group permission, in Hadoop.

Cassandra

Whereas, in Cassandra, Data is protected with commit log design. Moreover, backup and restore mechanism (Build in security) plays a vital role here.

本文数据库(综合)相关术语:系统安全软件

tags: Hadoop,Cassandra,Apache,vs,data
分页:12
转载请注明
本文标题:Hadoop vs Cassandra 2018: Feature Wise Comparison
本站链接:https://www.codesec.net/view/586986.html


1.凡CodeSecTeam转载的文章,均出自其它媒体或其他官网介绍,目的在于传递更多的信息,并不代表本站赞同其观点和其真实性负责;
2.转载的文章仅代表原创作者观点,与本站无关。其原创性以及文中陈述文字和内容未经本站证实,本站对该文以及其中全部或者部分内容、文字的真实性、完整性、及时性,不作出任何保证或承若;
3.如本站转载稿涉及版权等问题,请作者及时联系本站,我们会及时处理。
登录后可拥有收藏文章、关注作者等权限...
技术大类 技术大类 | 数据库(综合) | 评论(0) | 阅读(141)