未加星标

MapR Takes Road Less Traveled to Big Data

字体大小 | |
[数据库(综合) 所属分类 数据库(综合) | 发布者 店小二05 | 时间 2017 | 作者 红领巾 ] 0人收藏点击收藏

The big data market continues to evolve, as I have written previously . Vendors are attempting to differentiate their offerings as they seek to encourage customers to pay for technology that they could potentially download for free.

MapR is one of those big data vendors. It entered the market in 2011 with a Hadoop distribution that used an alternative, POSIX-based file system that also offers HDFS API compatibility. While other Hadoop distribution vendors were chasing a volume-based business by using the Apache open source community surrounding Hadoop, MapR deliberately chose a more focused strategy of targeting enterprise capabilities and features. The MapR file system was designed to provide features that were missing in the early days of Hadoop, and it continues to form the backbone of the company’s offering today. I recently attended the company’s inaugural analyst day at its headquarters in San Jose to get an update on its progress and approach to the market.

The MapR file system (MapR-FS) provides NFS access, allowing organizations to load and share files from standard file storage systems. MapR-FS provides read-write capabilities, which typically are not available in Hadoop implementations. Using the NFS capabilities, MapR was able to provide high availability, snapshots and replication before other distributions. The company also claims NFS offers higher performance than HDFS since NFS is supported natively in the operating system. With these capabilities MapR secured some early enterprise deals with customers such as American Express and comScore that remain customers today.

In 2013, MapR added MapR-DB for NoSQL database capabilities running in the Hadoop cluster. MapR-DB uses the JSON interface for document database capabilities and provides an API for HBase applications. The high availability, snapshot and replication capabilities mentioned above are available for MapR-DB since MapR-FS is the underlying platform for both parts of the system.

Then in 2015, the company introduced MapR Streams for processing streaming event data, and it began to call the combined products the MapR Converged Data Platform since it
MapR Takes Road Less Traveled to Big Data
offered batch processing via Hadoop, operational processing via NoSQL and streaming data. The last is a critical capability for Internet of Things (IoT) applications. Our recently completed IoT and operational intelligence benchmark research shows that nearly half (46%) of those implementing IoT applications consider it essential to have low or very low latency for processing events.

Like other parts of its platform, MapR Streams supports an open source API, in this case the Kafka API. Combining open source APIs with its proprietary products enables MapR to participate in and benefit from the open source ecosystem surrounding the big data market. The MapR platform also supports Apache Spark , which I have written about ,and provides a SQL interface via Apache Drill.

The strategy seems to be working. Many of MapR’s early customers have continued to use its products and increased their investments in them. This approach allows MapR to have clearly differentiated products based on open source technology. However, it also creates some challenges. The divergence from the open source versions results in a smaller community for the MapR products and may cause it to be passed over by prospects who prefer to stay closer to the Apache version of Hadoop. Nevertheless, MapR has managed to establish a position as one of the top three Hadoop distributions. It also claims to be growing revenues significantly and shared some financial metrics under nondisclosure with the analysts in attendance.

MapR offers a robust platform that covers many of the big data requirements, which often require integration of separate products or open source projects. If you are considering a big data project, I recommend evaluating whether MapR meets your needs.

(About the author: David Menninger is a senior vice president and research director at Ventana Research. This post originally appeared on his Ventana blog, which can be viewed here ).

本文数据库(综合)相关术语:系统安全软件

主题: HadoopSQLHDFSSparkHBaseKafka
分页:12
转载请注明
本文标题:MapR Takes Road Less Traveled to Big Data
本站链接:http://www.codesec.net/view/529983.html
分享请点击:


1.凡CodeSecTeam转载的文章,均出自其它媒体或其他官网介绍,目的在于传递更多的信息,并不代表本站赞同其观点和其真实性负责;
2.转载的文章仅代表原创作者观点,与本站无关。其原创性以及文中陈述文字和内容未经本站证实,本站对该文以及其中全部或者部分内容、文字的真实性、完整性、及时性,不作出任何保证或承若;
3.如本站转载稿涉及版权等问题,请作者及时联系本站,我们会及时处理。
登录后可拥有收藏文章、关注作者等权限...
技术大类 技术大类 | 数据库(综合) | 评论(0) | 阅读(47)