未加星标

Open source Microsoft Graph Engine takes on Neo4j

字体大小 | |
[数据库(综合) 所属分类 数据库(综合) | 发布者 店小二04 | 时间 2017 | 作者 红领巾 ] 0人收藏点击收藏

Open source Microsoft Graph Engine takes on Neo4j

Sometimes the relationships between the data you've gathered are more important than the data itself. (See: Facebook monetizing your list of friends.) That's when a graph processing system comes in handy. It's animportant but oftenpoorly understood method for exploring how items in a dataset are interrelated.

Microsoft's been exploring this area since at least 2013, when it published a paper describing the Trinity project, a cloud-based, in-memory graph engine. The fruits of the effort, known as the Microsoft Graph Engine , are now available as an MIT-licensed open source project as an alternative to the likes of Neo4j or the linux Foundation's recently announced JanusGraph.

Everything is connected

Microsoft calls Graph Engine (GE) as "both a RAM store and a computation engine." Data can be inserted into GE and retrieved at high speed since it's kept in-memory and only written back to disk as needed. It can work as a simple key-value store like Memcached, but Redis may be the better comparison, since GE stores data in strongly typed schemas (string, integer, and so on).

The "computation engine" part of the equation means GE implements distributed algorithms across nodes, written in C#. It's not optimized out of the box for a specific kind of graph algorithm, so it'll likely appeal to those who want to write their own graph-exploration algorithms from the ground up -- or simply write their own distributed algorithms.

"Instead of trying to provide an exhaustive set of built-in computation modules," states Microsoft's documentation , "GE tries to provide generic building blocks to allow us to easily build such modules." Those blocks include a system for synchronous and asynchronous message passing, as well as the LIKQ graph query language that's already used by the Academic Graph Search API in Microsoft Cognitive Services.

Different ways through the maze

How does all this shape up against the leading open source graph database, Neo4j? For one, Neo4j has been in the market longer and has an existing user base. It's also available in both an open source community edition and a commercial product, whereas GE is only an open source project right now.

That said, only the commercial, enterprise-oriented edition of Neo4j supports sharding and replication. GE, by contrast, is clustered in its default open source incarnation, although clustering on both Neo4j and GE requires manual setup. In GE's case, the roles for each node in the cluster (servers and, optionally, query-aggregating proxies) need to be configured manually depending on the use case.

Another distributed graph database worth comparing to GE is JanusGraph , a new project under the sponsorship of the Linux Foundation with contributions by Google, Hortonworks, and IBM. It's been built to work closely with and leverage the Hadoop ecosystem. Elasticsearch and Lucene can be used as indexing engines, and Cassandra and HBase can be used as data stores. With GE, data has to be imported into it first.

What Microsoft appears to be aiming for with GE isn't head-on competition with those projects. Instead, GE is a piece of distributed data-storage infrastructure that receives new data and provides graph computation as one of its multiple benefits. Its liberal licensing also makes it easily refittable into other products or readily repurposed for hosting at scale. It isn't clear if Microsoft has used GE as part of any of its own systems (although it has used LIKQ, as noted above).

If those building on non-Microsoft platforms are interested in trying out Graph Engine, cross-platform support for Linux/BSD is coming shortly, according to one of the developers .

本文数据库(综合)相关术语:系统安全软件

主题: LinuxMemcachedHadoopHBaseLuceneRedisC#CassandraFacebookIBM
分页:12
转载请注明
本文标题:Open source Microsoft Graph Engine takes on Neo4j
本站链接:http://www.codesec.net/view/533909.html
分享请点击:


1.凡CodeSecTeam转载的文章,均出自其它媒体或其他官网介绍,目的在于传递更多的信息,并不代表本站赞同其观点和其真实性负责;
2.转载的文章仅代表原创作者观点,与本站无关。其原创性以及文中陈述文字和内容未经本站证实,本站对该文以及其中全部或者部分内容、文字的真实性、完整性、及时性,不作出任何保证或承若;
3.如本站转载稿涉及版权等问题,请作者及时联系本站,我们会及时处理。
登录后可拥有收藏文章、关注作者等权限...
技术大类 技术大类 | 数据库(综合) | 评论(0) | 阅读(56)