|By Daniel Thompson||
|February 25, 2013 10:07 AM EST||
Last October I published a post that identified the features that both JBoss Data Grid and Oracle Coherence provide (link). My goal was to establish a baseline for the features that a data grid should provide. It was not to state that one data grid was better than the other. Little did I know an Oracle employee would respond by attacking Red Hat, its engineers, and myself.
It is fear? Is it hostility? I don’t know.
I have engaged in discussions with competitors before. Roman and I engaged in a competitive discussion in response to one of my posts comparing IBM WebSphere and JBoss EAP (link). However, we both conducted ourselves in a professional manner. I’ve engaged in competitive discussions with Spring evangelists, but we focused on the technology.
I always enjoy reading the discussions between Cameron, Nikita, and Nati on the TheServerSide. I find their discussions to be insightful. They conduct themselves in a professional manner. To me, it looks like they respect each other and they respect each others products.
To be fair, this is just a single anonymous visitor. If they had not left a comment while connected to Oracle’s network, I would not have known that they work for Oracle. However, I would have inferred it.
Let the show begin.
I’m also interested in this question too RK. JDG lack of references that could corroborate performance majority against Coherence. Coherence in the other hand has a lot of public cases that shows how scalable, reliable and fast it is. It is the word’s first in-memory computing platform of the world, so this blog doesn’t offer credibility at all, mostly because Shane is a marketing guy from Red Hat.
He’s just using a old marketing technique to improve reliability of their offers product, comparing it with another one which is leader in its industry, like Coherence. Comparing with Coherence would pass the idea of “JDG is so good just like Coherence, so instead of buying Coherence, buy from Red Hat” but in fact it is not true. JDG should implement A LOT OF features to be comparable with Coherence.
I published the results of a performance test (JDG 6.0.1) last December (link). I have written a technical white paper that includes the results of a number of performance tests (JDG 6.0.1). However, it is awaiting publication. I expect it to be made available via the Red Hat Customer Portal. In addition, I will be publishing the results of a few performance tests (JDG 6.1) executed on better hardware to How to JBoss within the next two weeks.
I executed the performance tests with RadarGun (link), an open source project for data grid performance testing. When I published the results, I provided both the RadarGun and the JDG configuration files. The best way for an organization to select a data grid based on performance and reliability is to configure and execute their own performance tests based on their own requirements in their own controlled environment.
Those results compares JDG against Terracotta from Software AG, not with Coherence from Oracle. You cannot say at all that JDG is better than Coherence because you’ve never tested. Again, not reliable statements coming from you. You’ve tried to use a Terracotta comparison to generalize JDG performance results. Lets call Oracle, VMware, IBM, Gigaspaces and TIBCO to participate of the tests.
I have never stated that JDG performs better than Oracle Coherence. Thus, I am unaware of these “not reliable statements”. Can you could point them out? You are welcome to call Oracle, VMWare, IBM, GigaSpaces, and TIBCO. They are welcome to use the my RadarGun and JDG configuration files to configure and execute their own performance tests with RadarGun. This is essentially what I did after coming across the performance test results published by Terracotta. I simply used the parameters that they made available. As I mentioned previously, a number of organizations evaluating JDG are doing just that. They are executing performance tests against both JDG and Oracle Coherence using RadarGun or YCSB (Yahoo! Cloud Serving Benchmark).
Tongosol Coherence was an innovative product in its day, but that day was several years ago. I do not question that it remains reliable. However, there have been a number of advancements in distributed systems over the past few years. JBoss Data Grid brings together the reliability of the previous generation of data grids and the innovation of the next generation of data grids.
For some unique features of Coherence like its non-blocking I/O TCP/IP network based on TCMP, which allow it to achive better results with distributed transactions, fail-over detection (the fastest of the industry) WAN replication with latency issues due geographical distribution and the HTTP Session offload from AppServers. Not mentioning integration with A LOT OF AppServers like WebLogic, GlassFish, Websphere, Tomcat, IIS, Resin and even your JBoss AS. JDG only gives support for which is from Red Hat. What a nice example of being “open” hãm ?! :)
The TCMP features that you listed are not unique to TCMP. They are provided by JGroups as well. Those features include non-blocking I/O (NIO), failure detection, and cross site (WAN) replication. I would hope that Oracle Coherence*Web would support both Oracle WebLogic and Oracle GlassFish. Is that really a feature? If so, Red Hat provides it as well. JDG supports both JBoss EAP and JBoss EWS (Apache Tomcat). However, there is no reason for an organization to use JDG or Oracle Coherence*Web for session replication with IBM WebSphere.
Red Hat public references include both Chicago Board Options Exchange (CBOE) and Cisco, and they have both presented at Red Hat Summit / JBoss World. I can’t think of an environment with higher demands for both performance and reliability than financial trading. The Pentaho BI Platform / Server includes a plugin for Infinispan (link). There is no Oracle Coherence plugin.
Only this? Coherence has thousands of customer references, including mission critical ones that for years NEVER, I mean, NEVER restarted their servers. Come on, you can do better than this. Red Hat (you) should be a little bit more humble when talking about leaders like Oracle. Someday Red Hat will be a huge company, I don’t doubt that, but that didn’t happened so far and will take some time.
Can you point out a list that includes thousands of public customer references for Oracle Coherence? The only list that I found includes 39 customer references, and that list includes duplicates (link). You state that thousands of Oracle Coherence customers have NEVER restarted their servers. That is a bold claim. Do you have evidence to support such a claim? After all, servers may be restarted to upgrade the hardware and / or operating system. That, and enterprise software typically has a finite life cycle. Has not a just one of those thousands of Oracle Coherence customers ever upgraded their original version to the latest version? I suspect that you and I have different interpretations of “huge company”. I find it ironic that you demand humility while showing disrespect.
Are you stating that because the company you work for (Oracle) productized (well, acquired) a data grid before the company I work for (Red Hat) and that my role is now in marketing, I lack credibility in the data grid domain? I would advise against such a statement. My technical knowledge of data grids is second to none, and it is not derived from my role in marketing. I have worked with a number of enterprise organizations in the financial, telecommunication, and media sectors in a developer / architect capacity in my previous role to integrate data grids in demanding environments.
Oh yes? Give me examples of data grid technologies you’ve worked with, scenarios of data partitioning and JVM tuning you’ve implemented for, entity domain versioning strategies you’ve designed it, hashCode algorithms strategies that you’ve proposed for a complex based key node, examples of KPIs that you retrieved from JMX and from the DG, and of course, examples of the following DG scenarios: average latency less than 600 microseconds, 5k TPS or higher considering a transaction with a minimum of 15KB of size, client applications both based on Java, C++, .NET and “the rest of world” that could be accessed with REST or SOAP, projects with more than 20K hours of duration (real one projects) instead of stupid POCs, usage of at least three serious data grids technologies including Coherence, GemFire, Websphere eXtreme Scale, Gigapaces, TIBCO ActiveSpaces, etc.
I do not question your knowledge and experience, nor am I going to. I am dumbfounded as to why you feel justified in questioning mine.
I look at it like this. You have pilots, and you have mechanics. You have users, and you have engineers. A pilot knows how the controls work, a mechanic knows the parts work. When it comes to JDG, I have been a full time pilot and a part time mechanic. However, the activities that you have mentioned are those of a user, not of an engineer. Further, they are not specific to data grids. It’s one thing to talk about metrics, latency, and throughput. It’s another to talk about concurrency, algorithms, and how distributed systems work.
JVM tuning. I have posted a handful of notes on both OS and JVM tuning (link / link / link). Instead of talking about JVM tuning, let’s talk about implementations of ConcurrentMap (link). JMX. I have monitored and analyzed the performance of JDG with JBoss Operations Network, in-house tools, and BTrace. Here is a list of JMX attributes and operations for JDG (link). An average of latency of 600 microseconds is not particularly impressive in the financial trading industry. Nor is 5,000 transactions per second. Did I not mention, the fact that I collaborated with their engineers, and that co-presented with them at Red Hat Summit / JBoss World? Instead of talking about latency and throughput, let’s talk about data structures and eviction algorithms (link). I’ll be honest, I have not worked on projects that required integration in a heterogeneous environment. Those that have, have done so with REST and memcache. Oracle Coherence doesn’t support the memcache protocol, does it? Instead of talking about REST and SOAP, let’s talk about local / remote transaction contexts and the number of remote procedure calls (RPC) required for optimistic / pessimistic locking. Partitioning and hashing. JDG has implemented consistent hashing and virtual nodes. A modern solution. It uses an implementation of the excellent MurmurHash3 algorithm (link). It does not rely on dated implementation based on centralized and / or manual hashing. Does Oracle Coherence? Instead of talking about hashing, let’s talk about vector clocks.
Let’s talk about rebalancing and push / pull implementations.
Of course, that is the benefit of open source software. Users can be engineers. The can understand the implementation by studying the code. That is exactly what I did. I studied the code, I modified the code, I created and submitted patches, and I engaged in discussions with Red Hat engineers on implementations details. With proprietary software, users can only be users.
Asking how fast someone has flown will not reveal how much they know about planes.
Are you familiar with all of the projects that I have been on? I ask because I’m uncertain as to why you would describe them as “stupid POCs”. I do not think that the engineers at CBOE or any of the other organizations that I have collaborated with would appreciate you calling the work that they put into production “stupid POCs”. I know I don’t.
How am I “improving the reliability” of JBoss Data Grid by identifying the functionality that both JBoss Data Grid and Oracle Coherence provide? Do you not believe that JBoss Data Grid has implemented A LOT OF features? The functionality descibed in this post represents nearly all of the features and benefits listed in the Oracle Coherence data sheet (link). JBoss Data Grid lacks a few features provided by Oracle Coherence. Oracle Coherence lacks a few features provided by JBoss Data Grid. Would you say that Oracle Coherence has not implemented A LOT OF features because it lacks a few features provided by JBoss Data Grid?
No! It just had integrated a couple open-source existing technologies into a new ecosystem and productized in a minimum level to take some money from the customers with subscriptions. Nothing really new, innovated, creative or respectable. The type of thing Red Hat likes to do: take existing technologies, combine them and make some money.
What are these open-source, existing technologies that you are referring to? Could they be Infinispan? Of course they exist, Red Hat created them. It would be hard to productize something that does not exist. I find it both disrespectful and insulting to Red Hat engineers to describe their work as not new, innovative, creative, or respectable. You said “take existing technologies, combine them and make some money”. Interesting. Is that not what Oracle did with Tangosol Coherence? Oracle purchased their data grid. Red Hat created its data grid.
You really knows to play with words, starting with the usage of the word “nearly” :)
You forgot some key features that only Coherence has like: Elastic Data (off-heap and SSD storage of data), distributed GC against any type of storage and cache layout, ability to handle thousands of GB being able to handle even terabytes of data. Don’t came say to me that with on-heap allocation and regular JVM like HotSpot (or OpenJDK which is even worse) you could allocate terabytes of data. Native SDKs for C/C++ and .NET, Continuous Queries, support for many AppServers rather than only JBoss, integration with Java EE 6 using @Resource annotation, monitoring and management capabilities both integrated with the product and with other external tools like Enterprise Manager, integration with CEP world to enrich events and being the clustering enabled mechanism to handle fail-over scenarios, security features that could deal with scenarios of authentication, authorization, SSL and load-balancers (Eg: BigIP) integration. Pre-built filters and a powerful query language that could make easier for the developers to interact with the cache instead of force them to write Java code, support for Hibernate, Toplink, EclipseLink, GoldenGate, etc. Thousands of pre-implemented scenario patterns in the product and externally with the incubator strategy started by Tangosol and now owned by Oracle. Oh and of course: support for a high performance serialization strategy and a highly scalable TCP/IP implementation like TCMP. Not mentioning that support for InfiniBand based networks.
I admit that off-heap storage is an interesting concept. However, I question how practical it is. I would not recommend storing a TB of data on a single node with or without off-heap storage. I recommend partitioning physical servers into multiple virtual servers. It increases node portability while reducing the effects (e.g. rebalancing) of adding or removing nodes. JDG supports Java EE integration with both @resource and @inject. Does Oracle Coherence not support @inject? JDG includes management and monitoring as well. JDG clients are smart clients. It is not practical to load balance requests. Can you point me to a list of these “thousands of pre-implemented scenario patterns”? JDG supports both high performance serialization (JBoss Marshalling) and a highly scalable TCP / IP implementation (JGroups). However, JDG does not require developers to write additional code to use high performance serialization unlike Oracle Coherence and Portable Object Format (POF) (link). I will give you Infiniband, but it may not matter for long (link).
Update: I thought when you referred to off-heap storage that you referring to off-heap memory. I had not realized that “off-heap storage” is the new marketing term for “disk storage”. It turns out that Elastic Data is marketing for “overflow to disk” (link). This was a featured provided by Ehcache 10 years ago. It’s a feature provided by JDG. You mentioned 1TB of data. However, Elastic Data can only support up to 100GB per node. It is not a persistence solution. It does not support eviction. It should not be used with aggregation (i.e. map / reduce) or entry processors.
All of the “unique” features provided by JDG are not considered by real customers, independent analysts like Gartner, Forrester and IDC as really important. Are features that just align with the Red Hat strategy to force its entrance in the Big Data world, which on the other hand is a terrible strategy because to a real Big Data strategy Red Hat lacks A LOT OF technology stacks compared with real Big Data vendors like Oracle, EMC and IBM. Just an example, even Oracle does not consider Coherence as its Big Data strategy. When Oracle talk about Coherence, they’re talking about caching, grid and in-memory computing scenarios, which fits perfectly to elastic data grid technologies.
It find it funny that justify the lack of features by stating that they are not important to analysts. Will you go on record as stating that Oracle Coherence will never implement JDG features that it lacks? Software evolves. New features become standard features. Personally, I think data grids will continue to incorporate features provided by NoSQL implementations. Eventual consistency comes to mind. What is a “real’ customer”? Is there another kind of customer? What do these features have to do with Red Hat’s big data strategy? Did you see the Red Hat big data announcement (link)? It was quite clear on what is and what is not our big data strategy. Just as Coherence is not Oracle’s big data strategy, JDG is not Red Hat’s big data strategy. We too place our data grid in the context of in-memory distributed data and parallel processing. I would expect a data grid to “fit perfectly to elastic data grid technologies” as it is, after all, a data grid and one of the defining characteristics of a data grid is that it is elastic. However, there is some overlap between in-memory data grids, NoSQL, and big data platforms. They all distribute data and implement parallel processing. The provide data locality. As such, in-memory data grids fit perfectly inside of broader big data solutions.
"delaPlex is a software development company. We do team-based outsourcing development," explained Mark Rivers, COO and Co-founder of delaPlex Software, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York City, NY.
Jun. 28, 2016 07:30 PM EDT Reads: 325
IoT is rapidly changing the way enterprises are using data to improve business decision-making. In order to derive business value, organizations must unlock insights from the data gathered and then act on these. In their session at @ThingsExpo, Eric Hoffman, Vice President at EastBanc Technologies, and Peter Shashkin, Head of Development Department at EastBanc Technologies, discussed how one organization leveraged IoT, cloud technology and data analysis to improve customer experiences and effi...
Jun. 28, 2016 07:15 PM EDT Reads: 238
The IoT is changing the way enterprises conduct business. In his session at @ThingsExpo, Eric Hoffman, Vice President at EastBanc Technologies, discussed how businesses can gain an edge over competitors by empowering consumers to take control through IoT. He cited examples such as a Washington, D.C.-based sports club that leveraged IoT and the cloud to develop a comprehensive booking system. He also highlighted how IoT can revitalize and restore outdated business models, making them profitable ...
Jun. 28, 2016 06:30 PM EDT Reads: 262
There are several IoTs: the Industrial Internet, Consumer Wearables, Wearables and Healthcare, Supply Chains, and the movement toward Smart Grids, Cities, Regions, and Nations. There are competing communications standards every step of the way, a bewildering array of sensors and devices, and an entire world of competing data analytics platforms. To some this appears to be chaos. In this power panel at @ThingsExpo, moderated by Conference Chair Roger Strukhoff, Bradley Holt, Developer Advocate a...
Jun. 28, 2016 04:30 PM EDT Reads: 888
A strange thing is happening along the way to the Internet of Things, namely far too many devices to work with and manage. It has become clear that we'll need much higher efficiency user experiences that can allow us to more easily and scalably work with the thousands of devices that will soon be in each of our lives. Enter the conversational interface revolution, combining bots we can literally talk with, gesture to, and even direct with our thoughts, with embedded artificial intelligence, wh...
Jun. 28, 2016 02:00 PM EDT Reads: 1,132
Machine Learning helps make complex systems more efficient. By applying advanced Machine Learning techniques such as Cognitive Fingerprinting, wind project operators can utilize these tools to learn from collected data, detect regular patterns, and optimize their own operations. In his session at 18th Cloud Expo, Stuart Gillen, Director of Business Development at SparkCognition, discussed how research has demonstrated the value of Machine Learning in delivering next generation analytics to imp...
Jun. 28, 2016 01:00 PM EDT Reads: 809
The cloud market growth today is largely in public clouds. While there is a lot of spend in IT departments in virtualization, these aren’t yet translating into a true “cloud” experience within the enterprise. What is stopping the growth of the “private cloud” market? In his general session at 18th Cloud Expo, Nara Rajagopalan, CEO of Accelerite, explored the challenges in deploying, managing, and getting adoption for a private cloud within an enterprise. What are the key differences between wh...
Jun. 28, 2016 12:45 PM EDT Reads: 952
Connected devices and the industrial internet are growing exponentially every year with Cisco expecting 50 billion devices to be in operation by 2020. In this period of growth, location-based insights are becoming invaluable to many businesses as they adopt new connected technologies. Knowing when and where these devices connect from is critical for a number of scenarios in supply chain management, disaster management, emergency response, M2M, location marketing and more. In his session at @Th...
Jun. 28, 2016 12:00 PM EDT Reads: 1,136
Basho Technologies has announced the latest release of Basho Riak TS, version 1.3. Riak TS is an enterprise-grade NoSQL database optimized for Internet of Things (IoT). The open source version enables developers to download the software for free and use it in production as well as make contributions to the code and develop applications around Riak TS. Enhancements to Riak TS make it quick, easy and cost-effective to spin up an instance to test new ideas and build IoT applications. In addition to...
Jun. 28, 2016 10:30 AM EDT Reads: 478
When people aren’t talking about VMs and containers, they’re talking about serverless architecture. Serverless is about no maintenance. It means you are not worried about low-level infrastructural and operational details. An event-driven serverless platform is a great use case for IoT. In his session at @ThingsExpo, Animesh Singh, an STSM and Lead for IBM Cloud Platform and Infrastructure, will detail how to build a distributed serverless, polyglot, microservices framework using open source tec...
Jun. 28, 2016 10:00 AM EDT Reads: 446
Apixio Inc. has raised $19.3 million in Series D venture capital funding led by SSM Partners with participation from First Analysis, Bain Capital Ventures and Apixio’s largest angel investor. Apixio will dedicate the proceeds toward advancing and scaling products powered by its cognitive computing platform, further enabling insights for optimal patient care. The Series D funding comes as Apixio experiences strong momentum and increasing demand for its HCC Profiler solution, which mines unstruc...
Jun. 28, 2016 10:00 AM EDT Reads: 389
IoT offers a value of almost $4 trillion to the manufacturing industry through platforms that can improve margins, optimize operations & drive high performance work teams. By using IoT technologies as a foundation, manufacturing customers are integrating worker safety with manufacturing systems, driving deep collaboration and utilizing analytics to exponentially increased per-unit margins. However, as Benoit Lheureux, the VP for Research at Gartner points out, “IoT project implementers often ...
Jun. 28, 2016 09:45 AM EDT Reads: 444
It is one thing to build single industrial IoT applications, but what will it take to build the Smart Cities and truly society changing applications of the future? The technology won’t be the problem, it will be the number of parties that need to work together and be aligned in their motivation to succeed. In his Day 2 Keynote at @ThingsExpo, Henrik Kenani Dahlgren, Portfolio Marketing Manager at Ericsson, discussed how to plan to cooperate, partner, and form lasting all-star teams to change t...
Jun. 28, 2016 09:30 AM EDT Reads: 1,295
In his general session at 18th Cloud Expo, Lee Atchison, Principal Cloud Architect and Advocate at New Relic, discussed cloud as a ‘better data center’ and how it adds new capacity (faster) and improves application availability (redundancy). The cloud is a ‘Dynamic Tool for Dynamic Apps’ and resource allocation is an integral part of your application architecture, so use only the resources you need and allocate /de-allocate resources on the fly.
Jun. 28, 2016 08:30 AM EDT Reads: 1,221
In his keynote at 18th Cloud Expo, Andrew Keys, Co-Founder of ConsenSys Enterprise, provided an overview of the evolution of the Internet and the Database and the future of their combination – the Blockchain. Andrew Keys is Co-Founder of ConsenSys Enterprise. He comes to ConsenSys Enterprise with capital markets, technology and entrepreneurial experience. Previously, he worked for UBS investment bank in equities analysis. Later, he was responsible for the creation and distribution of life sett...
Jun. 28, 2016 08:00 AM EDT Reads: 1,344
Presidio has received the 2015 EMC Partner Services Quality Award from EMC Corporation for achieving outstanding service excellence and customer satisfaction as measured by the EMC Partner Services Quality (PSQ) program. Presidio was also honored as the 2015 EMC Americas Marketing Excellence Partner of the Year and 2015 Mid-Market East Partner of the Year. The EMC PSQ program is a project-specific survey program designed for partners with Service Partner designations to solicit customer feedbac...
Jun. 28, 2016 08:00 AM EDT Reads: 474
Cloud computing is being adopted in one form or another by 94% of enterprises today. Tens of billions of new devices are being connected to The Internet of Things. And Big Data is driving this bus. An exponential increase is expected in the amount of information being processed, managed, analyzed, and acted upon by enterprise IT. This amazing is not part of some distant future - it is happening today. One report shows a 650% increase in enterprise data by 2020. Other estimates are even higher....
Jun. 26, 2016 05:00 PM EDT Reads: 1,349
SYS-CON Events announced today that Bsquare has been named “Silver Sponsor” of SYS-CON's @ThingsExpo, which will take place on November 1–3, 2016, at the Santa Clara Convention Center in Santa Clara, CA. For more than two decades, Bsquare has helped its customers extract business value from a broad array of physical assets by making them intelligent, connecting them, and using the data they generate to optimize business processes.
Jun. 26, 2016 05:00 PM EDT Reads: 1,260
Internet of @ThingsExpo, taking place November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with 19th Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world. The Internet of Things (IoT) is the most profound change in personal and enterprise IT since the creation of the Worldwide Web more than 20 years ago. All major researchers estimate there will be tens of billions devices - comp...
Jun. 26, 2016 04:00 PM EDT Reads: 1,313
19th Cloud Expo, taking place November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, will feature technical sessions from a rock star conference faculty and the leading industry players in the world. Cloud computing is now being embraced by a majority of enterprises of all sizes. Yesterday's debate about public vs. private has transformed into the reality of hybrid cloud: a recent survey shows that 74% of enterprises have a hybrid cloud strategy. Meanwhile, 94% of enterpri...
Jun. 26, 2016 04:00 PM EDT Reads: 1,394