Welcome!


Latest Articles from Apache Developer's Journal
The Open Group is planning a tweet jam around what it calls Platform 3.0 issues -- big data, cloud computing, the consumerization of IT and other current trends. Over recent years a number of technologies -- cloud, mobile, big data, social -- have emerged and converged to disrupt the ...
In his session at the 12th International Cloud Expo, Tony Shan, who was one of the key drivers inside IBM for the cloud reference architecture, and also helped coin "Cloud Engineering," will discuss lots of valuable best practices and lessons learned in designing cloud architecture and...
Over the past two decades relational databases have been most successful in serving large scale OLTP and OLAP applications across enterprises. However, in the past couple of years with the advent of Big Data processing, especially processing unstructured data coupled with the need for ...
While nearly 90 percent of business and IT leaders agree that big data can be useful in making intelligent business decisions, only one-third of companies have implemented big-data initiatives. Furthermore, more than 50 percent of survey respondents said that they had only lukewarm s...
BigData (and Hadoop) are buzzword and growth areas of computing; this article will distill the concepts into easy-to-understand terms. As the name implies, BigData is literally "big data" or "lots of data" that needs to be processed. Lets take a simple example: the city council of San...
Splunk, the software platform for real-time operational intelligence, and Hortonworks, the Hadoop Big Data distribution start-up, have allied so organizations can get operational intelligence using open source Apache Hadoop. Their pact means that data can be moved between Splunk Ente...
The IT industry is nothing if not a breeding ground for an infinite variety of acronyms and neologisms. Alongside cloud computing today sits the term Big Data, which of course we understand to mean “that amount” of data which a traditional database would find hard to compute and proces...
New technologies allow schools, colleges and universities to analyze absolutely everything that happens. From student behavior, testing results, career development of students as well as educational needs based on changing societies. A lot of this data has already been stored and is us...
In the coming years, big data will change the way organisations and societies are operated and managed. Big data however, is not the only trend that will impact significantly how organisations operate. Another major trend at the moment is gamification. Gamification will change the way ...
Companies around the world are collecting massive amounts of data everyday that’s sitting around and not being utilized. Take for example the fact that companies collect demographic and location-based data via mobile devices all the time, but have to figure out how to monetize that dat...
I'd like to address a recent blog post in CloudTweaks titled, "Cloudera Not Cutting It With Big Data Security." The author makes a number of very salient and valid points about Hadoop security… or lack thereof. Indeed the Apache Hadoop platform, which includes HDFS and MapReduce and o...
Adding more memory to your JVMs (Java Virtual Machines) might be a temporary solution to fixing memory leaks in Java applications, but it for sure won’t fix the root cause of the issue. Instead of crashing once per day it may just crash every other day. “Preventive” restarts are also j...
Following my high-level write-up of Hadoop and Big Data, this article will present each of the components or projects that make up Hadoop with a technical description of each. First, what is Hadoop? Hadoop stores and processes large volumes of a wide variety of data that changes rapi...
This review covers both Core Java Volume I--Fundamentals (9th Edition) and Core Java, Volume II--Advanced Features (9th Edition). Both books are part of the Prentice Hall Core Series. I actually got Volume II first and liked it so much I ordered Volume I. I felt like I was missing the...
Master Data Management (MDM) is a very important data governance aspect in enterprises whereby MDM enables the development of a "Single Version of Truth." MDM establishes Single Version of Truth by providing common descriptions for enterprise-wide entities. Need for MDM in Big Data Pr...
Infor, the software company where former Larry Ellison lieutenant Chuck Phillips went after he was bounced out of Oracle to make room for ex-HP CEO Mark Hurd, is working on a Big Data initiative called Sky Vault that will leverage its own ION Business Vault and Amazon’s Redshift gussie...
Over the last couple of months I have been talking to more and more customers who are either bringing their Hadoop clusters into production or have already done so and are now getting serious about operations. This leads to some interesting discussions about how to monitor Hadoop prope...
Big Data has made a huge splash in the enterprise world, but the legal and risk management implications are seldom discussed. It’s critical for businesses to assess these issues and develop a proactive strategy to protect the enterprise from costly errors. Corporate policies and regu...
Hibernate is one of the most used ORM Java frameworks out there. It is really simple to use, just add few annotations and you’re ready to go. However, it is also really easy to experience strange behaviors and bugs if you don’t respect Hibernate’s best practices. That’s why at Tocea we...
Last week, I presented Caching Up and Down the Stack at the Boston Web Performance meetup. It was great to get the chance to present to the 60+ people who came out for the talk. Unsurprisingly, many of the people there knew a lot about caching in all of the different levels I touched o...
Big Data start-ups are like ants at a church picnic. They’re everywhere and they’re getting fed. Guavus, which is focused on tier-one communications service providers and has kept a low profile for the past seven years, has just raised another $9 million, which makes a whopping $87 m...
The Hadoop framework and SSD technology augment cloud data systems ranging from analytics to on-line transaction processing (OLTP) to data warehousing. The resulting balance of processing, networking, SSD storage, and Hadoop optimization results in improving Big Data sort responsivenes...
Federal agencies spent close to $4.9 billion on Big Data resources during fiscal year 2012 and that number could grow to $5.7 billion in 2014. Research firm Deltek estimates federal big data spending will grow to $7.2 billion by 2017 as agencies strive to handle ever-increasing volume...
Enterprises can't close their doors just because integration tools won't cope with the volume of information that their systems produce. As each day goes by, their information will become larger and more complicated and enterprises must constantly struggle to manage the integration of ...
Xyratex, the UK data storage house that spun out of IBM in a management buy-out 20 years ago, is now a strategic supplier to AMD, which picked the Brit’s OneStor Modular Enclosure as a building block for its Big Data and storage-intensive solutions. The Xyratex widgetry has been opti...
If you’re not convinced the hype around Big Data is entirely justified, consider the following statistics: in the time it takes an average person to read this article, 72 petabytes of data (that’s 72 x 1015 if you’re counting) will have been added to the global information pool. Each h...
The Apache Software Foundation (ASF) has made CloudStack, which it took in as an incubator project a year ago, a Top-Level Program (TLP). That’s supposed to mean that Citrix, which donated the IaaS OpenStack rival to Apache after acquiring it from Cloud.com in 2011, doesn’t have much...
The term Big Data is going to become a key part of the forward-looking business technology debate among informed, proactive and ICT savvy executives. But what's really driving the growing demand for meaningful solutions? While most companies are collecting, storing and analyzing data,...
Technology has been taken over by the Cloud. Putting it in simple terms, ‘Cloud’ is the new meme used to describe nirvana from clogged up computers and saving files directly to the internet. So how does it work? Essentially, instead of saving files to a hard disk or using software on ...
After two years in development and six months in private beta, Platfora, the native in-memory Business Intelligence (BI) platform for Hadoop, has gone on sale. The widgetry is meant to put business users directly in touch with Big Data and eliminate the need for complex and expensive...
inovex GmnbH has just announced its partnership with C12G Labs to support the deployment and operation of OpenNebula-based enterprise cloud infrastructures, and the automation and migration of legacy systems. With OpenNebula as their software platform, inovex experts can set up cloud i...
Today's applications need fast access to data for maximum performance. In his session at the 12th International Cloud Expo, Dr. William L. Bain, founder and CEO of ScaleOut Software, will discuss how in-memory data grids combine distributed caching with powerful in-memory analysis an...
How do you work with a remote product owner who is in a different time zone with very little overlap of normal working hours? An agile puritan would have a simple answer – Don’t. There is an underlying assumption behind this statement. The assumption is that the product owner can pro...
MapR Technologies, the Hadoop technology concern, has gotten a $30 million C round of financing led by new investor Mayfield Fund. Existing backers Lightspeed Venture Partners, NEA and Redpoint Ventures participated. The new money brings total funding to $59 million. The new financing...
Concerns are raised every once in a while in the broader free and open source software community about freeloaders. The attitude expressed is that if you're getting the benefit of FOSS, you should contribute. Building a business on a FOSS project you don't own, whether you're providi...
Big Data applications such as Hadoop and Hive are becoming more widely adopted and mainstream. There is an increasing number of users who will select the cloud – whether private or public – as an efficient and scalable deployment vehicle for these large-scale distributed apps. Hadoop i...
BMC Software this week launched MyIT, an enterprise IT help desk solution that empowers employees to take more personal control over their IT services and to get the right type of help they need -- anytime, anywhere, from any device. Frustration with company IT departments is a widely...
Big Data – a large amount of information that comes in a variety of forms and constantly changes – has generated a significant amount of buzz in the business world, mostly around the implications for marketing. But there’s little attention paid to its potential impact on risk managemen...
VMware’s top management vented at Amazon last week during the company’s annual Partner Exchange conference in Las Vegas, telling the audience that they’re all doomed if the corporate world moves to Amazon’s infrastructure. “We want to own the corporate workload,” CEO Pat Gelsinger sa...
In the last couple of years Hadoop has become synonymous with Big Data. This framework is so vast and popular that Microsoft recently announced, for the first time in its history, that it is going to invest in this large-scale, open-source project as its solution for Big Data. In his ...