Apache Developer's Journal

Imagine my surprise when reading the March 28, 2016 issue of BusinessWeek and stumbling across the article titled “Lies, Damned Lies, and More Statistics.” In the article, "BusinessWeek" warned readers to beware of “p-hacking,” which is the statistical practice of tweaking data i... (more)
Unless you've been living under a rock for the past couple of years, you've been hearing about the world of Big Data nonstop. Big Data promises fortune and power to those that can wield the somewhat mystical and often nebulous power of "Big Data". Unfortunately for the rest of us... (more)
Impetus Technologies has announced that StreamAnalytix™ has been selected by Hortonworks® as a complementary solution to provide real-time streaming capabilities for the new version of Hortonworks DataFlow (HDF™). HDF powered by Apache NiFi is an integrated platform to collect, co... (more)
Three Trends Behind the Movement to Real-Time Big Data By Ashley Stirrup As I read through the various 2016 technology predictions lists, I started discovering a common theme: the inclusion of real-time data initiatives as a main priority for IT. Real-time streaming data and analy... (more)
We can reasonably break down the core imperatives of data analytics into three essential areas. Firms that wish to gain the full potential out of engaging in data analytics today are obliged to at least recognize these rudimentary cornerstones as the central building blocks of da... (more)
IoT and Cassandra In his session at @ThingsExpo, Ben Bromhead, CTO of Instaclustr, will walk you through the basics of building an IoT-based platform leveraging Cassandra, Spark and Kafka. This session is aimed at developers, admins and DevOps engineers who have to build, run an... (more)
Putting Analytics into the Decision-Making Workflow with Apache Spark Data-driven businesses use analytics to inform and support their decisions. In many companies, marketing, sales, finance, and operations departments tend to be the earliest adopters of data analytics, with the r... (more)
Apache Spark continues to gain a lot of traction as companies launch or expand their big data initiatives. There is no doubt that it’s finding a place in corporate IT strategies. The open-source cluster computing framework was developed in the AMPLab at the University of Califor... (more)
Last June IBM made a serious commitment to the future of Apache Spark with a series of initiatives: It will offer Apache Spark as a service on Bluemix (Bluemix is an implementation of IBM's Open Cloud Architecture based on Cloud Foundry, an open source Platform as a Service (Pa... (more)
If you’re running Big Data applications, you’re going to want to look at some kind of distributed processing system. Hadoop is one of the best-known clustering systems, but how are you going to process all your data in a reasonable time frame? Apache Spark offers services that go... (more)
A little over five months ago, we turned the DevOps world on its head by turning a bunch of mild-mannered engineers and other web performance experts into ravenous, glory-seeking fiends who thought nothing of stepping on and over one another in their quest for the ultimate prize ... (more)
Apache Hadoop and NoSQL as the Analysis Engines for Internet-of-Things Data You have your devices and your data, but what about the rest of your Internet of Things story? Two popular classes of technologies that nicely handle the Big Data analytics for Internet of Things are Apac... (more)
Keep CALM And Embrace DevOps – Measurement By John Rakowski The next letter in the DevOps CALMS model brings us to M for Measurement. Focusing on the right metrics and measurements is vital if you are going to succeed with DevOps adoption. “You Never Know Where You Are Going Unti... (more)
HP today at its Big Data Conference in Boston unveiled a series of new products, services, and programs designed to help organizations better leverage data and analytics. The company announced: A new release of HP Vertica, called Excavator, that feature data streaming and advan... (more)
Operational Hadoop and the Lambda Architecture for Streaming Data Apache Hadoop is emerging as a distributed platform for handling large and fast incoming streams of data. Predictive maintenance, supply chain optimization, and Internet-of-Things analysis are examples where Hadoop... (more)
This post is the first in a series of blog posts that will explore and exploit the Big Data and analytics tools. I will walk through easy steps to start working with such tools like Apache Hadoop, Pig, Mahout and solve some problems related to analytics and learning in the large ... (more)
The enhanced adoption of Cloud, Big Data, Mobility is causing more services to be developed and aggregated, hence there is a greater emphasis on Agile for service aggregation. Agile processes have specific methods to manage the rapid development cycles and changing requirements i... (more)
The Power of Complete Install Automation It used to take months to travel across the U.S. Or any sizable landmass for that matter. One of the few really well documented wagon trains took four months to travel from Iowa to Montana… A trip that takes an airplane about four hours t... (more)
AppDynamics Monitoring Excels for Microservices; New Pricing Model Introduced It’s no news that microservices are one of the top trends, if not the top trend, in application architectures today. Take large monolithic applications which are brittle and difficult to change and brea... (more)
"DevOps is really about the business. The business is under pressure today, competitively in the marketplace to respond to the expectations of the customer. The business is driving IT and the problem is that IT isn't responding fast enough," explained Mark Levy, Senior Product M... (more)
© 2008 SYS-CON Media