Click here to close now.


Apache Authors: Jim Scott, Liz McMillan, Craig Lowell, AppDynamics Blog, Dana Gardner

Related Topics: @BigDataExpo, Java IoT, Microservices Expo, Microsoft Cloud, Containers Expo Blog, @CloudExpo, Apache

@BigDataExpo: Blog Feed Post

Big Data | Thinking Outside the Firewall

Big Data is a by-product of the Internet and the ever increasing power of computers

A few months back, Gartner placed big data at the peak of its hype cycle for cloud computing, meaning most big data products are solutions looking for a problem. I always find this bad entrepreneurial habit to be one of the most frustrating of our industry. Having recently joined Meltwater as head of marketing and product (BTW Meltwater is hiring marketing and product managers!), I think a lot about big data and how to unleash it’s value to solve important business problems, because that is our business. How does big data go from “so what” to “must have”?

The Big Data Challenge
Big data is a by-product of the Internet and the ever increasing power of computers. Kind of like petroleum sludge. We know there must be great value buried within this vast, raw resource, but the challenge lies in figuring out how to turn it into something useful like plastic, or the other thousands of petroleum products that we produce from the 20% of crude oil that can’t be turned into fuel.

big data by product

This is no small feat. I can confidently predict that there will be no shortage of well-intentioned, well-funded start-ups that fail to live up to this challenge, producing varying versions of gift-wrapped sludge that never quite deliver on the promises of their pitches. Overcoming the hype and producing real value from big data requires much more than data-processing infrastructure. It requires a laser-like focus on creating order-of-magnitude improvements to how we work and live.

Reengineering Across the Firewall
More than a year ago, McKinsey and Company predicted that big data would be “The Next Frontier of Innovation, Competition and Productivity.” Now, I’m not generally one to argue with the likes of McKinsey, especially in this case as I happen to agree with it. If you have the time, I highly recommend checking out the report. At 156 pages, however, it can be a little hard to digest, so I thought I’d fearlessly attempt to boil it down to a blog post by sharinig a little of how we think about the Big Data Challenge @Meltwater.

big data meltwater reengineering

Big data implies a shift in real-time access to valuable information outside the firewall. It offers the opportunity to reengineer business processes that cross the firewalland that benefit greatly from this information, such as competitive strategy, sales, customer support, vendor management, employee recruiting, etc.

Cloud-based businesses create value in one of two ways: lowering TCO (cost advantage) and network-enabled innovation (differentiation). Most early-entry, enterprise SaaS applications like are really not that different from their on-premise counterparts in feature and function. They rely on lower TCO as the primary driver of adoption, and while they expand the market through lower prices to SMBs, they are locked into a replacement battle against on-premise software leaders like Oracle, SAP and Microsoft. However, there are a handful of truly innovative enterprise SaaS and cloud categories that mine the Internet for all it’s worth and have no on-premise equivalent, such as search, media monitoring, marketing automation, Web analytics, social media marketing, human capital management, and cloud integration. These categories all have one thing in common, they reengineer business processes that cross the firewall by leveraging data outside the firewall.

Reengineering is a business term that gained popularity in the early nineties as client-server washed away mainframe applications en masse. The idea was to leverage the new technology to redesign business processes for dramatic gains in productivity, as opposed to just upgrading legacy systems. Fast-forward 20 years and we find ourselves in a similar situation. We’ve spent billions of dollars on ERP, CRM, BI and countless other software acronyms to automate every last internal business process. Inside the firewall, there is very little left to do.

The vision of big data should not be an upgrade, like the next generation of enterprise business intelligence, only bigger. Big data is a fundamental shift in real-time access to valuable information outside the firewall. It offers the opportunity to reengineer business processes that cross the firewall and that benefit greatly from this information, such as competitive strategy, sales, customer support, vendor management, employee recruiting, etc. For example, it is one of the great ironies of enterprise software that in most companies customers never touch the customer relationship management system.

Social Business and Big Data
If any emerging category can claim more hype than big data, it is social business. This is no accident. Social business and big data are inexorably linked. Big social data empowers social business. Take the case of the social community manager, a new and evolving social business role. The social community manager must engage in a dialog with community members that is personal and relevant. Yet at the same time, the social community manager must sift through millions of online conversations to zero in on specific opportunities for personalized social engagement. Enter big data. Let’s redraw the above diagram for the social community manager.

social community management meltwater

The social community manager needs a social mission control panel to digest the vast amounts of big social data and zero in on the conversations, channels, and community members that require immediate attention. (At Meltwater, we like to call this Meltwater Buzz 3.0 which launched today! ;) )

The social community manager is engaged in the business process of building a social community. To accomplish this daunting task, the social community manager must reach across the firewall to crunch a lot of big data and engage community members in in real-time. The process goes something like this:

  1. Gather big data outside the firewall about the social community
  2. Develop a social campaign strategy informed by insights from the data
  3. Create a social campaign plan of execution
  4. Reach across the firewall to engage the social community
  5. Rinse and repeat in real-time

I think social community management is a lot like air traffic control, including the potential for social media disasters when information, systems or social community managers are not up to the task. The social community manager must digest vast amounts of big data to find the one conversation or one community member that requires immediate attention. The social community manager doesn’t need raw big data. The social community manager needs a social mission control panel to digest the vast amounts of big social data and zero in on the conversations, channels, and community members that require immediate attention.

Big Data and The Cloud | A Match Made in Heaven
The fact that big data originates largely as a by product of the Internet, and the fact that it is, well, big, lead to the natural conclusion that like big data itself, big data-based solutions are best situated in the cloud. It will be the rare global 500 company that is both big enough and motivated enough to house and sift through the mountains of data available out there to build on-premise big data analytics and automation. Economies-of-scale will rule in the aggregation, enrichment and processing of big data with most businesses interested in paying only for results, i.e., insights that can be used to reengineer business processes across the firewall for order-of-magnitude improvements in productivity and service.

Read the original blog entry...

More Stories By Joel York

Joel York is an Internet software executive and popular SaaS / Cloud blogger at Chaotic Flow and Cloud Ave. He is well known for his work in SaaS / cloud business models, sales and marketing strategy, and financial metrics. Professionally, he has managed global sales and marketing organizations serving over 50 countries, including local offices in the United States, United Kingdom, Germany, and India. He holds degrees in physics from Caltech and Cornell and received his MBA from the University of Chicago. Joel York is currently VP Marketing at Meltwater Group and Principal at the Internet startup consulting firm affinitos.

@ThingsExpo Stories
SYS-CON Events announced today that Sandy Carter, IBM General Manager Cloud Ecosystem and Developers, and a Social Business Evangelist, will keynote at the 17th International Cloud Expo®, which will take place on November 3–5, 2015, at the Santa Clara Convention Center in Santa Clara, CA.
The IoT market is on track to hit $7.1 trillion in 2020. The reality is that only a handful of companies are ready for this massive demand. There are a lot of barriers, paint points, traps, and hidden roadblocks. How can we deal with these issues and challenges? The paradigm has changed. Old-style ad-hoc trial-and-error ways will certainly lead you to the dead end. What is mandatory is an overarching and adaptive approach to effectively handle the rapid changes and exponential growth.
The IoT is upon us, but today’s databases, built on 30-year-old math, require multiple platforms to create a single solution. Data demands of the IoT require Big Data systems that can handle ingest, transactions and analytics concurrently adapting to varied situations as they occur, with speed at scale. In his session at @ThingsExpo, Chad Jones, chief strategy officer at Deep Information Sciences, will look differently at IoT data so enterprises can fully leverage their IoT potential. He’ll share tips on how to speed up business initiatives, harness Big Data and remain one step ahead by apply...
WebRTC converts the entire network into a ubiquitous communications cloud thereby connecting anytime, anywhere through any point. In his session at WebRTC Summit,, Mark Castleman, EIR at Bell Labs and Head of Future X Labs, will discuss how the transformational nature of communications is achieved through the democratizing force of WebRTC. WebRTC is doing for voice what HTML did for web content.
Today air travel is a minefield of delays, hassles and customer disappointment. Airlines struggle to revitalize the experience. GE and M2Mi will demonstrate practical examples of how IoT solutions are helping airlines bring back personalization, reduce trip time and improve reliability. In their session at @ThingsExpo, Shyam Varan Nath, Principal Architect with GE, and Dr. Sarah Cooper, M2Mi's VP Business Development and Engineering, will explore the IoT cloud-based platform technologies driving this change including privacy controls, data transparency and integration of real time context w...
"Matrix is an ambitious open standard and implementation that's set up to break down the fragmentation problems that exist in IP messaging and VoIP communication," explained John Woolf, Technical Evangelist at Matrix, in this interview at @ThingsExpo, held Nov 4–6, 2014, at the Santa Clara Convention Center in Santa Clara, CA.
WebRTC has had a real tough three or four years, and so have those working with it. Only a few short years ago, the development world were excited about WebRTC and proclaiming how awesome it was. You might have played with the technology a couple of years ago, only to find the extra infrastructure requirements were painful to implement and poorly documented. This probably left a bitter taste in your mouth, especially when things went wrong.
The broad selection of hardware, the rapid evolution of operating systems and the time-to-market for mobile apps has been so rapid that new challenges for developers and engineers arise every day. Security, testing, hosting, and other metrics have to be considered through the process. In his session at Big Data Expo, Walter Maguire, Chief Field Technologist, HP Big Data Group, at Hewlett-Packard, will discuss the challenges faced by developers and a composite Big Data applications builder, focusing on how to help solve the problems that developers are continuously battling.
Nowadays, a large number of sensors and devices are connected to the network. Leading-edge IoT technologies integrate various types of sensor data to create a new value for several business decision scenarios. The transparent cloud is a model of a new IoT emergence service platform. Many service providers store and access various types of sensor data in order to create and find out new business values by integrating such data.
There are so many tools and techniques for data analytics that even for a data scientist the choices, possible systems, and even the types of data can be daunting. In his session at @ThingsExpo, Chris Harrold, Global CTO for Big Data Solutions for EMC Corporation, will show how to perform a simple, but meaningful analysis of social sentiment data using freely available tools that take only minutes to download and install. Participants will get the download information, scripts, and complete end-to-end walkthrough of the analysis from start to finish. Participants will also be given the pract...
WebRTC: together these advances have created a perfect storm of technologies that are disrupting and transforming classic communications models and ecosystems. In his session at WebRTC Summit, Cary Bran, VP of Innovation and New Ventures at Plantronics and PLT Labs, will provide an overview of this technological shift, including associated business and consumer communications impacts, and opportunities it may enable, complement or entirely transform.
SYS-CON Events announced today that Dyn, the worldwide leader in Internet Performance, will exhibit at SYS-CON's 17th International Cloud Expo®, which will take place on November 3-5, 2015, at the Santa Clara Convention Center in Santa Clara, CA. Dyn is a cloud-based Internet Performance company. Dyn helps companies monitor, control, and optimize online infrastructure for an exceptional end-user experience. Through a world-class network and unrivaled, objective intelligence into Internet conditions, Dyn ensures traffic gets delivered faster, safer, and more reliably than ever.
WebRTC services have already permeated corporate communications in the form of videoconferencing solutions. However, WebRTC has the potential of going beyond and catalyzing a new class of services providing more than calls with capabilities such as mass-scale real-time media broadcasting, enriched and augmented video, person-to-machine and machine-to-machine communications. In his session at @ThingsExpo, Luis Lopez, CEO of Kurento, will introduce the technologies required for implementing these ideas and some early experiments performed in the Kurento open source software community in areas ...
Too often with compelling new technologies market participants become overly enamored with that attractiveness of the technology and neglect underlying business drivers. This tendency, what some call the “newest shiny object syndrome,” is understandable given that virtually all of us are heavily engaged in technology. But it is also mistaken. Without concrete business cases driving its deployment, IoT, like many other technologies before it, will fade into obscurity.
Who are you? How do you introduce yourself? Do you use a name, or do you greet a friend by the last four digits of his social security number? Assuming you don’t, why are we content to associate our identity with 10 random digits assigned by our phone company? Identity is an issue that affects everyone, but as individuals we don’t spend a lot of time thinking about it. In his session at @ThingsExpo, Ben Klang, Founder & President of Mojo Lingo, will discuss the impact of technology on identity. Should we federate, or not? How should identity be secured? Who owns the identity? How is identity ...
The buzz continues for cloud, data analytics and the Internet of Things (IoT) and their collective impact across all industries. But a new conversation is emerging - how do companies use industry disruption and technology enablers to lead in markets undergoing change, uncertainty and ambiguity? Organizations of all sizes need to evolve and transform, often under massive pressure, as industry lines blur and merge and traditional business models are assaulted and turned upside down. In this new data-driven world, marketplaces reign supreme while interoperability, APIs and applications deliver un...
Electric power utilities face relentless pressure on their financial performance, and reducing distribution grid losses is one of the last untapped opportunities to meet their business goals. Combining IoT-enabled sensors and cloud-based data analytics, utilities now are able to find, quantify and reduce losses faster – and with a smaller IT footprint. Solutions exist using Internet-enabled sensors deployed temporarily at strategic locations within the distribution grid to measure actual line loads.
The Internet of Everything is re-shaping technology trends–moving away from “request/response” architecture to an “always-on” Streaming Web where data is in constant motion and secure, reliable communication is an absolute necessity. As more and more THINGS go online, the challenges that developers will need to address will only increase exponentially. In his session at @ThingsExpo, Todd Greene, Founder & CEO of PubNub, will explore the current state of IoT connectivity and review key trends and technology requirements that will drive the Internet of Things from hype to reality.
The Internet of Things (IoT) is growing rapidly by extending current technologies, products and networks. By 2020, Cisco estimates there will be 50 billion connected devices. Gartner has forecast revenues of over $300 billion, just to IoT suppliers. Now is the time to figure out how you’ll make money – not just create innovative products. With hundreds of new products and companies jumping into the IoT fray every month, there’s no shortage of innovation. Despite this, McKinsey/VisionMobile data shows "less than 10 percent of IoT developers are making enough to support a reasonably sized team....
You have your devices and your data, but what about the rest of your Internet of Things story? Two popular classes of technologies that nicely handle the Big Data analytics for Internet of Things are Apache Hadoop and NoSQL. Hadoop is designed for parallelizing analytical work across many servers and is ideal for the massive data volumes you create with IoT devices. NoSQL databases such as Apache HBase are ideal for storing and retrieving IoT data as “time series data.”