Welcome!

Apache Authors: Gil Allouche, Liz McMillan, William Schmarzo, Christopher Harrold, Elizabeth White

Blog Feed Post

Cloudera Strengthens Hadoop Security with Acquisition of Gazzang: Builds on additional community efforts to deliver end-to-end security offering

By

One thing I really love about being in the technology field is watching things get done that just a short while ago seemed impossible. I felt that way again when reading the press release below.  In the early days of production systems built around Apache Hadoop, security was only possible by limiting access to your cluster. Later, more and more security related capabilities were added, including better access control, authentication, auditing, and data provenance. Many players delivered niche solutions for encrypting data, but not so long ago most solutions I saw introduced new weaknesses for each solution.  Then some very positive things started happening.  One is Intel corporation started a deep focus on enhanced security, including creating an open source community activity that leveraged smart design that could leverage Intel Data Protection Technology with AES-NI (Project Rhino) in 2013. Cloudera continued to focus on security and find-grain access control with capabilities like Sentry.  Another very positive development was the application of engineering and security talent by an amazing firm named Gazzang. One of the big advances from Gazzang: well engineered key management.

The news below is the product of many of these factors plus the vision and leadership of very smart people at Gazzang, Intel and Cloudera. The result– something that was absolutely impossible just a few years ago, is now achievable. Security still takes forethought, but the fact that well engineered end to end encryption is now possible is a dramatically positive step.

From: http://ctolink.us/Tbddag

Cloudera Strengthens Hadoop Security with Acquisition of Gazzang

Jun/03/2014

Combines Apache Sentry and Intel’s Project Rhino with Gazzang’s Encryption and Key Management to Build the Industry’s Most Robust End-to-End Security Offering for Hadoop Environments

PALO ALTO, Calif. – June 3, 2014 – Cloudera, a leader in enterprise analytic data management powered by Apache Hadoop™, today announced that it has acquired Gazzang, the big data security experts, to dramatically strengthen its security offerings, building on the roadmap laid out last year when Cloudera first delivered Sentry. Terms of the deal were not disclosed.

The addition will immediately deliver enterprise-grade data encryption and key management, addressing head on the challenges associated with securing and processing sensitive and legally protected data within the Hadoop ecosystem. Thus fulfilling a requirement in myriad compliance regulations like HIPAA-HITECH, PCI-DSS, FERPA and the EU Data Protection Directive.

While Cloudera customers will continue to have a choice of a broad range of cross-platform data protection methods available from Cloudera partners, Cloudera now offers encryption for all data-at-rest stored inside the Hadoop cluster – using an approach that is transparent to applications using the data, thereby minimizing the costs associated with enabling encryption.

Cloudera plans to focus the efforts of the Gazzang team on additional security challenges in Hadoop. The team will become the heart of the Cloudera Center for Security Excellence focusing exclusively on Hadoop security. The Center will focus on:

    • Comprehensive data and cluster security technologies - including “follow the data” authorization and encryption policies riding on Cloudera’s data lineage tracking capabilities.
    • Security testing and certification - including continuous vulnerability assessment, performance optimization, and developing regulatory compliance playbooks.
    • Security ecosystem partner enablement - developing security integration APIs and certifying partner products.

In addition to immediately providing a transparent data-at-rest encryption and key management solution to enterprise customers – addressing one of the biggest gaps in Hadoop security – Cloudera, Intel and Gazzang form a powerful team of big data security and silicon performance optimization expertise that will improve security in core Hadoop through the open source community.

Cloudera is continuing to invest broadly in the open source community to support and accelerate security features into project Rhino—an open source effort founded by Intel in early 2013. Project Rhino is a broad based open source security architecture addressing many of the major pillars of enterprise security including: perimeter security, entitlements and access control and data protection.

“Data security is no longer a checkbox for IT organizations or operations departments, it has become a top business priority,” said Tom Reilly, chief executive officer, Cloudera. “At the same time compliance requirements for protecting data continue to expand in scope where data access comes under scrutiny. We’re entering a whole new era with the rise of the Industrial Internet and the Internet of Things where there is vastly more data being streamed from billions of devices. Centralizing and accessing that net-new data to unlock its value is therefore a challenge when you consider the security requirements. That’s what we’re solving now.”

Simplifying the process of injecting core security features such as encryption and key management into highly scalable environments will enable customers to move beyond test and development workloads to real-world implementations much more quickly and easily. For example, companies that are weighing the value of putting workloads in public cloud environments against security concerns will now be able to move forward by putting in place additional process-based access controls. This limits access to encrypted data only to authorized system functions – rather than specific users or roles – so a cloud administrator, who likely does not need access to the sensitive encrypted data, cannot run commands that grant them access. This is critical for compliance initiatives that require organizations to restrict data access based on “business need to know.”

“Enterprises are adopting big data solutions, despite what some mainstream press has stated, but only when they can address data security and compliance requirements. That Cloudera can now address the enterprise’s most critical security requirement — data encryption — directly into the platform is a big win for security-sensitive customers,” said Adrian Lane of the analyst firm Securosis. “What’s more, Gazzang’s transparent form of encryption scales right along with NoSQL clusters, so Cloudera customers get data security at big data scale. This is an astute acquisition by Cloudera.”

Today a rapidly growing number of large enterprises are building enterprise data hubs built on Hadoop to address a wide variety of data challenges and increasingly to work with data in more ways, not only for processing and archiving, but now for self-service BI and advanced analytics. The success of Hadoop has also drawn the attention of big, established players in the market, including most leading enterprise software companies. Many with decades of experience serving large and demanding customers now are building out software and systems that incorporate Hadoop.

Cloudera has driven enterprise capabilities and more power into the Hadoop platform than any other company as evidenced by the incorporation of real- time query with its open source Cloudera Impala; real-time search support with Lucene and Solr; security with Cloudera’s Apache Sentry project; integrated governance, compliance, reporting and disaster recovery—all on to the Hadoop platform.

Cloudera plans to incorporate Gazzang’s technology into its Cloudera Enterprise offering. Existing customers will benefit immediately as the new products become part of the company’s existing offering. Cloudera will provide support for the Gazzang customer base.

 

About Gazzang

Gazzang provides data security solutions and expertise to help enterprises protect sensitive information and maintain performance in big data and cloud environments. Our technology enables SaaS vendors, health care organizations, financial institutions, public sector agencies and more to meet regulatory compliance initiatives, secure personally identifiable information and prevent unauthorized access to sensitive data and systems. The company is headquartered in Austin, Texas and backed by Austin Ventures and Silver Creek Ventures.

About Cloudera

Cloudera is revolutionizing enterprise data management by offering the first unified Platform for Big Data, an enterprise data hub built on Apache Hadoop™. Cloudera offers enterprises one place to store, process and analyze all their data, empowering them to extend the value of existing investments while enabling fundamental new ways to derive value from their data. Only Cloudera offers everything needed on a journey to an enterprise data hub, including software for business critical data challenges such as storage, access, management, analysis, security and search. As the leading educator of Hadoop professionals, Cloudera has trained over 22,000 individuals worldwide. Over 1,000 partners and a seasoned professional services team help deliver greater time to value. Finally, only Cloudera provides proactive and predictive support to run an enterprise data hub with confidence. Leading organizations in every industry plus top public sector organizations globally run Cloudera in production.www.cloudera.com

Connect with Cloudera

Read our blogs: http://www.cloudera.com/blog/ andhttp://vision.cloudera.com/

Follow us on Twitter:http://twitter.com/cloudera

Visit us on Facebook:http://www.facebook.com/cloudera

Cloudera, Cloudera’s Platform for Big Data, Cloudera Enterprise Data Hub Edition, Cloudera Enterprise Flex Edition, Cloudera Enterprise Basic Edition and CDH are trademarks or registered trademarks of Cloudera Inc. in the United States, and in jurisdictions throughout the world. All other company and product names may be trade names or trademarks of their respective owners.

Read the original blog entry...

More Stories By Bob Gourley

Bob Gourley writes on enterprise IT. He is a founder and partner at Cognitio Corp and publsher of CTOvision.com

@ThingsExpo Stories
Amazon has gradually rolled out parts of its IoT offerings in the last year, but these are just the tip of the iceberg. In addition to optimizing their back-end AWS offerings, Amazon is laying the ground work to be a major force in IoT – especially in the connected home and office. Amazon is extending its reach by building on its dominant Cloud IoT platform, its Dash Button strategy, recently announced Replenishment Services, the Echo/Alexa voice recognition control platform, the 6-7 strategic...
For basic one-to-one voice or video calling solutions, WebRTC has proven to be a very powerful technology. Although WebRTC’s core functionality is to provide secure, real-time p2p media streaming, leveraging native platform features and server-side components brings up new communication capabilities for web and native mobile applications, allowing for advanced multi-user use cases such as video broadcasting, conferencing, and media recording.
IoT generates lots of temporal data. But how do you unlock its value? You need to discover patterns that are repeatable in vast quantities of data, understand their meaning, and implement scalable monitoring across multiple data streams in order to monetize the discoveries and insights. Motif discovery and deep learning platforms are emerging to visualize sensor data, to search for patterns and to build application that can monitor real time streams efficiently. In his session at @ThingsExpo, ...
Verizon Communications Inc. (NYSE, Nasdaq: VZ) and Yahoo! Inc. (Nasdaq: YHOO) have entered into a definitive agreement under which Verizon will acquire Yahoo's operating business for approximately $4.83 billion in cash, subject to customary closing adjustments. Yahoo informs, connects and entertains a global audience of more than 1 billion monthly active users** -- including 600 million monthly active mobile users*** through its search, communications and digital content products. Yahoo also co...
There will be new vendors providing applications, middleware, and connected devices to support the thriving IoT ecosystem. This essentially means that electronic device manufacturers will also be in the software business. Many will be new to building embedded software or robust software. This creates an increased importance on software quality, particularly within the Industrial Internet of Things where business-critical applications are becoming dependent on products controlled by software. Qua...
In addition to all the benefits, IoT is also bringing new kind of customer experience challenges - cars that unlock themselves, thermostats turning houses into saunas and baby video monitors broadcasting over the internet. This list can only increase because while IoT services should be intuitive and simple to use, the delivery ecosystem is a myriad of potential problems as IoT explodes complexity. So finding a performance issue is like finding the proverbial needle in the haystack.
Machine Learning helps make complex systems more efficient. By applying advanced Machine Learning techniques such as Cognitive Fingerprinting, wind project operators can utilize these tools to learn from collected data, detect regular patterns, and optimize their own operations. In his session at 18th Cloud Expo, Stuart Gillen, Director of Business Development at SparkCognition, discussed how research has demonstrated the value of Machine Learning in delivering next generation analytics to imp...
Large scale deployments present unique planning challenges, system commissioning hurdles between IT and OT and demand careful system hand-off orchestration. In his session at @ThingsExpo, Jeff Smith, Senior Director and a founding member of Incenergy, will discuss some of the key tactics to ensure delivery success based on his experience of the last two years deploying Industrial IoT systems across four continents.
The Internet of Things will challenge the status quo of how IT and development organizations operate. Or will it? Certainly the fog layer of IoT requires special insights about data ontology, security and transactional integrity. But the developmental challenges are the same: People, Process and Platform. In his session at @ThingsExpo, Craig Sproule, CEO of Metavine, demonstrated how to move beyond today's coding paradigm and shared the must-have mindsets for removing complexity from the develo...
The 19th International Cloud Expo has announced that its Call for Papers is open. Cloud Expo, to be held November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, brings together Cloud Computing, Big Data, Internet of Things, DevOps, Digital Transformation, Microservices and WebRTC to one location. With cloud computing driving a higher percentage of enterprise IT budgets every year, it becomes increasingly important to plant your flag in this fast-expanding business opportuni...
Basho Technologies has announced the latest release of Basho Riak TS, version 1.3. Riak TS is an enterprise-grade NoSQL database optimized for Internet of Things (IoT). The open source version enables developers to download the software for free and use it in production as well as make contributions to the code and develop applications around Riak TS. Enhancements to Riak TS make it quick, easy and cost-effective to spin up an instance to test new ideas and build IoT applications. In addition to...
IoT is rapidly changing the way enterprises are using data to improve business decision-making. In order to derive business value, organizations must unlock insights from the data gathered and then act on these. In their session at @ThingsExpo, Eric Hoffman, Vice President at EastBanc Technologies, and Peter Shashkin, Head of Development Department at EastBanc Technologies, discussed how one organization leveraged IoT, cloud technology and data analysis to improve customer experiences and effi...
Internet of @ThingsExpo, taking place November 1-3, 2016, at the Santa Clara Convention Center in Santa Clara, CA, is co-located with the 19th International Cloud Expo and will feature technical sessions from a rock star conference faculty and the leading industry players in the world and ThingsExpo Silicon Valley Call for Papers is now open.
"We've discovered that after shows 80% if leads that people get, 80% of the conversations end up on the show floor, meaning people forget about it, people forget who they talk to, people forget that there are actual business opportunities to be had here so we try to help out and keep the conversations going," explained Jeff Mesnik, Founder and President of ContentMX, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York City, NY.
With 15% of enterprises adopting a hybrid IT strategy, you need to set a plan to integrate hybrid cloud throughout your infrastructure. In his session at 18th Cloud Expo, Steven Dreher, Director of Solutions Architecture at Green House Data, discussed how to plan for shifting resource requirements, overcome challenges, and implement hybrid IT alongside your existing data center assets. Highlights included anticipating workload, cost and resource calculations, integrating services on both sides...
Manufacturers are embracing the Industrial Internet the same way consumers are leveraging Fitbits – to improve overall health and wellness. Both can provide consistent measurement, visibility, and suggest performance improvements customized to help reach goals. Fitbit users can view real-time data and make adjustments to increase their activity. In his session at @ThingsExpo, Mark Bernardo Professional Services Leader, Americas, at GE Digital, discussed how leveraging the Industrial Internet a...
Big Data engines are powering a lot of service businesses right now. Data is collected from users from wearable technologies, web behaviors, purchase behavior as well as several arbitrary data points we’d never think of. The demand for faster and bigger engines to crunch and serve up the data to services is growing exponentially. You see a LOT of correlation between “Cloud” and “Big Data” but on Big Data and “Hybrid,” where hybrid hosting is the sanest approach to the Big Data Infrastructure pro...
"My role is working with customers, helping them go through this digital transformation. I spend a lot of time talking to banks, big industries, manufacturers working through how they are integrating and transforming their IT platforms and moving them forward," explained William Morrish, General Manager Product Sales at Interoute, in this SYS-CON.tv interview at 18th Cloud Expo, held June 7-9, 2016, at the Javits Center in New York City, NY.
A critical component of any IoT project is what to do with all the data being generated. This data needs to be captured, processed, structured, and stored in a way to facilitate different kinds of queries. Traditional data warehouse and analytical systems are mature technologies that can be used to handle certain kinds of queries, but they are not always well suited to many problems, particularly when there is a need for real-time insights.
You think you know what’s in your data. But do you? Most organizations are now aware of the business intelligence represented by their data. Data science stands to take this to a level you never thought of – literally. The techniques of data science, when used with the capabilities of Big Data technologies, can make connections you had not yet imagined, helping you discover new insights and ask new questions of your data. In his session at @ThingsExpo, Sarbjit Sarkaria, data science team lead ...