By Maureen O'Gara
December 3, 2012 07:00 AM EST | Reads: 2,996
Amazon Web Services used re:Invent, its very first customer and partner conference this week in Vegas, to announce the coming of a cloud data warehouse service called Redshift meant to undercut and disrupt the pricey "old guard" brands of Oracle, IBM, Teradata, EMC Greenplum and HP.
Redshift literally represents a shift in Amazon's targeting.
It's going up-market looking for customers among the big corporates that have supposedly overcome their doubts about running mission-critical apps in the public cloud and are now down to figuring out which ones to move first and how quickly.

Amazon evidently thinks it's on the cusp of this great inflection point. To bait the big fellas onto its cloud - as well as defend its 60% IaaS market share against the likes of Google, Rackspace and Microsoft - it's giving them an alternative to the on-premise data warehouses that large companies say are "too expensive and a pain in the butt to manage" and that smaller companies feel are simply beyond their reach, according to AWS boss Andy Jassy.
Redshift, which is probably the closest AWS has come to peddling an actual application, is a typical pay-as-you-go service with no payment upfront.
It's supposed to cost about one-tenth as much as a traditional data warehouse.
That's why Amazon figures its high-volume, low-margin model is going to take business away from the 60% to 80% gross margin crowd and their scale-lacking private "cloudwash" clouds.
Redshift is currently in limited preview and technical details are kind of thin, but it's reportedly designed to be easy to provision and to automate setup, operation and cluster regulation.
It promises to work with all the popular business analytics tools and give great performance. It's a columnar data store built for certain kinds of ad hoc queries, and its query response on almost any size dataset is supposed to be really fast because of its basic design and because of compression on the server nodes.
The Register thinks it may be based on a parallelized version of the PostgreSQL open source database à la Netezza and Greenplum, since it uses PostgreSQL drivers to link to third-party BI tools. Anyway, it speaks standard SQL and has JDBC and ODBC hooks for the BI programs.
ITProPortal says it consists of ParAccel-licensed components, available in two underlying node variants that can contain either 2TB or 16TB of compressed customer data per node. A user can start with a single small node and scale up to 32 nodes with 64TB of capacity or use fatter nodes and scale up to 1.6PB of capacity.
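Those capacity figures are easy to sanity-check. The sketch below just multiplies them out; the 100-node count for the fat-node configuration is inferred from 1.6PB divided by 16TB per node, not a number given in the reports, and it assumes decimal units (1PB = 1,000TB).

```python
# Capacity arithmetic for the two reported node sizes.
# Assumption: decimal units, so 1.6 PB is treated as 1,600 TB.

def cluster_capacity_tb(nodes: int, tb_per_node: int) -> int:
    """Total compressed capacity of a cluster, in terabytes."""
    return nodes * tb_per_node

def nodes_needed(capacity_tb: int, tb_per_node: int) -> int:
    """Smallest node count that reaches the given capacity (ceiling division)."""
    return -(-capacity_tb // tb_per_node)

small_max_tb = cluster_capacity_tb(32, 2)   # 32 small nodes at 2TB each
large_nodes = nodes_needed(1600, 16)        # 16TB nodes needed for 1.6PB

print(small_max_tb, large_nodes)
```

Run as written, this gives 64TB for the small-node maximum, matching the report, and implies about 100 of the 16TB nodes at the top end.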
There was no mention of flash storage to boost I/O, likely as that may be.
Amazon the Internet retailer, AWS' parent company, which spends a few million a year on a conventional data warehouse, tested Redshift on a two-node cluster and reportedly ran six of its toughest queries on a dataset with two billion rows of data. It found that Redshift ran 10 times faster than its on-premise warehouse and cost $3.65 an hour, or about $32,000 a year, peanuts compared to what it's now spending.
AWS says classic on-premise data warehouses run $19,000-$25,000 a terabyte of storage a year including a few administrators, hardware, software and maintenance.
Redshift is supposed to launch early next year priced at under $1,000 a terabyte a year with the ability to scale to a petabyte or more of storage for those who promise to stick around for a while. It's got one-year and three-year reservations. The price quoted is for a three-year deal on a heavily used 13-node Redshift cluster.
Using it on-demand will cost more. Figure 85 cents an hour for 2TB nodes and $6.80 an hour for 16TB nodes.
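To put the hourly prices on the same footing as the per-terabyte figures, here is a rough sketch of the arithmetic. It assumes a node runs nonstop for a 365-day year and is filled to its stated capacity; neither assumption comes from AWS, and the function names are only illustrative.

```python
# Convert the quoted hourly prices into yearly and per-terabyte-year figures.
# Assumptions: 24x365 operation and nodes filled to their stated capacity.
HOURS_PER_YEAR = 24 * 365

def annual_cost(hourly_rate: float) -> float:
    """Yearly cost of something billed by the hour, running nonstop."""
    return hourly_rate * HOURS_PER_YEAR

def cost_per_tb_year(hourly_rate: float, tb_per_node: int) -> float:
    """Yearly on-demand cost per terabyte for a single full node."""
    return annual_cost(hourly_rate) / tb_per_node

two_node_test = round(annual_cost(3.65))          # Amazon retail's test cluster
small_node = round(cost_per_tb_year(0.85, 2))     # 2TB node, on-demand
large_node = round(cost_per_tb_year(6.80, 16))    # 16TB node, on-demand

print(two_node_test, small_node, large_node)
```

The $3.65-an-hour test cluster works out to $31,974 a year, which squares with the "about $32,000" figure. Both node sizes come to roughly $3,700 per terabyte per year on demand, well above the sub-$1,000 three-year reserved price but still far below the $19,000-$25,000 AWS quotes for on-premise warehouses.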
NASA's Jet Propulsion Lab and Netflix are already users.
Copyright © 2012 SYS-CON Media, Inc. — All Rights Reserved.
Syndicated stories and blog feeds, all rights reserved by the author.
More Stories By Maureen O'Gara
Maureen O'Gara, the most read technology reporter for the past 20 years, is the Cloud Computing and Virtualization News Desk editor of SYS-CON Media. She has been the publisher of the famous "Billygrams" and the editor-in-chief of "Client/Server News" for more than a decade. One of the most respected technology reporters in the business, Maureen can be reached by email at maureen(at)sys-con.com or paperboy(at)g2news.com, and by phone at 516 759-7025. Twitter: @MaureenOGara