Object Storage Not Yet Defined

Agreed that object storage platforms scale better than file systems & NAS

The ExecEvent Object Storage Summit earlier this month continues to generate buzz in the industry, which is very exciting. Amplidata was represented, in spirit, at the Summit by our partners Intel and Quantum; an insane travel and show schedule this fall kept us from attending in person. We’re grateful for the mention in Storage Switzerland’s sponsor briefing articles. Very cool! With all the great stuff that has been happening for Amplidata lately, including the awesome performance test results by Howard Marks, we felt a bit like we were missing our own birthday party. We’ll be there next time!

The event fostered a few “What is Object Storage?” posts from, amongst others, George Crump. Jim O’Reilly also posted a very interesting article, although I’m not sure if he was at the event. If he wasn’t, he should be next time!

Both articles add to the rapidly evolving body of knowledge on what object storage is and why customers should adopt it, so every article helps. With a topic as technical as object storage, it’s tempting to evangelize with a deep technical dive, but that misses the “elegant simplicity” point. Hence we love George’s use of the car park analogy, which we often use ourselves. His article is a helpful at-a-glance overview. On a more technical level, Jim’s explanation of concepts such as immutable blobs (“the original version is the only version”) and the fact that objects still look like files offers more on how object storage really works. George’s point that “Objects are given unique ID numbers” is what’s missing in Jim’s article. I guess what we’re saying is “read both articles.”

But read them critically, and you will see that we’re not there yet. As you can read in Jim’s article, the paradigm has been around much longer than many of us know, and we have not finished defining the best use cases, implementations, architectures, etc. For example, I’m not at all sure about the reduced metadata George writes about. I believe that over time, as we start using richer applications, we will be storing more metadata, not less. To me, Jim’s statement “To be an object, a blob of data needs a much more detailed descriptor record than what file systems use.” is more accurate.
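To make the unique ID and descriptor ideas a bit more concrete, here is a minimal Python sketch of a toy in-memory store. Everything in it (`ToyObjectStore`, `StoredObject`, `put`, `get`, the metadata keys) is hypothetical and invented for illustration; it is not how Amplidata or any other product implements object storage.

```python
import uuid
from dataclasses import dataclass


@dataclass(frozen=True)
class StoredObject:
    """An immutable blob plus its descriptor record (hypothetical layout)."""
    object_id: str
    data: bytes
    metadata: dict


class ToyObjectStore:
    """Flat namespace: objects are found by ID, not by a directory path."""

    def __init__(self) -> None:
        self._objects = {}

    def put(self, data: bytes, **metadata) -> str:
        # Every write creates a new object with a new ID; nothing is
        # overwritten in place, so the original version stays the only version.
        object_id = uuid.uuid4().hex
        self._objects[object_id] = StoredObject(object_id, data, dict(metadata))
        return object_id

    def get(self, object_id: str) -> StoredObject:
        return self._objects[object_id]


store = ToyObjectStore()
first = store.put(b"draft 1", content_type="text/plain", owner="tom")
second = store.put(b"draft 2", content_type="text/plain", owner="tom")
print(first != second)            # each version has its own ID
print(store.get(first).metadata)  # the descriptor travels with the object
```

Note how easily the metadata dictionary can grow as applications get richer, which is the point made above about storing more metadata rather than less.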

Both articles also cover the “why” of Object Storage. I’m not sure I see the use of Jim’s deduplication paragraph, and I think we are missing erasure coding as an alternative to RAID in his article (replication can be expensive too!). Jim accurately mentions that block storage was I/O focused, but omits the exceptional throughput performance some of the object stores deliver. A good thing is that Jim sees the scalability, flexibility and cost-saving opportunities. Finally, I very much like his use cases: Google Picasa, Amazon S3, Genome etc. and it is very interesting to read that Jim sees potential for object storage in the Big Data analytics space.
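Why replication can be expensive compared with erasure coding comes down to simple arithmetic. The sketch below assumes a generic scheme that splits an object into `data_fragments` pieces and adds `parity_fragments` extra pieces; the function names and the 10+4 and 3-copy figures are illustrative assumptions, not numbers taken from either article or from any product.

```python
def replication_overhead(copies: int) -> float:
    """Raw bytes stored per byte of user data when keeping full copies."""
    return float(copies)


def erasure_overhead(data_fragments: int, parity_fragments: int) -> float:
    """Raw bytes stored per byte of user data for a k+m erasure code."""
    return (data_fragments + parity_fragments) / data_fragments


# Three full copies survive the loss of any two copies.
print(replication_overhead(3))

# A 10+4 code survives the loss of any four fragments.
print(erasure_overhead(10, 4))
```

Under these assumptions, three-way replication stores 3 bytes for every byte of data, while the 10+4 code stores 1.4 bytes and tolerates more simultaneous failures.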

So back to George’s take on why we need object storage. Agreed that object storage platforms scale better than file systems & NAS but, again, not so much because of the metadata. File systems have different challenges, such as the granularity of the hardware, limitations on numbers of files or the number of levels in the hierarchy. Distributed file systems tried to solve some of these issues, but object storage is just a much simpler approach. Agreed that adding NAS heads is an expensive and not so great solution!

The second topic I found interesting was the issue of “bit rot”. Bit rot is a real problem and will lead to data loss with traditional storage technologies, but not every object store will solve it. As I understand it, it is the underlying data protection scheme that solves the problem of bit rot, not object storage as such. Erasure coding detects bit rot and prevents data loss. I don’t think you could restore the content of an object using the identifier alone, but maybe there is some really cool technology out there that I don’t know of. When George writes “The storage system does not need an elaborate RAID protection algorithm nor do its administrators need to suffer through long RAID rebuild cycles”, I think he is actually alluding to erasure coding but didn’t want to go that deep in this article.
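Here is a deliberately simplified Python sketch of how a protection scheme, rather than the object ID, can catch and repair bit rot. It assumes the simplest possible code: `k` data fragments plus a single XOR parity fragment, with a SHA-256 checksum stored next to each fragment to detect corruption. Real erasure codes tolerate more than one lost fragment, and all names here (`encode`, `repair`, `decode`) are hypothetical; this is not any vendor's implementation.

```python
import hashlib
from functools import reduce


def xor_bytes(a: bytes, b: bytes) -> bytes:
    return bytes(x ^ y for x, y in zip(a, b))


def encode(data: bytes, k: int) -> list:
    """Split data into k fragments plus one XOR parity fragment.

    Each fragment is stored together with its SHA-256 checksum.
    """
    size = -(-len(data) // k)  # ceiling division
    padded = data.ljust(size * k, b"\0")
    fragments = [padded[i * size:(i + 1) * size] for i in range(k)]
    fragments.append(reduce(xor_bytes, fragments))
    return [(f, hashlib.sha256(f).hexdigest()) for f in fragments]


def repair(stored: list) -> list:
    """Detect rotted fragments by checksum and rebuild one from the others."""
    bad = [i for i, (f, digest) in enumerate(stored)
           if hashlib.sha256(f).hexdigest() != digest]
    if len(bad) > 1:
        raise ValueError("single XOR parity can rebuild only one fragment")
    if bad:
        others = [f for i, (f, _) in enumerate(stored) if i != bad[0]]
        stored[bad[0]] = (reduce(xor_bytes, others), stored[bad[0]][1])
    return bad


def decode(stored: list, length: int) -> bytes:
    """Concatenate the data fragments (all but the parity) and drop padding."""
    return b"".join(f for f, _ in stored[:-1])[:length]


payload = b"object storage keeps blobs immutable"
stored = encode(payload, k=4)

# Simulate bit rot: flip one bit in fragment 1 without touching its checksum.
frag, digest = stored[1]
stored[1] = (bytes([frag[0] ^ 0x01]) + frag[1:], digest)

print(repair(stored))                          # indexes of repaired fragments
print(decode(stored, len(payload)) == payload)
```

The checksum does the detecting and the redundancy does the repairing; the object's identifier plays no part in either step.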

Another interesting point in George’s article is the issue with backups. Once you go into the petabyte range, it becomes very unwieldy to back up data. He mentions the backup window, but add to that the overhead cost. George promotes using the unique IDs to make sure “that there are always copies of each object available on-site and off-site.” Again, with the proper underlying protection schemes (erasure coding) you can rule out backups altogether!

I’m sure both George and Jim will appreciate the feedback. I fully agree with the benefits object storage brings for tracking iterations of files, and with the paragraph on geo dispersion, which we have termed geo-spreading. Finally, I hope to read more of George’s thoughts about how object storage can help to monetize archived data as that, to me, is a key argument for this new but then again not so new storage paradigm. This is obviously not the end of the discussion; a lot will be said, and needs to be said, about this new paradigm. I’m looking forward to attending the next Object Storage events…


More Stories By Tom Leyden

Tom Leyden is VP Product Marketing at Scality. Scality was founded in 2009 by a team of entrepreneurs and technologists. The idea wasn’t storage, per se. When the Scality team talked to the initial base of potential customers, the customers wanted a system that could “route” data to and from individual users in the most scalable, efficient way possible. And so began a non-traditional approach to building a storage system that no one had imagined before. No one thought an object store could have enough performance for all the files and attachments of millions of users. No one thought a system could remain up and running through software upgrades, hardware failures, capacity expansions, and even multiple hardware generations coexisting. And no one believed you could do all this and scale to petabytes of content and billions of objects in pure software.
