Sunday, September 29, 2013

Disk-pushers, get reel: Even GOOGLE relies on tape



What was old is new again tape storage revival. Aivars Lode Avantce

Disk-pushers, get reel: Even GOOGLE relies on tape
Prepare to be beaten by your old, cheap rival

By Chris Mellor, 

Tangled magnetic tape
Current state
Oracle T1000 C tape drive

Future state
LTO Roadmap
5 ways to reduce advertising network latency
Comment Tape has spent some time on the ropes, but now it's back in the ring. After suffering five or more years of onslaught from pro-disk fanatics drunk on disk deduplication technologies, reality has struck home. Tape is cheaper than disk*. Tape is more reliable than disk and, the killer, tape's storage capacity can go on increasing for years.
In fact, IBM is currently preparing to demonstrate a near-40X improvement on LTO-6 capacity, while the rate of disk capacity increase is much slower.
Tape is the only storage medium that can confidently say it can keep up with the growing amounts transactions, images, music, plus compliance needs, social media conversations and machine-to-machine messages that are being thrown at our IT systems. Sure, store it on flash and disk when it's hot but bleed it off to tape when it's old and cold and can't be thrown away.
Tape, once the backup medium, is now reverting to an archive role. At the same time, the amount of data being archived is shooting up as data-generation rates increase.
There used to be more than a dozen tape and spool formats, and various methods of writing data to tape, but now there are just three main ones: from IBM, LTO and Oracle, with a fourth, HP's DAT, that is fading.
IBM's proprietary TS1140 format stores 4TB of raw data on a tape reel and Big Blue also makes a range of tape libraries to use the stuff, including its high-end TS3500 with 15,000 slots for cartridges, robots to pick the cartridges and slip them into drives, and a total capacity of compressed data in excess of 2.7 exabytes. With mainframe use of tape there is a near captive market for IBM although Oracle, with its acquired Sun/StorageTek technology plays a strong role here too.
Oracle T10000 drive
Oracle's is the second surviving proprietary tape format. Its T10000d holds 8.5TB of raw data, and, like IBM, Oracle makes a range of libraries with its StreamLine 8500 at the high end. This holds 10,000+ tapes in a single system and more than 100,000 in 10 linked systems. Its most famous customer for this is now Google.
LTO-6 tape
LTO, the Linear Tape Open consortium, was an invention of HP, IBM and Seagate, with the first sale by IBM in August 2000.
The idea was to create an open tape format for Windows and Unix servers with media and drive products from and interchangeable between the three consortium members. The combination of this with good technology decimated the other, proprietary tape formats for those servers, including DLT, Super-AIT, VXA and many others.
One after the other they all folded, leaving LTO supreme. Quantum joined the LTO by buying Certance, Seagate's ex-tape arm, in 2005. Nowadays only IBM and HP make drives, with Quantum selling LTO drives but not making them.
IBM, Oracle, HP and Quantum are three of the main tape library vendors, with SpectraLogic being a fourth. Talking of a tape resurgence, Spectra sold 550PB of tape storage in the second half of 2012.
Tape is a sequential access medium, but a virtual file:folder access system for it – called LTFS (Linear Tape File System) – has been created by IBM. This provides a drag-and-drop Windows file:folder-like access method to reading and writing files on tape.
It means ordinary users can, in theory, write files to tape and read files on tape without going through a backup software package, each of which has its own interface. This promises to revolutionise how tape is used, democratising, so to speak, its access.
That's the current tape technology state of play. But what's coming?
Each of the three main formats above has a roadmap out to one or two future generations which generally increase both capacity and data transfer speed. For example, LTO suppliers are currently shipping LTO-6, the latest generation. Sitting in the wings are LTO-7 and -8.
These increase both capacity and speed. LTO-7 provides 6.4TB of raw capacity (16TB compressed at a 2.5:1 ratio) and a 315MB/sec raw data transfer speed, which compares to LTO-6's 210MB/sec. LTO-8 should provide 12.8TB of capacity and a 427MB/sec transfer speed, both for raw data.
We might expect a new generation to become available 30 months or so after the preceding one. That's generally how it works. Historically each LTO generation drive can read and write the preceding generation's drive and read tapes from the generation before that, thus easing migration to the latest format.
We predict future LTO generations like LTO-9 and LTO-10 which double transfer speed and capacity compared to the preceding generation, not that the LTO consortium is committing to these format futures.
Oracle has a similar roadmap for its T10000 format. A coming T10000e could offer 12-20TB capacity and transfer speeds in the range of 400 - 600MB/sec, though we might think 300 - 350MB/sec might be more realistic. We're sure Oracle is more precise in its numbers when talking to its tape customers.
IBM? The same pattern we're sure, although Big Blue is coy about publicising it. El Reg storage desk also predicts that TS1150 and TS1170 formats could follow on from the existing TS1140 format, again with a general doubling of both capacity and transfer speed.
IBM has demonstrated a 35TB raw capacity tape. It is now preparing a demonstration of a tape holding 125TB using a refinement of today's barium ferrite tape media, not a totally new recording technology. Assume that works, then a TS3500 library using such tapes could hold 84 exabytes of data. It's a fantastically humungous amount of data. Nothing else will come close in terms of the cost/GB of storage, nothing.
Assume LTO keeps on doubling its format every 30 months or so and we could see a 102TB capacity LTO-11 in 12 to 13 years, 2025-2026.
How can we be sure this is reasonable and not a technological pipe dream? It's because the physical size of a bit on tape is, relative to a disk bit, gigantic. There is simply lots of room to shrink the physical size of tape bits without prejudicing bit stability, as is happening with current PMR disk recording technology which is forcing a move to a new recording technology. The 125TB tape project involves shrinking the tape bit size or areal density to 100Gbit/in2 which compares to the advanced disk areal densities in use now in the 620-690Gbit/in2 area.
DAT's future is a fading to insignificance as disk and cloud backup generally replace it.
Tape is the archive medium and it is being used now by some of the biggest archivers of data around such as Google and Amazon
.

No comments:

Post a Comment