Exponential Growth in Storage Consider data storage, which is critical for the genomics world today. The 3.2 billion base pairs of your genome correspond to about 725 megabytes of data, or 0.75 gigabytes of storage. In 1981, if you were to store your uncompressed genome, a 1-gigabyte hard drive of storage cost half a million dollars. Today, it’s 50 million times cheaper at under 1 cent per gigabyte.
Reference Quote
Similar Quotes
To the extent that genomes can be thought of as compressed encodings of biological structures, they are spectacularly efficient. All the trillions of cells in the human body-not just the tens of billions in the brain-are guided in one way or another by the information contained in 30,000 or so genes. The best high-quality set of pictures of the body- the National Institutes of Health Visible Human Project, a series of high-resolution digital photos of slices taken from volunteer Joseph Paul Jernigan (deceased)-takes up about 60 gigabytes, enough (if left uncompressed) to fill about 100 CD-ROMs-and still not enough detail to capture individual cells. The genome, in contrast, contains only about 3 billion nucleotides, the equivalent (at two bits per nucleotide) of less than two-thirds of a gigabyte, or a single CD-ROM.
The beauty in the genome is of course that it's so small. The human genome is only on the order of a gigabyte of data...which is a tiny little database. If you take the entire living biosphere, that's the assemblage of 20 million species or so that constitute all the living creatures on the planet, and you have a genome for every species the total is still about one petabyte, that's a million gigabytes - that's still very small compared with Google or the Wikipedia and it's a database that you can easily put in a small room, easily transmit from one place to another. And somehow mother nature manages to create this incredible biosphere, to create this incredibly rich environment of animals and plants with this amazingly small amount of data.
Go Premium
Support Quotosaurus while enjoying an ad-free experience and premium features.
View PlansYour synapses store all your knowledge and skills as roughly 100 terabytes’ worth of information, while your DNA stores merely about a gigabyte, barely enough to store a single movie download.
To remind you, the Human Genome Project took about a decade to sequence a single human genome, completed in April of 2003 at an approximate cost of nearly $3 billion. Today Illumina’s latest generation sequencer has the potential to sequence your genome in an hour and for $100 — or 87,600 times faster and 30 million times cheaper.
the very beginning of time until the year 2003,” says Google Executive Chairman Eric Schmidt, “humankind created five exabytes of digital information. An exabyte is one billion gigabytes — or a 1 with eighteen zeroes after it. Right now, in the year 2010, the human race is generating five exabytes of information every two days. By the year 2013, the number will be five exabytes produced every ten minutes … It’s no wonder we’re exhausted.
Imagine that the genome is a book.
There are twenty-three chapters, called CHROMOSOMES.
Each chapter contains several thousand stories, called GENES.
Each story is made up of paragraphs, called EXTONS, which are interrupted by advertisements called INTRONS.
Each paragraph is made up of words, called CODONS.
Each word is written in letters called BASES.
The CD-ROM's worth of information in the genome really wouldn't be enough to paint a bitmapped picture of an embryo, but it is enough to describe a process for building one. An artist who only wants to paint a picture that looks like a kind of tree has much less to remember than an artist who wants to paint a particular Ponderosa Pine from memory; in a similar way, if some alien's genome had to encode every cell in a body, it would need much more information (many more nucleotides) than our genomes do, because ours specify a general way to build a creature rather than an exact picture of every detail of the finished product. Our genomes are lossy because they specify methods rather than pictures, but it is precisely that lossiness that allows them to so efficiently supervise the construction of complex biological structure.
I’m a high-tech low-life. A cutting edge, state-of-the-art bi-coastal multi-tasker and I can give you a gigabyte in a nanosecond!
Biology doesn't know in advance what the end product will be; there's no Stuffit Compressor to convert a human being into a genome. But the genome itself is very much akin to a compression scheme, a terrifically efficient description of how to build something of great complexity-perhaps more efficient than anything yet developed in the labs of computer scientists (never mind the complexities of the brain, there are trillions of cells in the rest of the body, and they are all supervised by the same 30,000-gene genome). And although there is no counterpart in nature to a program that compresses a picture into a compact description, there is a natural counterpart to the program that decompresses the compressed encoding, and that's the cell. Genome in, organism out. Through the logic of gene expression, cells are self-regulating factories that translate genomes into biological structure.
Exponential Growth in Computation How about computation? In 1971, Intel put out its first computer chip, the Intel 4004. It had 2,300 transistors on it, at $1 each. Intel no longer actually tells you how many transistors are on their chips, but the recent Core i9 had 7 billion transistors at less than a millionth of a penny each. This represents a 27-billion-fold increase in price performance in forty-five years.
The era of garage biology is upon us. Want to participate? Take a moment to buy yourself a molecular biology lab on eBay. A mere $1,000 will get you a set of precision pipettors for handling liquids and an electrophoresis rig for analyzing DNA. Side trips to sites like BestUse and LabX (two of my favorites) may be required to round out your purchases with graduated cylinders or a PCR thermocycler for amplifying DNA. If you can’t afford a particular gizmo, just wait six months — the supply of used laboratory gear only gets better with time. Links to sought-after reagents and protocols can be found at DNAHack. And, of course, Google is no end of help.
Enhance Your Quote Experience
Enjoy ad-free browsing, unlimited collections, and advanced search features with Premium.
"How does the body push the comparatively tiny genome so far? Many researchers want to put the weight on learning and experience, apparently believing that the contribution of the genes is relatively unimportant. But though the ability to learn is clearly one of the genome's most important products, such views overemphasize learning and significantly underestimate the extent to which the genome can in fact guide the construction of enormous complexity. If the tools of biological self-assembly are powerful enough to build the intricacies of the circulatory system or the eye without requiring lessons from the outside world, they are also powerful enough to build the initial complexity of the nervous system without relying on external lessons.
The discrepancy melts away as we appreciate the true power of the genome. We could start by considering the fact that the currently accepted figure of 30,000 could well prove to be too low. Thirty thousand (or thereabouts) is, at press time, the best estimate for how many protein-coding genes are in the human genome. But not all genes code for proteins; some, not counted in the 30,000 estimate, code for small pieces of RNA that are not converted into proteins (called microRNA), of "pseudogenes," stretches of DNA, apparently relics of evolution, that do not properly encode proteins. Neither entity is fully understood, but recent reports (from 2002 and 2003) suggest that both may play some role in the all-important process of regulating the IFS that control whether or not genes are expressed. Since the "gene-finding" programs that search the human genome sequence for genes are not attuned to such things-we don't yet know how to identify them reliably-it is quite possible that the genome contains more buried treasure."
The genome is a book that wrote itself, continually adding, deleting and amending over four billion years.
As the old joke goes: “Software, free. User manual, $10,000.” But it’s no joke. A couple of high-profile companies, like Red Hat, Apache, and others make their living selling instruction and paid support for free software. The copy of code, being mere bits, is free. The lines of free code become valuable to you only through support and guidance. A lot of medical and genetic information will go this route in the coming decades. Right now getting a full copy of all your DNA is
Loading...