With the entry of the 21st century, Genomic sequencing has become one of the hottest research fields in biological world. The declining price of genomic sequencing has expanded the range of its application in the study of disease diagnosis and prediction. Despite of its importance, how much do you know about it?
What is genomic sequencing?
Genomic sequencing is the process of determining the precise order of nucleotides within a DNA molecule. It includes any method or technology that is used to determine the order of the four bases—adenine, guanine, cytosine, and thymine—in a strand of DNA. The advent of rapid genomic sequencing methods has greatly accelerated biological and medical research and discovery.
It is often compared to “decoding”, but a sequence is still very much in code. In a sense, a genome sequence is simply a very long string of letters in a mysterious language.
When you read a sentence, the meaning is not just in the sequence of the letters. It is also in the words those letters make and in the grammar of the language. Similarly, the human genome is more than just its sequence.
Imagine the genome as a book written without capitalization or punctuation, without breaks between words, sentences, or paragraphs, and with strings of nonsense letters scattered between and even within sentences.
Sequencing the genome doesn’t immediately lay open the genetic secrets of an entire species. Even with a rough draft of the human genome sequence in hand, much work remains to be done. Scientists still have to translate those strings of letters into an understanding of how the genome works: what the various genes that make up the genome do, how different genes are related, and how the various parts of the genome are coordinated. That is, they have to figure out what those letters of the genome sequence mean.
Why is genomic sequencing so important?
Sequencing the genome is an important step towards understanding it.
At the very least, the genome sequence will represent a valuable shortcut, helping scientists find genes much more easily and quickly. A genome sequence does contain some clues about where genes are, even though scientists are just learning to interpret these clues.
Scientists also hope that being able to study the entire genome sequence will help them understand how the genome as a whole works—how genes work together to direct the growth, development and maintenance of an entire organism.
Finally, genes account for less than 25 percent of the DNA in the genome, and so knowing the entire genome sequence will help scientists study the parts of the genome outside the genes. This includes the regulatory regions that control how genes are turned on an off, as well as long stretches of “nonsense” or “junk” DNA—so called because we don’t yet know what, if anything, it does.
How do you sequence a genome?
The quick answer to this question is: in pieces. The whole genome can’t be sequenced all at once because available methods of DNA sequencing can only handle short stretches of DNA at a time.
So instead, scientists must break the genome into small pieces, sequence the pieces, and then reassemble them in the proper order to arrive at the sequence of the whole genome. Much of the work involved in sequencing lies in putting together this giant biological jigsaw puzzle.
There are two approaches to the task of cutting up the genome and putting it back together again. One strategy, known as the “clone-by-clone” approach, involves first breaking the genome up into relatively large chunks, called clones, about 150,000 base pairs (bp) long. Scientists use genome mapping techniques (discussed in further detail later) to figure out where in the genome each clone belongs. Next they cut each clone into smaller, overlapping pieces the right size for sequencing—about 500 BP each. Finally, they sequence the pieces and use the overlaps to reconstruct the sequence of the whole clone.
The other strategy, called “whole-genome shotgun” method, involves breaking the genome up into small pieces, sequencing the pieces, and reassembling the pieces into the full genome sequence.
More detailed genomic sequencing approaches will be introduced in the following posts.