Web Site: www.ijettcs.org Email: editor@ijettcs.org, editorijettcs@gmail.com Volume 2, Issue 2, March April 2013 ISSN 2278-6856
sequence consisting of four alphabets: A, C, G and T. Each alphabet is related to a nucleotide. It is usually quite long. For instance, the DNA sequence of Litmus, its real length is with 2856 nucleotides long: ATCGAATTCGCGCTGAGTCACAATTCGCGCTGAG TCACAATTCGCGCTGAGTCACAATTGTGACTCAG CCGCGAATTCCTGCAGCCCCGAATTCCGCATTGC AGAGATAATTGTATTTAAGTGCCTAGATACAATA AACGCCATTTGACCATTCACCACATTGGTGTGCA CCTCCAAGCTCGCGCACCGTACCGTCTCGAGGAA TTCCTGCAGGATATCTGGATCCACGAAGCTTCCC ATGGTGACGTCAC. From this sequence a several useful properties could be shown: a) There is almost no difference between a real DNA sequence and a faked one. b) There are a large number of DNA sequences publicly available in various web-sites. A rough estimation would put the number of DNA sequences publicly available to be around 55 million [ 1 ]. By using the above facts, we designed a DNA based encryption method. In general most of the methods that use DNA sequences are in general used for encrypting a binary string. We design a method where any text can be encrypted into a fake DNA sequence S. This transformed sequence S is sent by a sender to the receiver.
1. INTRODUCTION
Deoxyribonucleic Acid (DNA) is the molecule that contains the genetic information and functioning of all living organisms and viruses. Genetic information is encoded as a sequence of nucleotides (guanine, adenine, thymine, and cytosine) recorded using the letters G, A, T, and C. Base pairs guaninecytosine and adenine-thymine which is attached to a sugar and a phosphate molecule allow the DNA helix to maintain a regular helical structure that is independent of its sequence. Bases are sequenced differently for different information that needs to be transmitted. This is similar to that of any different sequences of letters form words and sequences of words form sentences.
Structure of DNA In recent years, much research work has been done on DNA based encryption schemes. A DNA sequence is a Volume 2, Issue 2 March April 2013
Page 231
3. EXAMPLE
Let the message S to be encrypted be HELLO, so that | S | = 5. Replacing each alphabet by the corresponding fifth alphabet we obtain the MJQQT. Table 2 this is converted into M: 01101101 J: 01101010 Q: 01110001 T: 01110100 Using table 1 we obtain the following M: GCTG J: GCCC Q: GTAG T: GTGA From this we generate the 4 x 5 matrix
G G G G G C C T T T A T C A A G G C G G A
Alphabet
Binary Representation 00 10 01 11
A C G T
Table 2 Table 2 represents the ASCII conversion into binary numbers [ 3 ]. 2.1 Encryption Algorithm Let S be the message to be encoded. Let length of S be k. Step 1 Shift each letter in the message to a new letter where the shift value is k to generate S1. Step 2 Convert S1 into a binary string B using ASCII value conversion. Volume 2, Issue 2 March April 2013
So that the message to be send to the receiver is GGGGGCCTTTTCAAGGCGGA. Suppose the received message is GGGGGGGAAGGGGATCCGGGGTAGACGC. The length of the received string is 28. So the size of the matrix will be 4 x 7. The corresponding matrix is
G G G G G G G A A G G G G A A T C C G G G G T A G A C G C
Page 232
Page 233