• No results found

“A Cryptographic Model for HINDI Text Using New Substitution Method”

N/A
N/A
Protected

Academic year: 2022

Share "“A Cryptographic Model for HINDI Text Using New Substitution Method”"

Copied!
10
0
0

Loading.... (view fulltext now)

Full text

(1)

IJRIT International Journal Of Research In Information Technology, Volume 3, Issue 8, August 2015, Pg. 1-10

International Journal of Research in Information Technology (IJRIT)

www.ijrit.com ISSN 2001-5569

Research Paper on

“A Cryptographic Model for HINDI Text Using New Substitution Method”

Vaibhav Kumar

Master's of Technology, Computer Science Engineering, Roorkee College Of Engineering, Rookee., Affilated to Uttrakhand Technical University, Dehradun.

Abstract

Earlier computers were used by big organizations, universities etc for sending text to each other with the help of computer networks. That time security was not having that much of concern. But now the whole scenario has changed, as now-a-days every second person is using computer networks for sending e-mails, online shopping, banking etc. So data security over computer network has paramount concern.

As the information security has major changes for last 10 years. Earlier the security to the information was provided physically but as the introduction of computers changed the whole scenario now the information became electronic so we need some tools to protect it from the illicit user as well as from the hackers while transmitting over a network and also on a computer. Here we can use cryptographic techniques.

Cryptography is a Greek word which means “secret writing”. In this the information (text) to be protected is called as plaintext and then a function is applied to plaintext for encryption and the resulting data is called as cipher text. This cipher text is different from plaintext as it is the encrypted text which can not be easy to break and get the original information. The reverse process is applied on the cipher text to convert it into the plaintext or the original information and the process is called as decryption. This is the technique which is used in this thesis to protect HINDI text.

Combination of encryption and decryption for secure communication is an application, which combines both (i.e., Encryption, decryption) techniques to make the communication more secure. It is concerned with embedding information in a secure and robust manner. This thesis also attempts to provide an alternate technique for encrypting HINDI text and also decryption technique.

Keywords: Encryption, Decryption, HINDI Encryption Algorithm, Network security, cryptography.

Introduction

In today’s world there are several factors which affect the security of important data. So, we must have a technique to protect our data and restrict it from the unauthorized access and secure it from being misused on a computer networks. We have so many techniques those are used in securing data from illicit use over the

(2)

IJRIT International Journal Of Research In Information Technology, Volume 3, Issue 8, August 2015, Pg. 1-10

network or on a machine. Techniques like password, biometrics and cryptography are being used now-a-days.

Passwords are not considered that much safe as they have low entropy. Biometrics are very costly and have some limitations with them. So, finally we can say the cryptography is the optimal solution to our problem.

Cryptography is the best solution for security [Stinson, 2006]. A message is to be send secretly is a plaintext.

But by encryption it is converted into cipher text (hidden data) now it is safe enough to be send. In similar way the reverse process which is decryption is used to get back the message i.e. plaintext it means converting back a plaintext from a cipher text by using cryptographic techniques.

Cryptology is the study of decryption and encryption and cryptography is the application. For encoding a plaintext, the plaintext is converted into a series of bits or numbers alpha-numeric may be used which include A to Z and 0 to 9 values [Alqdah, 2008]. One of most commonly used method is to encode a plaintext with its ASCII value, where each symbol is represented in 8 bit. While in decoding every bit or number is converted back to the plaintext. The stream cipher is operated on the message or plaintext symbol by symbol or bit by bit.

Whereas a block cipher is operated on blocks of symbols and a substitution cipher is used for replacing the symbols/letters or bits in a plaintext by others or without changing the order. A product cipher alternates the substitution and transposition. This concept of block cipher versus stream cipher applies only to product and substitution ciphers not to transposition ciphers.

2. Cryptography

Cryptography is the best way to secure important data from unauthorized access. It is the art of writing the data in the codes and the analysis and design of different mathematical functions that ensures the secure communication among millions of adversaries. Cryptography is the science of secret writing. The basic functionality of cryptography is the ability to send the information between participants in a way that prevents others from reading it [Kakkar, 2004]. Cryptographic models contain both a secret value and an algorithm. A key is called as secret value. The reason behind having a key is that, it is very difficult to generate a new algorithm every time so we can change the key as many times as we want.

3. Encryption

For ensuring security of data, encryption is the best technique which can be used anywhere and everywhere, where there is the need of security of data. The method of hiding the information from a plaintext or message in a way that it can be recovered only by applying the reverse process i.e. decryption is call encryption [Alqdah, 2008]. The motive of encryption is to avoid unauthorized access to the information. Encryption is done when the plaintext is processed through some techniques like mathematical operations or table reference, shifting technique. A different type of data is generated from these processes. Plaintext is said to be unencrypted data while cipher text is said to be encrypted data. This shows the real data in a different form. Encryption key is used for encrypting the plaintext that depends on key based algorithms.

Key based algorithms can be classified into two categories, first one is Symmetric Encryption (SE) encryption and decryption is done using a single key. Second is Asymmetric Encryption (AE) here two keys are used public

(3)

IJRIT International Journal Of Research In Information Technology, Volume 3, Issue 8, August 2015, Pg. 1-10

and private. Public key used for the encryption while the private key is used for decryption [Kakkar, 2004].

There are several types of key based algorithms like PGP, DES, RSA, elliptic curve but all are based on high mathematical manipulations.

4. Methodology of work

For achieving the objectives following methodology is used.

• Algorithms for sub keys generation.

• Key Generation Algorithm is used to generate a sequence by multiplying a quaternary vector and a random matrix key. The generated sequence is divided among equal value basins. Then these basins are mapped with the plaintext to generate cipher text. This will give us three models of cryptography.

• Encryption Algorithm is used to generate sub keys and mapping is done to generate cipher text from the plain text.

• Algorithms are checked for different inputs.

• Summary and conclusion of work.

Fig 1: Flow Chart of Encryption Algorithm

Input text

Convert it into phonetic transcription

Write the Alphanumerical equivalent

Apply the Key with the Encryption Algorithm

Cipher text is ready

(4)

IJRIT International Journal Of Research In Information Technology, Volume 3, Issue 8, August 2015, Pg. 1-10

Fig 2: Flow Chart of Decryption Algorithm

5. Working of Cryptographic Model

In this work two algorithms will be used. Algorithm 1 will generate a sequence by using a matrix key which will be multiplied with a quaternary vector and a sign function will be applied on the product. A model will be generated with help of this sequence. Hence this is a substitution algorithm where a single key is used. This new algorithm makes use of poly alphabet cipher instead of mono alphabet cipher. A cryptographic model will be developed and each having its own limitations and advantages.

5.1 Conversion to phonetics

As Hindi language scripts is syllabic in nature and have a one to one correspondence between the letters and the sounds, this is a simple process of mapping the letters to a phonetic alphabet.

5.1.1 Vowels

A vowel is a sound in spoken language, pronounced with an open vocal tract so that there is no build-up of air

Take Cipher Text

Apply Decryption Algorithm

Write Equivalent Alphabetical value of Alphanumerics

Convert the Alphabets into Phonetics Transcription

Convert Phonetic into

Plaintext

(5)

IJRIT International Journal Of Research In Information Technology, Volume 3, Issue 8, August 2015, Pg. 1-10 Table 1: Vowels of Hindi Language

5.1.2 Consonants

Consonant is a sound in spoken language, which is pronounced with a constriction or closure at some point along the vocal tract.

Table 2: Consonants

(6)

IJRIT International Journal Of Research In Information Technology, Volume 3, Issue 8, August 2015, Pg. 1-10

5.1.3 Half Consonants

H is a Halant used for denoting consonants without the inherent vowel or half consonants. The concept of half consonant is based on the orthographical representations of Indian languages which are essentially syllabic in nature. Each letter representing a consonant has an inherent vowel associated with it, usually, schwa or ‘a’. A true consonant i.e., without any vowel associated with it is marked by a special ligature known as Halant. Thus, a structure of the form HCV is essentially a consonant followed by a CV, or in other words, a consonant cluster followed by a vowel.

Table 3: Half Consonants

The Half sounds which are not present in the above mentioned list are denoted by 0C. 0C is a sound of consonant without the inherent vowel schwa or ‘a’. the half sound used in Hindi module are shown in Table 4.

Table 4: Half sounds for Hindi Module

(7)

IJRIT International Journal Of Research In Information Technology, Volume 3, Issue 8, August 2015, Pg. 1-10

By above standards, the phonetic transcription of the input text can be easily done. For example let the word is

‘eqckjd’, its phonetic transcription is shown in Table 5.

Table 5: Phonetic Transcription of word ‘eqckjd’

(8)

IJRIT International Journal Of Research In Information Technology, Volume 3, Issue 8, August 2015, Pg. 1-10

Thus, phonetic transcription of the word ‘eqckjd’ is “m5b2rk”. Similarly, phonetic transcription of the some sample Hindi words are shown in Table 6.

Table 6: Phonetic Transcription of Hindi Words

5.2 Algorithm for generating the sequence

1. Consider 0 to n values where n is a positive integer.

2. Convert each value into quaternary form.

3. Represent the values into a matrix form (n+1)*(digit number).

4. Subtract 2 from each element of matrix.

5. Consider a random matrix key.

6. Multiply output of step 4 with random matrix key.

7. Convert all negative values to -2, positive to 2 and zero by 0.

8. Add 2 to each element of output of step 7.

9. Convert quaternary values from step 8 into decimal to generate the sequence.

5.3 Encryption and Decryption Process

Table 7: Encryption of word ‘£jx®“k’

Plaintext £ j x® “k

Phonetic Transcription

Kh r g,11 sh

Alpha Numeric Equivalent

21,18 28 17, - 18,29

(9)

IJRIT International Journal Of Research In Information Technology, Volume 3, Issue 8, August 2015, Pg. 1-10

Key 10 2 2 2

add 31,28 30 19,- 20,31

Mod 36 31,28 30 19,- 20,31

Cipher Text u,r t i,11 j,u

Table 8: Decryption of word ‘£jx®“k’

Cipher Text

u.,r t i,11 j,u

Alpha Numeric Equivalent

31,28 30 19,- 20,31

Mod 36 31,28 30 19,- 20,31

Key 10 2 2 2

Subtract 21,28 28 17,- 18,29

Phonetic Transcription

Kh r g,11 sh

Plain Text £ j x® “k

5.4 Conclusion:

Encryption model uses an algorithm which considers a matrix key and to generate a sequence some steps are performed. This model is a new symmetric encryption technique. In this model it is observed that, for a given key, the total computation overhead is high. The complexity of the model by its strength it is exponential in nature and by its construction is O(n). It is also observed that by slight variations in key a lot of variations in cipher text is identified which provides more strength to the proposed model. This algorithm is secure against cipher text only attacks. And key may not be retrieved by any other attack but plain text may be retrieved.

A model to identify the garbled keys in transmission is discussed in chapter 5. This mechanism uses the same algorithm for the generation of sub keys and also for the generation of sequence which is used to generate hash value. And hence hash value will be appended to key while transmitted to the Key Distribution Centre. If any errors takes place in the key while transmission, there will be a mismatch between received hash value and generated hash value and accordingly the participating parties ask for retransmission of key. This facilitates for identifying and rectifying garbled keys while transmission through unreliable network.

In this work in introduction it is conveyed that cryptography ensures security, confidentiality and privacy to the data while transmitting over a network which is not secure. Objectives are made from the literature survey. The only solution to the data integrity, security and authentication is cryptography. There are a lot of cryptographic techniques for English language but this model will provide a cryptographic model that will be used for Hindi

(10)

IJRIT International Journal Of Research In Information Technology, Volume 3, Issue 8, August 2015, Pg. 1-10

language means for securing information containing Hindi text can be encrypted and decrypted by using this work.

The main contribution of my work is the design, analysis and efficiency of a cryptographic model. The proposed algorithm “Key Generation and Key Substitution Algorithm” is designed to ensure security of Hindi Language data either data is on a machine or in a network. Proposed algorithm is based on the important issue i.e. Data Security. Analysis shows the complexity of algorithm and efficiency also. Analysis is done theoretically.

References

Acharya B, Panigrahy S. K, Patra S. K and Panda G, “Image Encryption Using Advanced Hill Cipher Algorithm”, International Journal of Recent Trends in Engineering, Vol. 1, No. 1, 2009, pp. 663-667.

Acharya B, Patra S. K and Panda G, “Involuntary, Permuted and Reiterative Key Matrix Generation Methods for Hill Cipher System”, International Journal of Recent Trends in Engineering, Vol. 1, No. 1, 2009, pp. 106- 108.

Alqdah M and Hui L. Y, “Simple Encryption and Decryption Algorithm”, International Jouranl of Computer Science and Security, Vol. 1, 2008, pp. 14-17.

Kakkar A and Bansal P. K, ‘Reliable Encryption Algorithm used for Communication”, M. E. Thesis, Thapar University, 2004.

Stinson D. R, “Cryptography Theory And Practice”, 3rd Edition Chapman Hall, vol. 1, 2006, pp. 13-37.

Sun H. M, Wu M. E, Ting W. C and Hinek M. J, “Dual RSA and Its Security Analysis”, IEEE Transactions on Information Theory, Vol. 53, no. 8, 2007, pp. 2922-2933.

Torres R. H, Oliveira A, Xexeo J. A. M, Souza W. A. R and Linden R, “Identification of Keys and Cryptographic Algorithm using Genetic Algorithm and Graph Theory”, IEEE Latin America Transactions, Vol.

3, 2011, pp. 178-183.

Verma S, Choubey R and Soni R, “AN Efficient Developed New Symmetric Key Cryptography Algorithm for Information Security” , International Journal of Emerging Technology and Advanced Engineering, Vol. 1, 2012, pp. 18-21.

Wan Z, Liu Jand Deng R. H, “A Hierarchical Attribute-Based Solution for Flexible and Scalable Access Control in Cloud Computing”, IEEE Transaction on Information Forensics and Security, Vol. 7, No. 2, 2012, pp. 743- 754.

Wang K, “An Encrypt and Decrypt Algorithm Implementation on FPGA’s”, International Conference on Semantics, Knowledge and Grid, Vol. 1, 2009, pp. 298-301.

Wang M. Y, Su C. P, Horng C.L and Huang C.T, : Single and Multicore Configurable AES Architectures for

References

Related documents

IJRIT International Journal of Research in Information Technology, Volume 2, Issue 11, November 2014,

Murtadha Ali Nsaif Shukur , IJRIT 139 determines the base station (or base stations) that provides the service to a mobile station—a process that is performed when a mobile

KAMALJIT SINGH BHATIA , IJRIT 213 International Journal of Research in Information Technology (IJRIT) International Journal of Research in Information Technology (IJRIT)

Computer vision, on the other hand, is often considered high-level image processing out of which a machine/computer/software intends to decipher the physical contents of

Load balancing is one of the main challenges in cloud computing which is required to distribute the dynamic workload across multiple nodes to ensure that no single node

IJREAT International Journal of Research in Engineering & Advanced Technology, Volume 2, Issue 6, Dec-Jan, 2015.. Enhancing The Security Of Koblitz’s Method Using

Although a major contribution of C4-derived carbon was not unexpected for the Tana River (given that large parts of the drainage basin consist of savanna grasslands), the

They should be helped to appreciate the achievements of science in showing how the complex and diverse phenomena of the natural world can be described in terms of a number of