PSSM amino-acid composition based rules for gene identification
Journal: International Journal of Advanced Technology and Engineering Exploration (IJATEE) (Vol.5, No. 46)Publication Date: 2018-08-26
Authors : Heena Farooq Bhat; M. Arif Wani;
Page : 318-325
Keywords : Gene prediction; Classification; Feature extraction; Binding proteins; Rule induction; PSSM.;
Abstract
One of the major aspects in recognizing the molecular mechanism of the cell is to understand the significance or function of each protein encoded in the genome. For that purpose, genome annotation proves to be very supportive. One of the most obligatory phases of genome annotation is the prediction of the genes. Several methods or techniques have been developed in order to locate or predict the patterns of genes in genome sequence. However, still, the recognition of genes is found to be very complicated problem. Recognizing the corresponding gene of a given protein sequence by means of conventional tools is error prone. Hence, the recognition of genes is a very demanding task. In this paper, we first concentrate on the problem of gene prediction and its challenges. We then present a new method for identifying genes. This new method follows a two-step procedure. First, we present new features extracted from protein sequences and these features are derived from a position specific scoring matrix (PSSM). The PSSM profiles are converted into uniform numeric representation. Then, a new structured approach has been applied on PSSM vector which uses a decision tree based technique for obtaining rules. The rules derived from an algorithm correspond to genes. This new method has been demonstrated on genome DNAset dataset. It is observed that the experimental results of new approach produces better results.
Other Latest Articles
- Modeling simulation and performance evaluation of low voltage power line communication channel
- Comparison of Mechanical Properties of Austempered, Normalized and As-Weld Carbon Steel Weldment
- Analysis of the dyscalculia with the implementation of a design workshop in CATIA in a primary school in Puebla
- Modelling the Increasing Differential Effects of the First Inter-Competition Coefficient on the Biodiversity Value; Competition between two Phytoplankton Species
- Situational analysis of the subjective well-being of university software developers in Puebla
Last modified: 2018-11-04 14:43:28