ON THE HIGH PERFORMANCE COMPUTING FOR MOTIF DISCOVERY IN DNA SEQUENCES
Journal: International Journal of Advanced Research (Vol. 6, No. 7)
Publication Date: 2018-07-10
Authors: Farrukh Arslan
Pages: 880-887
Keywords: motif discovery; R programming language; high performance computing; random projection
Abstract
Motif discovery in DNA sequences is one of the most important research problems in bioinformatics, and an algorithm that is both accurate and fast has long been a goal of research on this problem. This study therefore modifies the random projection algorithm so that it can be implemented with a high performance computing technique, namely the R package pbdMPI. The study focuses on the steps needed to achieve this objective: preprocessing the data, splitting the data according to the number of batches, modifying and implementing random projection in the pbdMPI package, and aggregating the results. To validate the approach, experiments were conducted on several benchmark datasets, with a sensitivity analysis on the number of cores and batches. The experimental results show that the computational cost can be reduced. The proposed approach can thus be used for motif discovery effectively and efficiently.
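The four steps named in the abstract (preprocess, split into batches, project, aggregate) can be illustrated with a minimal sketch. It is written in plain Python and is not the paper's R/pbdMPI implementation. It shows only the bucket-counting stage of random projection and omits any refinement step. All function names, the ACGT-only filter, and the choice to split by sequence are assumptions made for illustration, since the abstract does not specify them.

```python
import random
from collections import Counter


def preprocess(seqs):
    """Uppercase sequences and drop any containing non-ACGT characters (assumed filter)."""
    cleaned = [s.strip().upper() for s in seqs]
    return [s for s in cleaned if s and set(s) <= set("ACGT")]


def split_batches(items, n_batches):
    """Split items into n_batches contiguous chunks of nearly equal size."""
    size, extra = divmod(len(items), n_batches)
    batches, start = [], 0
    for i in range(n_batches):
        end = start + size + (1 if i < extra else 0)
        batches.append(items[start:end])
        start = end
    return batches


def project_batch(batch, l, positions):
    """Hash every l-mer in the batch by its letters at the projected positions."""
    buckets = Counter()
    for seq in batch:
        for i in range(len(seq) - l + 1):
            lmer = seq[i:i + l]
            key = "".join(lmer[p] for p in positions)
            buckets[key] += 1
    return buckets


def aggregate(partials):
    """Sum the per-batch bucket counts into one table."""
    total = Counter()
    for counts in partials:
        total.update(counts)
    return total


def random_projection(seqs, l, k, n_batches, seed=0):
    """Run one projection trial: choose k of the l positions, count buckets per batch, merge."""
    rng = random.Random(seed)
    positions = sorted(rng.sample(range(l), k))
    batches = split_batches(preprocess(seqs), n_batches)
    # In an MPI setting each batch would be handled by a separate rank;
    # here the batches are processed sequentially for simplicity.
    partials = [project_batch(b, l, positions) for b in batches]
    return positions, aggregate(partials)
```

Because each batch yields an independent count table and aggregation is a plain sum, the merged buckets do not depend on how many batches are used. That property is what makes the counting stage straightforward to distribute across cores. Heavily populated buckets would then be candidates for motif instances.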