ON THE HIGH PERFORMANCE COMPUTING FOR MOTIF DISCOVERY IN DNA SEQUENCES

Journal: International Journal of Advanced Research (Vol.6, No. 7)

Publication Date:

Authors : ;

Page : 880-887

Keywords : Motif discovery; R programming language; high performance computing; random projection

Abstract

Motif discovery in DNA sequences is one of the most important research problems in bioinformatics, and algorithms that are both accurate and fast have long been a goal of research on it. This study modifies the random projection algorithm so that it can be run with a high performance computing technique, namely the R package pbdMPI. The study focuses on the steps needed to achieve this: preprocessing the data, splitting the data according to the number of batches, modifying and implementing random projection with the pbdMPI package, and aggregating the results. To validate the approach, experiments were conducted on several benchmark data sets, with a sensitivity analysis on the number of cores and batches. The experimental results show that the computational cost can be reduced. The proposed approach can therefore be used for motif discovery effectively and efficiently.
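The four steps named in the abstract (preprocess, split into batches, project, aggregate) can be illustrated with a minimal sketch. It is written in Python rather than R and does not use pbdMPI, so it is not the authors' implementation. All function names, the round-robin batching scheme, and the parameters `l` (motif length) and `k` (number of projected positions) are assumptions made for illustration. The sketch covers only the bucket-counting stage of random projection. The refinement stage that normally follows is omitted.

```python
import random
from collections import Counter


def preprocess(sequences):
    """Uppercase the input and keep only sequences made of A, C, G, T.

    Assumption: the abstract does not say what preprocessing involves.
    """
    cleaned = [s.upper() for s in sequences]
    return [s for s in cleaned if s and set(s) <= set("ACGT")]


def split_batches(sequences, n_batches):
    """Distribute sequences round-robin over n_batches lists (hypothetical scheme)."""
    return [sequences[i::n_batches] for i in range(n_batches)]


def project(lmer, positions):
    """Hash an l-mer to the letters found at the chosen positions."""
    return "".join(lmer[i] for i in positions)


def bucket_counts(sequences, l, positions):
    """Count how many l-mers of one batch fall into each projection bucket."""
    counts = Counter()
    for seq in sequences:
        for start in range(len(seq) - l + 1):
            counts[project(seq[start:start + l], positions)] += 1
    return counts


def random_projection_round(sequences, l, k, n_batches, seed=0):
    """Run one projection round batch by batch and merge the bucket counts.

    Every batch uses the same k positions, so the per-batch counts are
    additive. In a parallel setting each batch could go to its own worker;
    here the batches are processed serially to keep the sketch simple.
    """
    rng = random.Random(seed)
    positions = sorted(rng.sample(range(l), k))
    total = Counter()
    for batch in split_batches(sequences, n_batches):
        total.update(bucket_counts(batch, l, positions))  # aggregation step
    return positions, total
```

Because the counts are additive, the merged result does not depend on the number of batches, only the amount of work per batch does. Calling `total.most_common(5)` on the returned counter lists the most heavily populated buckets, which are the usual starting candidates for motif refinement.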

Last modified: 2018-08-22 18:50:53