ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

Brief Survey on DNA Sequence Mining

Journal: International Journal of Computer Science and Mobile Computing - IJCSMC (Vol.2, No. 11)

Publication Date:

Authors : ;

Page : 129-134

Keywords : Frequent patterns; DNA; Tandem Repeat; Motifs; KDD;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

Sequence Mining is one of the most commonly used technique in data mining. Sequence mining is the process of mining frequent patterns from a large datasets. The exiting algorithms have some limitations in predicting frequent patterns, in terms of time, space complexity and accuracy. To overcome these drawbacks, this paper made a study on existing sequence mining algorithms and generate a new algorithm for generating frequent patterns from the biological sequences(DNA)..This paper attempt to locate all the tandem repeats in a DNA sequence. A repeated substring is called a tandem repeat if each occurrence of the substring is directly adjacent to each other. The future scope of this paper is not only predicting the frequent patterns; but will also satisfy some factors such as: space complexity, time and predict accurate solution to the required problem. With the help of these three things into consideration an effective algorithm can be defined for predicting the tandem repeat in a given DNA sequence.

Last modified: 2013-11-22 02:52:10