Brief Survey on DNA Sequence Mining
Journal: International Journal of Computer Science and Mobile Computing - IJCSMC (Vol.2, No. 11)
Publication Date: 2013-11-30
Authors : NN Das Poonam;
Page : 129-134
Keywords : Frequent patterns; DNA; Tandem Repeat; Motifs; KDD;
Sequence mining is one of the most commonly used techniques in data mining. It is the process of mining frequent patterns from large datasets. The existing algorithms have limitations in predicting frequent patterns, in terms of time, space complexity, and accuracy. To overcome these drawbacks, this paper surveys existing sequence mining algorithms and develops a new algorithm for generating frequent patterns from biological (DNA) sequences. The paper attempts to locate all the tandem repeats in a DNA sequence. A repeated substring is called a tandem repeat if each occurrence of the substring is directly adjacent to the next. The future scope of this work is not only to predict frequent patterns but also to satisfy three factors: space complexity, time, and accuracy of the solution. Taking these three factors into consideration, an effective algorithm can be defined for predicting the tandem repeats in a given DNA sequence.
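The definition of a tandem repeat given in the abstract can be illustrated with a short Python sketch. This is not the algorithm proposed in the paper, which the abstract does not describe; it is a naive brute-force scan, and the function names and the parameters `min_unit`, `max_unit`, and `min_copies` are hypothetical choices made only for this illustration.

```python
from typing import List, Tuple


def is_primitive(unit: str) -> bool:
    """True if `unit` is not itself a repetition of a shorter string."""
    # A string is a repetition of a shorter string exactly when it occurs
    # inside its own doubling with the first and last characters removed.
    return unit not in (unit + unit)[1:-1]


def find_tandem_repeats(
    seq: str, min_unit: int = 1, max_unit: int = 6, min_copies: int = 2
) -> List[Tuple[int, str, int]]:
    """Return (start, unit, copies) for runs of directly adjacent copies.

    Illustrative brute-force scan (an assumption of this sketch, not the
    paper's method): for each unit length, walk the sequence and count how
    many times the unit at the current position repeats back to back.
    """
    seq = seq.upper()
    n = len(seq)
    results = []
    for unit_len in range(min_unit, max_unit + 1):
        i = 0
        while i + unit_len * min_copies <= n:
            unit = seq[i:i + unit_len]
            if not is_primitive(unit):
                # e.g. "AA" is already covered by the shorter unit "A"
                i += 1
                continue
            copies = 1
            j = i + unit_len
            while seq[j:j + unit_len] == unit:
                copies += 1
                j += unit_len
            if copies >= min_copies:
                results.append((i, unit, copies))
                # Skip rotations of the same run (e.g. "CA" inside "ACACAC")
                # while still allowing a different repeat to start near its end.
                i = j - unit_len + 1
            else:
                i += 1
    return sorted(results)


print(find_tandem_repeats("ACGACGACGTT", max_unit=3))
# → [(0, 'ACG', 3), (9, 'T', 2)]
```

Here `ACG` occurs three times in a row starting at position 0, and `T` occurs twice in a row starting at position 9. The sketch favors readability and is not tuned for the time and space concerns the abstract raises.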