ResearchBib Share Your Research, Maximize Your Social Impacts
Sign for Notice Everyday Sign up >> Login

A Genome Scaffold Algorithm for Extending Repetitive and Non-repetitive Contigs

Journal: Journal of Interdisciplinary Sciences (Vol.1, No. 1)

Publication Date:

Authors : ; ;

Page : 49-60

Keywords : The Next Gene Sequencing; Gene Assembly; Hash Index; Mapping Conditions; Repetitive and Non-repetitive Contigs;

Source : Downloadexternal Find it from : Google Scholarexternal

Abstract

The next generation sequencing platform produces massive short reads data. It is needed to assemble these short reads for further analysis. Unfortunately, gene assembly algorithm is still faced by a variety of complex challenges, especially for repetitive sequences. Based on De Bruijn Graph and SWA, we proposed a new genome scaffold algorithm named HashRepScaffold for extending repetitive contigs and non-repetitive contigs. By aligning original reads back to both ends of repetitive contigs and non-repetitive contigs, HashRepScaffold finds the best repetitive contig to exend the non-repetitive contig. Finally, simulation study was used to evaluate the performances of HashRepScaffold. The results indicated that HashRepScaffold have an excellent performances in extending repetitive and non-repetitive contigs.

Last modified: 2018-04-13 00:18:08