Distribution of Unique Sequences in the Human Genome
Journal: Austin Journal of Computational Biology and Bioinformatics (Vol.2, No. 1)Publication Date: 2015-02-16
Authors : Kazuharu Misawa;
Page : 1-4
Keywords : Unique sequences; human genome; hash code; centromere; chromosome terminal region;
Abstract
Programmable sequence-specific endonucleases are powerful tools for genome alteration with high precision. For example, the CRISPR system is an efficient tool for genome engineering in eukaryotic cells by simply specifying a 20-bp targeting sequence within its guide RNA. When studying large genomes, however, the design of target sequences is complicated by the redundancy of sequences. The distribution of unique sequences in the genome is of interest. In this paper, I describe the development of a novel method, UF, for detecting unique 20-bp sequences in entire genomes. UF stands for "Unique Finder". By using UF, the distribution of unique sequences in the human genome was investigated. It was found that 60% of the human genome is unique on average. However, non-unique regions of human genome are concentrated on centromeres and terminal regions of the chromosomes. The proportions of unique sequences are about 80% in the rest part of the genome. The program for obtaining unique sequences is available at https://sourceforge.jp/projects/ parallelgwas/releases/
Other Latest Articles
- EVALUATION OF PERMEABILITY OF PERVIOUS CONCRETE WITH ADDING FINE AGGREGATE
- HYDRO-GEOLOGY, GROUNDWATER RESOURCES AND ANALYSIS OF POTENTIAL AQUIFERS IN MALANI IGNEOUS SUITE- A CASE STUDY OF SIROHI DISTRICT OF RAJASTHAN
- Computational Structure Analysis and Function Prediction of an Uncharacterized Protein (I6U7D0) of Pyrococcus furiosus COM1
- ENHANCED SEISMIC RESISTANCE OF STEEL BUILDINGS USING VISCOUS FLUID DAMPERS
- Graphics Card Processing: Acceleration of Multiple Sequence Alignment
Last modified: 2017-02-22 20:26:11