Lossless Text Compression Technique Using Syllable Based Morphology
Journal: The International Arab Journal of Information Technology (Vol.8, No. 1)Publication Date: 2011-01-01
Authors : Ibrahim Akman Hakan Bayindir Serkan Ozleme Zehra Akin Sanjay Misra;
Page : 66-74
Keywords : Algorithm; text compression technique; syllable; multi-syllabic languages;
Abstract
In this paper, we present a new lossless text compression technique which utilizes syllable-based morphology of multi-syllabic languages. The proposed algorithm is designed to partition words into its syllables and then to produce their shorter bit representations for compression. The method has six main components namely source file, filtering unit, syllable unit, compression unit, dictionary file and target file. The number of bits in coding syllables depends on the number of entries in the dictionary file. The proposed algorithm is implemented and tested using 20 different texts of different lengths collected from different fields. The results indicated a compression of up to 43%
Other Latest Articles
- AModel for English to Urdu and Hindi Machine Translation System using Translation Rules and Artificial Neural Network
- Novel Robust Multilevel 3D Visualization Technique for Web Based GIS
- Social Issues in Wireless Sensor Networks with Healthcare Perspective
- A Steganography Scheme on JPEG Compressed Cover Image with High Embedding Capacity
- Binary Phoneme Classification Using Fixed and Adaptive Segment Based Neural Network Approach
Last modified: 2019-04-28 18:17:16