Efficient Parallel Compression and Decompression for Large XML Files
Journal: The International Arab Journal of Information Technology (Vol.13, No. 4)Publication Date: 2016-07-01
Authors : Mohammad Ali; Minhaj Ahmad Khan;
Page : 403-408
Keywords : XML; distributed computing; XML compression; GZIP; performance;
Abstract
EXtensible Markup Language (XML) is gaining popularity and is being used widely on internet for storing and exchanging data. Large XML files when transferred on network create bottleneck and also degrade the query performance. Therefore, efficient mechanisms of compression and decompression are applied to XML files. In this paper, an algorithm for performing XML compression and decompression is suggested. The suggested approach reads an XML file, removes tags, divides the XML file into different parts and then compresses each different part on a separate core for achieving efficiency.We compare performance results of the proposed algorithm with parallel compression and decompression of XML files using GZIP. The performance results show that the suggested algorithm performs 24%, 53% and 72% better than the parallel GZIP compression and decompression on Intel Xeon, Intel core i7 and Intel core i3 based architectures respectively.
Other Latest Articles
- Selectivity Estimation of Range Queries in Data Streams using Micro-Clustering
- Control and Management of Coal Mines with Control Information Systems
- Threshold-based Steganography: A Novel Technique for Improved Payload and SNR
- Task Scheduling Using Probabilistic Ant Colony Heuristics
- Arabic/Farsi Handwritten Digit Recognition using Histogram of Oriented Gradient and Chain Code Histogram
Last modified: 2019-11-13 21:51:54