Tracking Recurring Concepts from Evolving Data Streams using Ensemble Method
Journal: The International Arab Journal of Information Technology (Vol.16, No. 6)Publication Date: 2019-11-01
Authors : Yange Sun; Zhihai Wang; Jidong Yuan; Wei Zhang;
Page : 1044-1052
Keywords : Data streams; ensemble classification; change detection; recurring concept; Jensen-Shannon divergence.;
Abstract
Ensemble models are the most widely used methods for classifying evolving data stream. However, most of the existing data stream ensemble classification algorithms do not consider the issue of recurring concepts, which commonly exist in real-world applications. Motivated by this challenge, an Ensemble with internal Change Detection (ECD) was proposed to enhance performance by exploring the recurring concepts. It is done by maintaining a pool of classifiers, which dynamically adds and removes classifiers in response to the change detector. The algorithm adopts a two window change detection model, which adopts the Jensen-Shannon divergence to measure the distance of the distributions between old and recent data. When a change is detected, the repository of stored historical concepts is checked for reuse. Experimental results on both synthetic and real-world data streams demonstrate that the proposed algorithm not only outperforms the state-of-art methods on standard evaluation metrics, but also adapts well in different types of concept drift scenarios especially when concept s reappear.
Other Latest Articles
- Pneumonia in patients with chronic kidney disease V D stage: pathogenetic aspects of complex therapy and outcomes
- Block Size Analysis for Discrete Wavelet Watermarking and Embedding a Vector Image as a Watermark
- A Comprehensive Study of Modern and High Speed TCP-Variant in Linux Kernel: TCP CUBIC
- PLDL: A Novel Method for Label Distribution Learning
- A New Approach to Improve Association Rules for Big Data in Cloud Environment
Last modified: 2019-11-11 21:39:40