An interpretable ensemble model framework for real-time anomaly detection and prediction of Ethereum blockchain transactions
Journal: International Journal of Advanced Technology and Engineering Exploration (IJATEE) (Vol.10, No. 103)Publication Date: 2023-06-30
Authors : Sabri Hisham Mokhairi Makhtar; Azwa Abdul Aziz;
Page : 676-695
Keywords : Ethereum; Blockchain; Features extraction; Ensemble method; Anomaly detection.;
Abstract
The blockchain ecosystem is often referred to as a technology that ensures security. However, there have been concerns in the real world regarding the security of blockchain applications, like what happens with conventional database systems. The anonymous design of blockchain provides cyber-attackers with opportunities to commit crimes, resulting in an increase in scams, phishing, code manipulation of smart contracts, Ponzi schemes, and other fraudulent activities. Consequently, many individuals and national economies worldwide have suffered significant losses. Detecting fraudulent behavior in blockchain transactions manually is infeasible due to the enormous amount of data involved. Therefore, the optimal method for identifying abnormalities within the blockchain network is to combine a blockchain platform with a machine learning approach. This study employs filter method techniques such as mutual information (MI), analysis of variance (ANOVA), and recursive feature elimination (RFE) to identify the ideal set of features based on the maximum accuracy value, considering the feature dimension (k value). The study screens and ranks the top 10 feature sets using the feature importance random forest (RF) classifier, based on the dataset produced by the best filter approach (yielding higher accuracy). Subsequently, an ensemble methodology is used to create the final model, utilizing the final dataset consisting of 10 features. The purpose of this approach is to enhance the level of anomaly detection in the blockchain network. To determine the effectiveness of the proposed model, experiments are conducted, comparing it against individual classifiers such as extreme gradient boosting (XGB), decision tree (DT), logistic regression (LR), random forest (RF), and k-nearest neighbor (KNN). The study's findings reveal that the ensemble voting approach achieves a 96.78% accuracy rate, surpassing the accuracy of the individual classifier models that utilize optimal features. Additionally, the study's findings suggest that the selection of features and their quantity significantly impact the output of the model.
Other Latest Articles
- Trusted Digital Repositories in Context: Definitions, Diagnoses, and Directions for Better Practice
- The Canadian Archivist Douglas Brymner and his «Noble Dream»
- A Memorandum to the Swedish Prime Minister in 1921: a Brief Introduction to Ukrainian History in the National Archives of Sweden
- Repressive Policy of the Russian Government Towards the Participants of the Movements for the Restoration of the Poland Statehood (1830–1874)
- On the Place of Zhovkva and its Surroundings: the Documents on the History of the Estate of Wysocki & Żółkiewski Families in Lviv Land in the 15th–16th Centuries (Part 2)
Last modified: 2023-07-10 18:05:41