Feature Selection Algorithm Based on Correlation between Muti Metric Network Traffic Flow Features

Journal: The International Arab Journal of Information Technology (Vol.14, No. 3)

Page : 362-371

Keywords : Port identification; deep packet inspection; netflow flow; machine learning

Abstract

Traffic identification has been an active research topic in recent years. To overcome the shortcomings of port-based identification and Deep Packet Inspection (DPI), machine learning algorithms have gained wide attention. However, current research focuses on traffic identification based on full-packet datasets, which makes identifying online traffic flows a great challenge. One way to overcome this shortcoming is to treat sampled flow records as the identification object. In this paper, the flow record dataset NOC_SET is constructed, and inherent NETFLOW metrics and extended flow metrics are used as features. The paper proposes a feature selection algorithm, MSAS, to select features with high correlation, and classical machine learning algorithms are then used to identify traffic. Experimental results show that machine learning flow identification based on sampled flow records achieves almost the same identification results as methods based on full-packet datasets, and that the proposed feature selection algorithm MSAS can improve application identification results.
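
The abstract does not describe how MSAS works internally, so the following is only a generic sketch of correlation-based feature selection, not the authors' algorithm. It assumes that each flow record is a mapping from metric name to a numeric value and that application classes are encoded as numbers; the field names (packets, bytes, duration_ms), the sample records, and the helper functions pearson and select_features are all hypothetical and chosen for illustration.

```python
import math


def pearson(xs, ys):
    """Pearson correlation coefficient of two equal-length numeric sequences."""
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        return 0.0  # a constant column carries no correlation signal
    return cov / math.sqrt(var_x * var_y)


def select_features(records, labels, k):
    """Return the k feature names with the highest absolute correlation to the label.

    This is a plain filter-style ranking, used here as a stand-in; it is NOT MSAS.
    """
    names = list(records[0].keys())
    scores = {
        name: abs(pearson([r[name] for r in records], labels)) for name in names
    }
    return sorted(names, key=lambda name: scores[name], reverse=True)[:k]


# Hypothetical sampled flow records (made-up values, not taken from NOC_SET).
flows = [
    {"packets": 10, "bytes": 1200, "duration_ms": 50},
    {"packets": 12, "bytes": 1500, "duration_ms": 50},
    {"packets": 200, "bytes": 300000, "duration_ms": 50},
    {"packets": 180, "bytes": 250000, "duration_ms": 50},
]
labels = [0, 0, 1, 1]  # hypothetical numeric encoding of two application classes

selected = select_features(flows, labels, k=2)
print(selected)
```

In this toy data the duration_ms column is constant, so it scores zero and is dropped, while the two volume metrics are kept. The selected columns would then be passed to whichever classical classifier is being evaluated.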

Last modified: 2019-05-08 18:27:00