N-Gram Analysis in SVM Training Phase Reduction Using Dataset Feature Filtering for Malware Detection
Journal: International Journal of Science and Research (IJSR) (Vol.3, No. 9)Publication Date: 2014-09-05
Authors : Pagidimarri Venu; Dasu Vaman Ravi Prasad;
Page : 550-554
Keywords : n-gram analysis; malware variants; kernel trick; SVM; WEKA tool;
Abstract
An n-gram is a sub-sequence of n items from a given sequence. Various areas of statistical natural language processing and genetic sequence analysis are using N-gram Analysis. In which sequence analysis is the process of comparing the sequence or series of attributes in order to find the similarity. Malicious software that is designed by attackers for disturbing computers is called as malware. The principal belong to the same family of malware eventhough Malware variants will have distinct byte level representations. The byte level content is different because small changes to the malware source code can result in significantly different compiled object code. In which programs are used as operational code (opcode) density histograms obtained through dynamic analysis. The process of testing and evaluation of application or a program during running time is called as dynamic analysis. A SVM is used for classification or regression problems. Kernel trickis a technique by SVM to transform your data and then based on these transformations it finds an optimal boundary between the possible outputs. We employ static analysis to classify malware which is identified a prefilter stage using hex values of files, that can reduce the feature set and therefore reduce the training effort. The result shows that the relationships between features are complex and simple statistics filtering approaches do not provide a Practical approach. One of the approach, hex decimal based produces a suitable filter. The entire system will be implemented in WEKA tool.
Other Latest Articles
- Overview of Rice and Sugarcane Cultivation in Akole Tahasil Dist. A'Nagar (M.S, India)
- Seroprevalence of Paratuberculosis in Goats and Sheep in Arusha, Northern Tanzania
- Silk Suture Technique: A Simple, Easy, Effective Treatment of Wrist Ganglion and Review of Literature
- Exposure Prevalence of the Nursing Staff to the Occupational Risks in the Regional Hospital of Vlora in 2013
- Effects of Long Term Exposure to a 2G Cell Phone Radiation (900 - 1900 MHz) on Mouse Testis
Last modified: 2021-06-30 21:07:44