Preprocessing of Low Response Data for Predictive Modeling
Journal: International Journal of Trend in Scientific Research and Development (Vol.3, No. 3)Publication Date: 2019-10-06
Authors : Farzana Naz Imaad Shafi Md Kamre Alam;
Page : 157-160
Keywords : Computer Engineering; Logistic Regression; Datasets; Principal component analysis; Variable Reduction;
Abstract
For training a model, the raw data have to go through various preprocessing phases like Cleaning, Missing Values Imputation, Dimension Variable reduction, and Sampling. These steps are data and problem specific and affect the accuracy of the model at a very large extent. For the current scenario, we have 2.2M records with 511 variables. This data was used in a Direct Mail Campaign of some Life Insurance Products and now we know which record had a positive response for the campaign. Rows records 2,259,747 Columns 511 Rows with positive response 2,739, i.e. Response Rate 0.1212 . The dataset is not complete, i.e. we have to take care of missing values. Farzana Naz | Imaad Shafi | Md Kamre Alam ""Preprocessing of Low Response Data for Predictive Modeling"" Published in International Journal of Trend in Scientific Research and Development (ijtsrd), ISSN: 2456-6470, Volume-3 | Issue-3 , April 2019, URL: https://www.ijtsrd.com/papers/ijtsrd21667.pdf
Paper URL: https://www.ijtsrd.com/engineering/computer-engineering/21667/preprocessing-of-low-response-data-for-predictive-modeling/farzana-naz
Other Latest Articles
- Content Based Image Retrieval An Assessment
- RFID Technology Adoption Rate in Warehousing A Study of Manufacturing Companies in Johor
- İkinci Dünya Savaşı Sırasında Türkiye’de Gerçekleştirilen Esir Değişimlerinin Dönemin Basınında Sunumu
- Environmental Impact of Geothermal Power Plant
- Violence against Women with Special Reference to Domestic Violence Act, 2005
Last modified: 2019-06-11 15:40:35