Machine Learning Techniques for Building a Large Scale Production Ready Classifier
Proceeding: The Second International Conference on Data Mining, Internet Computing, and Big Data (BigData2015)Publication Date: 2015-06-29
Authors : Arthi Venkataraman;
Page : 1-16
Keywords : Natural Language Processing; Machine Learning; Active Learning; Classification; Clustering; Name Entity Recognition; Committee-based approach; Cluster than Label; Shallow semantic parsing; Ontology; Cognitive Process Automation;
Abstract
This paper brings out the various techniques we have followed to build a production ready scalable classifier system to classify the tickets raised by employees of an organization. The end users raise the tickets in Natural language which is then automatically classified by the classifier. This is a practical applied research paper in the area of machine learning. We have applied different machine learning techniques like active learning for improving the accuracy of the prediction and have used clustering for handling the data issues found in the training data. The approach we used for the core classifier combined the results of multiple machine learning algorithms using suitable scoring techniques. Use of this system has given more than 50% improvement in the tickets re-assignment index and more than 80% accuracy has been achieved in correctly identifying the classes for the tickets. The system is able to perform at scale, has response times well within the expectations and handles the peak load. Key takeaways from this paper include: How to build live production ready classifier system How to overcome the data related challenges while building such a system Solution architecture for the classifier system Deployment architecture for the classifier system Being prepared for the kind of post deployment challenges one can face for such a system Benefits of building such a system include Improved Productivity, improved End user experience and quick turnaround time.
Other Latest Articles
- A PSEUDOPHYLLIDEAN SENGA MADHUKAR II SP. NOV. FROM A FRESHWATER FISH MASTACEMBELUS ARMATUS FROM GODAVARI BASIN MAHARASHTRA STATE (INDIA)
- AN ANALYSIS OF ATTITUDE OF SECONDARY SCHOOL TEACHERS TOWARDS CONTINUOUS COMPREHENSIVE EVALUATION
- ‘Characterisation of seed-oil of Citrullus colocynthis (L.) Schard.’
- A STUDY ON CAREER MATURITY OF SECONDARY SCHOOL STUDENTS IN RELATION TO SCHOOL MANAGEMENT
- A STUDY ON CAREER MATURITY OF SECONDARY SCHOOL STUDENTS IN RELATION TO SCHOOL MANAGEMENT
Last modified: 2015-07-11 16:52:06