A Methodology In Processing Descriptive Analytics Using MMDA Traffic Update Tweets Tokenization And Classification Tree In Discovering Knowledge
Journal: International Journal of Scientific & Technology Research (Vol.5, No. 12)Publication Date: 2016-12-15
Authors : Tristan Jay P. Calaguas; Menchita F. Dumlao;
Page : 64-68
Keywords : tokenization; classification; tweets; traffic; methodology; knowledge discovery; update traffic;
Abstract
Traffic on National Capital Region of the Philippines is going as one of many problems facing by the local government and Filipino citizen who are residing in Metro Manila. In addition a Filipino citizen that is working in Metro Manila is experiencing a waste of Twenty Eight Thousand hours in traffic which results unproductivity. Due to traffic that causes long commutes it take away an individual from exercise activities that results fatigue in their health. In relation with this due to lack of exercise that causing by the traffic each year One Hundred Seventy Thousand Filipinos die from cardiovascular diseases up from Eighty Five Thousand more than Twenty years ago according to 2009 study by the Department of Health DOH. Population increase is one of many causes of traffic in Metro Manila. As population is growing the more car riders and commuters volume will be in the road including delivery trucks Pedi cabs jeeps and provincial buses that signify that there is a high employment rate in the country that causes traffic. However to sustain the public needs MMDA is the government agency that provides public services to Filipino citizens through providing updated public traffic information. For past years MMDA used Telephony lines and Television Broadcasting for traffic information dissemination which is very costly in maintenance that made them to adopt Twitter to post Traffic updates and advisories to the public .Since this government agency uses Twitter in disseminating information through posting tweet there is a need for a methodology on how these tweets will analyze so that citizens will have an insight in decision making to avoid specific time of traffic in metro manila. From this condition the researcher will adopt the use of MMDA tweets as the primary data source and apply the CRISP as the knowledge discovery standard processes that to be used in building methodology for descriptive analytics. In this experimental research several processes were used to convert the semi structured MMDA tweets into structured data matrix. SQL was used for storing retrieving and pattern matching while PHP string functions were used to tokenize the tweet and transform it into array so that the tokens can store in database using iterative structure. After loading all token to its specific table we abled to have a data matrix that comprised of time routed roads traffic status and day information that was used in data mining to discover knowledge. Lastly we used J48 Classification Algorithm to classify the time usually the traffic happens in many routed roads from NCR. As the result we discovered that from Eight Oclock to Nine Forty One in the morning the commuters are experiencing a traffic and from One O Clock in the afternoon to Eight OClock in the evening the commuters are also experiencing a traffic in C5 North Bound to South Bound and Edsa North Bound to South Bound every Tuesday and Friday with the accuracy of 75.72.
Other Latest Articles
- The Effect Of Resting Time To Work Productivity On Surya Hutani Jaya Company
- Cytotoxic Effect Of Verapamil On Human Embryonic Kidney Cell Line
- Impact Of Heating Temperature On Primary And Secondary Breakup Of Pure Vegetable Oil Spray
- A Research On Fibre Optic Data Transmission
- National Innovation System And Culture A Cross-Country Analysis
Last modified: 2017-06-11 22:53:58