A STUDY ON MINING PRODUCT OPINIONS AND REVIEWS ON THE WEB USING WEB SCRAPING
Journal: International Journal of Engineering Sciences & Research Technology (IJESRT) (Vol.7, No. 3)Publication Date: 2018-03-31
Authors : Nilesh Kumar Dokania; Jaspreet Kaur;
Page : 365-373
Keywords : Web mining; Web scraping; Innovation; R&D; Web content mining; structured data mining; unstructured data mining; semi-structured data mining.;
Abstract
As enterprises expand and post increasing information about their business activities on their websites, website data promises to be a valuable source for investigating innovation. This article examines the practicalities and effectiveness of web mining as a research method for innovation studies. We use web mining to explore the R&D activities of 296 UK-based green goods small and mid-size enterprises. We find that website data offers additional insights when compared with other traditional unobtrusive research methods, such as patent and publication analysis. We examine the strengths and limitations of enterprise innovation web mining in terms of a wide range of data quality dimensions, including accuracy, completeness, currency, quantity, flexibility and accessibility. We observe that far more companies in our sample report undertaking R&D activities on their web sites than would be suggested by looking only at conventional data sources. While traditional methods offer information about the early phases of R&D and invention through publications and patents, web mining offers insights that are more downstream in the innovation process. Handling website data is not as easy as alternative data sources, and care needs to be taken in executing search strategies. Website information is also self-reported and companies may vary in their motivations for posting (or not posting) information about their activities on websites. Nonetheless, we find that web mining is a significant and useful complement to current methods, as well as offering novel insights not easily obtained from other unobtrusive sources. Web Mining is extracting information from the web re-sources and finding interesting patterns that can be useful from ever expanding database of World Wide Web. Whenever we talk about data, we conclude that there is a huge range of data on World Wide Web. Due to heterogeneity and unstructured nature of the data available on the WWW, Web mining uses various data mining techniques to discover useful knowledge from Web hyperlinks, page content and usage log. Web Content Mining is a component of Data Mining. The main uses of web content mining are to gather, categorize, organize and provide the best possible information available on the Web to the user requesting the information. This paper deals with a preliminary discussion of Web content mining, contributions in the field of web mining, the prominent successful tools and algorithms.
Other Latest Articles
- FACIAL EMOTION RECOGNITION USING CONVOLUTION NEURAL NETWORK
- SCHEDULING FOR RESOURCE OPTIMISATION USING FUZZY LOGIC
- ANALYSIS OF SUPPLY CHAIN MANAGEMENT IN INFRASTRUCTURE AND CONSTRUCTION PROJECT PLANNING FOR MAHARASHTRA REGION
- SIZE BASED CHARACTERIZATION OF SEISMIC MAGNITUDES
- MOBILE SOCIAL CLOUD COMPUTING: OPEN CHALLENGES
Last modified: 2018-03-17 20:05:31