ANALYSIS OF LEXICAL, SYNTACTIC AND SEMANTIC FEATURES FOR SEMANTIC TEXTUAL SIMILARITY
Journal: International Journal of Computer Engineering and Technology (IJCET) (Vol.9, No. 5)Publication Date: 2018-12-28
Authors : VANGAPELLI SOWMYA MANTENA S.V.S BHADRI RAJU BULUSU VISHNU VARDHAN;
Page : 1-9
Keywords : Semantic Textual Similarity; Lexical; syntactic; Semantic; Pearson Correlation Coefficient;
Abstract
Semantic Textual Similarity (STS) calculates the degree of semantic equivalence between two textual snippets, even though they do not share common words. The textual snippets are words, phrases, sentences, paragraphs or documents. In this work, the textual snippets are sentences. The similarity between the sentences can be measured with the aid of lexical, syntactic and semantic features entrenched in the sentences. In SemEval workshop, the STS task is to measure the semantic similarity between the sentence pairs. The dataset contains the sentence pairs and the human annotated real values from 0-5. In this paper, the experimental analysis of various lexical, syntactic and semantic features on STS 2016 dataset is carried out. This analysis is useful while building the models with machine learning algorithms with the aid of these features. The impact of the individual features on semantic textual equivalence is assessed between the feature generated values and human annotated values using Pearson Correlation Coefficient.
Other Latest Articles
- COMPARISON OF USAGE OF CROWDSOURCING IN TRADITIONAL AND AGILE SOFTWARE DEVELOPMENT METHODOLOGIES ON THE BASIS OF EFFECTIVENESS
- WEB USAGE MINING FOR OPINION TARGET RECOGNITION WITH THE AID OF FUZZY CLUSTERING BASED RANDOM FOREST CLASSIFICATION
- COMPARATIVE ANALYSIS OF IMAGE DENOISING TECHNIQUES FOR ENHANCING REAL-TIME IMAGES
- ANALYTICAL STUDY ON BLOCK CHAIN TECHNOLOGY WITH REFERENCE TO CRYPTO CURRENCY AND AN OVERVIEW ON INDIA’S OWN CRYPTO CURRENCY
- ANALYSIS OF TEMPLATES MATCHING FOR HUMAN BODY PARTS RECOGNITION
Last modified: 2018-12-08 17:10:51