MULTIWORD EXPRESSION EXTRACTION FROM NOISY TEXT USING LINGUISTIC RULES
Journal: IMPACT : International Journal of Research in Engineering & Technology ( IMPACT : IJRET ) (Vol.3, No. 5)Publication Date: 2015-06-05
Authors : VINEET KUMAR BIRLA;
Page : 17-22
Keywords : Corpora; Noisy Text; Charniak Parser; Multiword; Unique Words; Named Entities; Extractor;
Abstract
Language Technology units such as Machine Translations require dictionaries. But available dictionaries are simple set of word pair [3]. Since the text is collection of inter-related sentences and in which group of words may mean differently than the meaning of individual words, dictionary proves insufficient to provide requisite knowledge to language technology units. To enable Language Technology units with requisite information, therefore, multiword expressions are required. While syntactic multiword extraction is simpler, that of semantic Multiword expression is difficult for process automation, since the identification itself is difficult. This papers presents algorithm for extraction of Multiword Expression from a given English text.
Other Latest Articles
- CORRELATION BETWEEN SOME OF CHEMICAL COMPOSITION ELEMENTS OF ZOOPLANKTON AS WELL AS PRODUCTION EFFICIENCY AND QUALITATIVE COMPOSITION OF HIGH FATTY ACIDS PROFILE IN CARP MEAT
- PISCICULTURAL-BIOLOGICAL FOUNDATIONS FOR FORMATION AND EXPLOITATION OF PADDLEFISH BROOD STOCKS IN CONDITIONS OF INTRODUCTION
- INMPACT OF PHOTOPERIOD DURATION ON THE GROWTH OF RAINBOW TROUT (ONCORHYNHUS MYKIS WALBAUM, 1792) ERTAIN SPECIMENS OF
- INFLUENCE OF ECHINACEA PURPUREA AT THE SOME HEMATOLOGICAL AND BIOCHEMICAL PARAMETERS OF ONE YEARS CARPS BLOOD
- STIMULATION OF PLANKTON DEVELOPMENT IN THE PONDS BY DISTILLERY DREGS WHEN CULTIVATING ONE-YEAR CARP IN POLYCULTURE
Last modified: 2015-06-17 19:18:02