Comparison of LSI algorithms without and with pre-processing: using text document based search
Journal: ACCENTS Transactions on Information Security (TIS) (Vol.1, No. 4)Publication Date: 2016-10-18
Authors : Sheikh Muhammad Saqib; Khalid Mahmood; Tariq Naeem;
Page : 44-51
Keywords : Iterative residual rescaling; Term frequency; Inverse document frequency; Latent semantic indexing; Pre-processing.;
Abstract
Searching of document/text is the most important need of each student or computer user. Searching through particular index or term is the old fashion, now a day's user want to search documents according to some phrase, query or requirement i.e. extraction of meaningful information from large collection according to some textual query. Different methods such as iterative residual rescaling (IRR), term frequency (TF), inverse document frequency (IDF), multi words are using to handle such issues. Latent semantic indexing (LSI) is an important method for current literature of information retrieval. LSI can find similar documents on particular textual phrase. Here author has implemented two algorithms (without and with pre-processing) of LSI for text documents. As a result, both algorithms can obtain the similar results but their processing time will be different.
Other Latest Articles
- Extraction of key/title/aspect words from document using wordnet
- ANALYSIS OF THE FREQUENCY OF SOME POLYMORPHISMS OF THE CYTOKINE GENES IL-10, IL-4, TNF IN PATIENTS WITH CHRONIC HEPATITIS B
- COMPARATIVE ANALYSIS OF CONTRIBUTION OF POLYMORPHISM OF GENETIC MARKERS TO FORMATION OF DIFFERENT PATHOLOGIES (BY THE EXAMPLE OF HYPERTENSION AND VIRAL HEPATITIS C)
- PROGNOSTIC SIGNIFICANCE OF ANGIOGENESIS MARKER EXPRESSION IN LOW-GRADE CANCER OF THE DISTAL COLON
- An efficient AES and RC6 based cloud-user data security with attack detection mechanism
Last modified: 2016-10-24 17:22:25