Stemming as a basis for some non-conventional methods of information retrieval

Abstract The article presents various techniques of stemming, arguing that they are the most important phase in preparing the text for inclusion into full-text databases, especially those using non-Boolean search models. Stemming is a process of text processing us¬ing stemming algorithms, the purpo...

Full description

Saved in:
Bibliographic Details
Main Authors: Polona Vilar, Jure Dimec
Format: Article
Language:English
Published: Slovenian Library Association & University of Ljubljana Press (Založba Univerze v Ljubljani) 2000-10-01
Series:Knjižnica
Subjects:
Online Access:https://journals.uni-lj.si/knjiznica/article/view/13960
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850250091461345280
author Polona Vilar
Jure Dimec
author_facet Polona Vilar
Jure Dimec
author_sort Polona Vilar
collection DOAJ
description Abstract The article presents various techniques of stemming, arguing that they are the most important phase in preparing the text for inclusion into full-text databases, especially those using non-Boolean search models. Stemming is a process of text processing us¬ing stemming algorithms, the purpose of which is an automated selection of indexing terms used for content description. The article presents a statistic approach to stemming, morphological and semantical aspects of stemming, and several stemming algorithms. The authors also speak about evaluation criteria and linguistic dependence of such algorithms. At the end, they give more detailed descriptions of some stemming algorithms developed for English, Slovene, French, Japanese and Arabic languages.
format Article
id doaj-art-cf3fc1106451410990dcd963c416ee49
institution OA Journals
issn 0023-2424
1581-7903
language English
publishDate 2000-10-01
publisher Slovenian Library Association & University of Ljubljana Press (Založba Univerze v Ljubljani)
record_format Article
series Knjižnica
spelling doaj-art-cf3fc1106451410990dcd963c416ee492025-08-20T01:58:19ZengSlovenian Library Association & University of Ljubljana Press (Založba Univerze v Ljubljani)Knjižnica0023-24241581-79032000-10-0144410.55741/knj.44.4.13960Stemming as a basis for some non-conventional methods of information retrievalPolona Vilar0Jure Dimec1Polona Vilar je zaposlena na Oddelku za bibliotekarstvo, Filozofska fakulteta Univerze v Ljubljani. Naslov: Aškerčeva 2,1000 Ljubljana. Naslov elektronske pošte: polona.vilar@ff.uni-lj.siDr. Jure Dimec je zaposlen na Inštitutu za biomedicinsko informatiko pri Medicinski fakulteti Univerze v Ljubljani. Naslov: Vrazov trg 2,1000 Ljubljana. Naslov elektronske pošte: jure.dimec@mf.uni-lj.siAbstract The article presents various techniques of stemming, arguing that they are the most important phase in preparing the text for inclusion into full-text databases, especially those using non-Boolean search models. Stemming is a process of text processing us¬ing stemming algorithms, the purpose of which is an automated selection of indexing terms used for content description. The article presents a statistic approach to stemming, morphological and semantical aspects of stemming, and several stemming algorithms. The authors also speak about evaluation criteria and linguistic dependence of such algorithms. At the end, they give more detailed descriptions of some stemming algorithms developed for English, Slovene, French, Japanese and Arabic languages.https://journals.uni-lj.si/knjiznica/article/view/13960information scienceindexingautomatic indexingstemmingalgorithms
spellingShingle Polona Vilar
Jure Dimec
Stemming as a basis for some non-conventional methods of information retrieval
Knjižnica
information science
indexing
automatic indexing
stemming
algorithms
title Stemming as a basis for some non-conventional methods of information retrieval
title_full Stemming as a basis for some non-conventional methods of information retrieval
title_fullStr Stemming as a basis for some non-conventional methods of information retrieval
title_full_unstemmed Stemming as a basis for some non-conventional methods of information retrieval
title_short Stemming as a basis for some non-conventional methods of information retrieval
title_sort stemming as a basis for some non conventional methods of information retrieval
topic information science
indexing
automatic indexing
stemming
algorithms
url https://journals.uni-lj.si/knjiznica/article/view/13960
work_keys_str_mv AT polonavilar stemmingasabasisforsomenonconventionalmethodsofinformationretrieval
AT juredimec stemmingasabasisforsomenonconventionalmethodsofinformationretrieval