On identifying name equivalences in digital libraries. Name equivalence, Surname matching, Author identification, Databases

The services provided by digital libraries can be much improved by correctly identifying variants of the same name. For example, this will allow for better retrieval of all the works by a certain author. We focus on variants caused by abbreviations of first names, and show that significant achieveme...

Bibliographic Details
Main Author: Dror G. Feitelson
Format: Article
Language: English
Published: University of Borås 2004-01-01
Series: Information Research: An International Electronic Journal
Subjects: Name equivalence; Surname matching; Author identification; Databases
Online Access: http://informationr.net/ir/9-4/paper192.html
_version_ 1832569978915651584
author Dror G. Feitelson
author_facet Dror G. Feitelson
author_sort Dror G. Feitelson
collection DOAJ
description The services provided by digital libraries can be much improved by correctly identifying variants of the same name. For example, this will allow for better retrieval of all the works by a certain author. We focus on variants caused by abbreviations of first names, and show that significant achievements are possible by simple lexical analysis and comparison of names. This is done in two steps: first a pairwise matching of names is performed, and then these are used to find cliques of equivalent names. However, these steps can each be performed in a variety of ways. We therefore conduct an experimental analysis using two real datasets to find which approaches actually work well in practice. Interestingly, this depends on the size of the repository, as larger repositories may have many more similar names.
format Article
id doaj-art-0f8d8a2e2faa43d0b2349eb58766a36c
institution Kabale University
issn 1368-1613
language English
publishDate 2004-01-01
publisher University of Borås
record_format Article
series Information Research: An International Electronic Journal
title On identifying name equivalences in digital libraries. Name equivalence, Surname matching, Author identification, Databases
topic Name equivalence
Surname matching
Author identification
Databases
url http://informationr.net/ir/9-4/paper192.html
work_keys_str_mv AT drorgfeitelson onidentifyingnameequivalencesindigitallibrariesnameequivalencesurnamematchingauthoridentificationdatabases
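
The description in this record outlines a two-step approach: pairwise matching of names, then grouping the matches into cliques of equivalent names. The Python sketch below illustrates that general idea only; it is not the article's implementation. The matching rule (same surname, and each first-name token either equal or an initial of the other) is an assumption made for illustration, the Bron-Kerbosch clique enumeration is a standard technique chosen here rather than one attributed to the article, and "David Feitelson" is a made-up name used to show a conflict.

```python
from itertools import combinations


def parse(name):
    """Split 'Dror G. Feitelson' into (['dror', 'g'], 'feitelson')."""
    parts = name.replace(".", " ").lower().split()
    return parts[:-1], parts[-1]


def tokens_match(a, b):
    """Two given-name tokens match if equal, or if one is the other's initial."""
    if a == b:
        return True
    short, long_ = sorted((a, b), key=len)
    return len(short) == 1 and long_.startswith(short)


def names_match(x, y):
    """Pairwise test (assumed rule): same surname, compatible given names."""
    given_x, sur_x = parse(x)
    given_y, sur_y = parse(y)
    if sur_x != sur_y:
        return False
    # zip stops at the shorter list, so a missing middle name is tolerated.
    return all(tokens_match(a, b) for a, b in zip(given_x, given_y))


def maximal_cliques(names):
    """Enumerate maximal cliques of the match graph (basic Bron-Kerbosch)."""
    unique = sorted(set(names))
    adj = {n: set() for n in unique}
    for x, y in combinations(unique, 2):
        if names_match(x, y):
            adj[x].add(y)
            adj[y].add(x)
    cliques = []

    def expand(r, p, x):
        # r: current clique, p: candidates to add, x: already-explored vertices
        if not p and not x:
            cliques.append(sorted(r))
            return
        for v in list(p):
            expand(r | {v}, p & adj[v], x & adj[v])
            p = p - {v}
            x = x | {v}

    expand(set(), set(unique), set())
    return sorted(cliques)


if __name__ == "__main__":
    sample = [
        "Dror G. Feitelson",
        "D. Feitelson",
        "D. G. Feitelson",
        "Dror Feitelson",
        "David Feitelson",  # hypothetical name, for illustration only
    ]
    for clique in maximal_cliques(sample):
        print(clique)
```

Cliques are used here instead of connected components because this kind of pairwise matching is not transitive: under the assumed rule, "D. Feitelson" matches both "Dror Feitelson" and "David Feitelson", yet those two do not match each other, so the abbreviated form ends up in two separate cliques.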