On identifying name equivalences in digital libraries. Name equivalence, Surname matching, Author identification, Databases
The services provided by digital libraries can be much improved by correctly identifying variants of the same name. For example, this will allow for better retrieval of all the works by a certain author. We focus on variants caused by abbreviations of first names, and show that significant achieveme...
Saved in:
Main Author: | |
---|---|
Format: | Article |
Language: | English |
Published: |
University of Borås
2004-01-01
|
Series: | Information Research: An International Electronic Journal |
Subjects: | |
Online Access: | http://informationr.net/ir/9-4/paper192.html |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832569978915651584 |
---|---|
author | Dror G. Feitelson |
author_facet | Dror G. Feitelson |
author_sort | Dror G. Feitelson |
collection | DOAJ |
description | The services provided by digital libraries can be much improved by correctly identifying variants of the same name. For example, this will allow for better retrieval of all the works by a certain author. We focus on variants caused by abbreviations of first names, and show that significant achievements are possible by simple lexical analysis and comparison of names. This is done in two steps: first a pairwise matching of names is performed, and then these are used to find cliques of equivalent names. However, these steps can each be performed in a variety of ways. We therefore conduct an experimental analysis using two real datasets to find which approaches actually work well in practice. Interestingly, this depends on the size of the repository, as larger repositories may have many more similar names. |
format | Article |
id | doaj-art-0f8d8a2e2faa43d0b2349eb58766a36c |
institution | Kabale University |
issn | 1368-1613 |
language | English |
publishDate | 2004-01-01 |
publisher | University of Borås |
record_format | Article |
series | Information Research: An International Electronic Journal |
spelling | doaj-art-0f8d8a2e2faa43d0b2349eb58766a36c2025-02-02T17:57:33ZengUniversity of BoråsInformation Research: An International Electronic Journal1368-16132004-01-0194192On identifying name equivalences in digital libraries. Name equivalence, Surname matching, Author identification, DatabasesDror G. FeitelsonThe services provided by digital libraries can be much improved by correctly identifying variants of the same name. For example, this will allow for better retrieval of all the works by a certain author. We focus on variants caused by abbreviations of first names, and show that significant achievements are possible by simple lexical analysis and comparison of names. This is done in two steps: first a pairwise matching of names is performed, and then these are used to find cliques of equivalent names. However, these steps can each be performed in a variety of ways. We therefore conduct an experimental analysis using two real datasets to find which approaches actually work well in practice. Interestingly, this depends on the size of the repository, as larger repositories may have many more similar names.http://informationr.net/ir/9-4/paper192.htmlName equivalenceSurname matchingAuthor identificationDatabases |
spellingShingle | Dror G. Feitelson On identifying name equivalences in digital libraries. Name equivalence, Surname matching, Author identification, Databases Information Research: An International Electronic Journal Name equivalence Surname matching Author identification Databases |
title | On identifying name equivalences in digital libraries. Name equivalence, Surname matching, Author identification, Databases |
title_full | On identifying name equivalences in digital libraries. Name equivalence, Surname matching, Author identification, Databases |
title_fullStr | On identifying name equivalences in digital libraries. Name equivalence, Surname matching, Author identification, Databases |
title_full_unstemmed | On identifying name equivalences in digital libraries. Name equivalence, Surname matching, Author identification, Databases |
title_short | On identifying name equivalences in digital libraries. Name equivalence, Surname matching, Author identification, Databases |
title_sort | on identifying name equivalences in digital libraries name equivalence surname matching author identification databases |
topic | Name equivalence Surname matching Author identification Databases |
url | http://informationr.net/ir/9-4/paper192.html |
work_keys_str_mv | AT drorgfeitelson onidentifyingnameequivalencesindigitallibrariesnameequivalencesurnamematchingauthoridentificationdatabases |