Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora
Academic dictionary writing is making greater and greater use of the TEI Guidelines’ dictionary module. And as increasing numbers of TEI dictionaries become available, there is an ever more palpable need to work towards greater interoperability among dictionary writing systems and other language res...
Main Authors: | Karlheinz Mörth, Laurent Romary, Gerhard Budin, Daniel Schopper |
---|---|
Format: | Article |
Language: | deu |
Published: | Text Encoding Initiative Consortium, 2015-12-01 |
Series: | Journal of the Text Encoding Initiative |
Subjects: | lexicography, language resources, digital corpora, statistics |
Online Access: | https://journals.openedition.org/jtei/1356 |
_version_ | 1832578492881960960 |
---|---|
author | Karlheinz Mörth Laurent Romary Gerhard Budin Daniel Schopper |
collection | DOAJ |
description | Academic dictionary writing is making greater and greater use of the TEI Guidelines’ dictionary module. And as increasing numbers of TEI dictionaries become available, there is an ever more palpable need to work towards greater interoperability among dictionary writing systems and other language resources that are needed by dictionaries and dictionary tools. In particular this holds true for the crucial role that statistical data obtained from language resources play in lexicographic workflow—a role that also has to be reflected in the model of the data produced in these workflows. Presenting a range of current projects, the authors address two main questions in this area: How can the relationship between a dictionary and other language resources be conceptualized, irrespective of whether they are used in the production of the dictionary or to enrich existing lexicographic data? And how can this be documented using the TEI Guidelines? Discussing a variety of options, this paper proposes a customization of the TEI dictionary module that tries to respond to the emerging requirements in an environment of increasingly intertwined language resources. |
format | Article |
id | doaj-art-663c0dd436f84f66b77b6798b05c76a8 |
institution | Kabale University |
issn | 2162-5603 |
language | deu |
publishDate | 2015-12-01 |
publisher | Text Encoding Initiative Consortium |
record_format | Article |
series | Journal of the Text Encoding Initiative |
spelling | doaj-art-663c0dd436f84f66b77b6798b05c76a8; 2025-01-30T13:56:25Z; deu; Text Encoding Initiative Consortium; Journal of the Text Encoding Initiative; 2162-5603; 2015-12-01; 8; 10.4000/jtei.1356; Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora; Karlheinz Mörth; Laurent Romary; Gerhard Budin; Daniel Schopper; https://journals.openedition.org/jtei/1356; lexicography; language resources; digital corpora; statistics |
title | Modeling Frequency Data: Methodological Considerations on the Relationship between Dictionaries and Corpora |
topic | lexicography language resources digital corpora statistics |
url | https://journals.openedition.org/jtei/1356 |