Ethical challenges in collecting pre-existing digital data for linguistic research

Pre-existing digital data are a valuable resource for linguistic research. Collecting these materials is often thought of as straightforward (“the data exist anyway”) and ethical dilemmas are given little consideration. In this article, we discuss microethical issues we encountered while collecting...

Full description

Saved in:
Bibliographic Details
Main Authors: Dalmaijer Evi, Stommel Wyke, Pas Berber, Spooren Wilbert
Format: Article
Language:English
Published: De Gruyter 2025-03-01
Series:Linguistics
Subjects:
Online Access:https://doi.org/10.1515/ling-2023-0132
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1849738321298718720
author Dalmaijer Evi
Stommel Wyke
Pas Berber
Spooren Wilbert
author_facet Dalmaijer Evi
Stommel Wyke
Pas Berber
Spooren Wilbert
author_sort Dalmaijer Evi
collection DOAJ
description Pre-existing digital data are a valuable resource for linguistic research. Collecting these materials is often thought of as straightforward (“the data exist anyway”) and ethical dilemmas are given little consideration. In this article, we discuss microethical issues we encountered while collecting electronic text messages, photos, and videos posted on a digital platform and app during paramedical treatment. Since ethics and methods are intertwined, we discuss the various ethical and methodological aspects of collecting these sensitive digital data for our linguistic research project and reflect on the benefits and limitations of the choices we made during this process. We specifically highlight the interdependence of ethics with technology and discuss how this can be even more challenging when working in a specific institutional context characterized by different conceptions of ethics and technology. Our article highlights the importance of microethics complementing prevalent ethical guidelines. We show that when pre-existing digital data are available in non-public digital spheres, it is difficult for researchers to define in advance in ethical protocols or guidelines how the data can be collected and what ethical measures should be taken. We argue that ethical reflections should be at the center of research, including research on pre-existing digital data, guiding the decisions to be made at all stages.
format Article
id doaj-art-54b36beb2f574baf939fefa36b8de8a1
institution DOAJ
issn 0024-3949
1613-396X
language English
publishDate 2025-03-01
publisher De Gruyter
record_format Article
series Linguistics
spelling doaj-art-54b36beb2f574baf939fefa36b8de8a12025-08-20T03:06:37ZengDe GruyterLinguistics0024-39491613-396X2025-03-0163240742710.1515/ling-2023-0132Ethical challenges in collecting pre-existing digital data for linguistic researchDalmaijer Evi0Stommel Wyke1Pas Berber2Spooren Wilbert3Centre for Language Studies, 6029Radboud University, Erasmusplein 1, 6525 HT, Nijmegen, The NetherlandsCentre for Language Studies, 6029Radboud University, Erasmusplein 1, 6525 HT, Nijmegen, The NetherlandsInterdisciplinary Research Hub on Digitalization and Society, Radboud University, Erasmusplein 1, 6525 HT, Nijmegen, The NetherlandsCentre for Language Studies, 6029Radboud University, Erasmusplein 1, 6525 HT, Nijmegen, The NetherlandsPre-existing digital data are a valuable resource for linguistic research. Collecting these materials is often thought of as straightforward (“the data exist anyway”) and ethical dilemmas are given little consideration. In this article, we discuss microethical issues we encountered while collecting electronic text messages, photos, and videos posted on a digital platform and app during paramedical treatment. Since ethics and methods are intertwined, we discuss the various ethical and methodological aspects of collecting these sensitive digital data for our linguistic research project and reflect on the benefits and limitations of the choices we made during this process. We specifically highlight the interdependence of ethics with technology and discuss how this can be even more challenging when working in a specific institutional context characterized by different conceptions of ethics and technology. Our article highlights the importance of microethics complementing prevalent ethical guidelines. We show that when pre-existing digital data are available in non-public digital spheres, it is difficult for researchers to define in advance in ethical protocols or guidelines how the data can be collected and what ethical measures should be taken. We argue that ethical reflections should be at the center of research, including research on pre-existing digital data, guiding the decisions to be made at all stages.https://doi.org/10.1515/ling-2023-0132microethicspre-existing datadigital data collectiontechnology
spellingShingle Dalmaijer Evi
Stommel Wyke
Pas Berber
Spooren Wilbert
Ethical challenges in collecting pre-existing digital data for linguistic research
Linguistics
microethics
pre-existing data
digital data collection
technology
title Ethical challenges in collecting pre-existing digital data for linguistic research
title_full Ethical challenges in collecting pre-existing digital data for linguistic research
title_fullStr Ethical challenges in collecting pre-existing digital data for linguistic research
title_full_unstemmed Ethical challenges in collecting pre-existing digital data for linguistic research
title_short Ethical challenges in collecting pre-existing digital data for linguistic research
title_sort ethical challenges in collecting pre existing digital data for linguistic research
topic microethics
pre-existing data
digital data collection
technology
url https://doi.org/10.1515/ling-2023-0132
work_keys_str_mv AT dalmaijerevi ethicalchallengesincollectingpreexistingdigitaldataforlinguisticresearch
AT stommelwyke ethicalchallengesincollectingpreexistingdigitaldataforlinguisticresearch
AT pasberber ethicalchallengesincollectingpreexistingdigitaldataforlinguisticresearch
AT spoorenwilbert ethicalchallengesincollectingpreexistingdigitaldataforlinguisticresearch