Innovation indicators based on firm websites-Which website characteristics predict firm-level innovation activity?

Web-based innovation indicators may provide new insights into firm-level innovation activities. However, little is known yet about the accuracy and relevance of web-based information for measuring innovation. In this study, we use data on 4,487 firms from the Mannheim Innovation Panel (MIP) 2019, th...

Full description

Saved in:
Bibliographic Details
Main Authors: Janna Axenbeck, Patrick Breithaupt
Format: Article
Language:English
Published: Public Library of Science (PLoS) 2021-01-01
Series:PLoS ONE
Online Access:https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0249583&type=printable
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1850046163924811776
author Janna Axenbeck
Patrick Breithaupt
author_facet Janna Axenbeck
Patrick Breithaupt
author_sort Janna Axenbeck
collection DOAJ
description Web-based innovation indicators may provide new insights into firm-level innovation activities. However, little is known yet about the accuracy and relevance of web-based information for measuring innovation. In this study, we use data on 4,487 firms from the Mannheim Innovation Panel (MIP) 2019, the German contribution to the European Community Innovation Survey (CIS), to analyze which website characteristics perform as predictors of innovation activity at the firm level. Website characteristics are measured by several data mining methods and are used as features in different Random Forest classification models that are compared against each other. Our results show that the most relevant website characteristics are textual content, the use of English language, the number of subpages and the amount of characters on a website. In our main analysis, models using all website characteristics jointly yield AUC values of up to 0.75 and increase accuracy scores by up to 18 percentage points compared to a baseline prediction based on the sample mean. Moreover, predictions with website characteristics significantly differ from baseline predictions according to a McNemar test. Results also indicate a better performance for the prediction of product innovators and firms with innovation expenditures than for the prediction of process innovators.
format Article
id doaj-art-91087f8923284b0bb43b32fb5c225a02
institution DOAJ
issn 1932-6203
language English
publishDate 2021-01-01
publisher Public Library of Science (PLoS)
record_format Article
series PLoS ONE
spelling doaj-art-91087f8923284b0bb43b32fb5c225a022025-08-20T02:54:31ZengPublic Library of Science (PLoS)PLoS ONE1932-62032021-01-01164e024958310.1371/journal.pone.0249583Innovation indicators based on firm websites-Which website characteristics predict firm-level innovation activity?Janna AxenbeckPatrick BreithauptWeb-based innovation indicators may provide new insights into firm-level innovation activities. However, little is known yet about the accuracy and relevance of web-based information for measuring innovation. In this study, we use data on 4,487 firms from the Mannheim Innovation Panel (MIP) 2019, the German contribution to the European Community Innovation Survey (CIS), to analyze which website characteristics perform as predictors of innovation activity at the firm level. Website characteristics are measured by several data mining methods and are used as features in different Random Forest classification models that are compared against each other. Our results show that the most relevant website characteristics are textual content, the use of English language, the number of subpages and the amount of characters on a website. In our main analysis, models using all website characteristics jointly yield AUC values of up to 0.75 and increase accuracy scores by up to 18 percentage points compared to a baseline prediction based on the sample mean. Moreover, predictions with website characteristics significantly differ from baseline predictions according to a McNemar test. Results also indicate a better performance for the prediction of product innovators and firms with innovation expenditures than for the prediction of process innovators.https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0249583&type=printable
spellingShingle Janna Axenbeck
Patrick Breithaupt
Innovation indicators based on firm websites-Which website characteristics predict firm-level innovation activity?
PLoS ONE
title Innovation indicators based on firm websites-Which website characteristics predict firm-level innovation activity?
title_full Innovation indicators based on firm websites-Which website characteristics predict firm-level innovation activity?
title_fullStr Innovation indicators based on firm websites-Which website characteristics predict firm-level innovation activity?
title_full_unstemmed Innovation indicators based on firm websites-Which website characteristics predict firm-level innovation activity?
title_short Innovation indicators based on firm websites-Which website characteristics predict firm-level innovation activity?
title_sort innovation indicators based on firm websites which website characteristics predict firm level innovation activity
url https://journals.plos.org/plosone/article/file?id=10.1371/journal.pone.0249583&type=printable
work_keys_str_mv AT jannaaxenbeck innovationindicatorsbasedonfirmwebsiteswhichwebsitecharacteristicspredictfirmlevelinnovationactivity
AT patrickbreithaupt innovationindicatorsbasedonfirmwebsiteswhichwebsitecharacteristicspredictfirmlevelinnovationactivity