A novel multivariate time series dataset of outdoor sport activities

Abstract This study introduces a novel multivariate time series dataset of 228 outdoor sport activities recorded by individual non-competitive athlete in uncontrolled environments. The dataset includes three features: Heart Rate, Speed, and Altitude, and covers five sport categories: walking, runnin...

Full description

Saved in:
Bibliographic Details
Main Author: Matarmaa Jarno
Format: Article
Language:English
Published: Springer 2025-01-01
Series:Discover Data
Subjects:
Online Access:https://doi.org/10.1007/s44248-025-00019-5
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1832585477430968320
author Matarmaa Jarno
author_facet Matarmaa Jarno
author_sort Matarmaa Jarno
collection DOAJ
description Abstract This study introduces a novel multivariate time series dataset of 228 outdoor sport activities recorded by individual non-competitive athlete in uncontrolled environments. The dataset includes three features: Heart Rate, Speed, and Altitude, and covers five sport categories: walking, running, skiing, roller-skiing, and biking. The data was collected using two types of Garmin sport watches. The original dataset was carefully pre-processed using typical data cleansing methods such as gaps filling, and value format transformations. Furthermore, activity filtering was implemented for missing sensor value data and using domain knowledge of sport categories. Full length sequences, varying from 10 min to several hours, were split into equal length segments, approximately 1 min. To address the small number of instances data was augmented using several consecutive segments from the same activity. However, only a small part of the whole original data was used as a computational cost–information gain tradeoff. Three-dimensional dataset is divided into three parts, each dimension to its own comma separated value (CSV) file. The dataset aims to provide a unique resource for researchers and practitioners in the field of sports science, human performance analysis, and activity recognition. It aims to complement the very limited or non-existent publicly available sport activity datasets.
format Article
id doaj-art-19e00bfc1dac456c82cf3830309e641e
institution Kabale University
issn 2731-6955
language English
publishDate 2025-01-01
publisher Springer
record_format Article
series Discover Data
spelling doaj-art-19e00bfc1dac456c82cf3830309e641e2025-01-26T12:47:48ZengSpringerDiscover Data2731-69552025-01-013111110.1007/s44248-025-00019-5A novel multivariate time series dataset of outdoor sport activitiesMatarmaa Jarno0Institute of Radio-Electronics and Information Technology, Ural Federal UniversityAbstract This study introduces a novel multivariate time series dataset of 228 outdoor sport activities recorded by individual non-competitive athlete in uncontrolled environments. The dataset includes three features: Heart Rate, Speed, and Altitude, and covers five sport categories: walking, running, skiing, roller-skiing, and biking. The data was collected using two types of Garmin sport watches. The original dataset was carefully pre-processed using typical data cleansing methods such as gaps filling, and value format transformations. Furthermore, activity filtering was implemented for missing sensor value data and using domain knowledge of sport categories. Full length sequences, varying from 10 min to several hours, were split into equal length segments, approximately 1 min. To address the small number of instances data was augmented using several consecutive segments from the same activity. However, only a small part of the whole original data was used as a computational cost–information gain tradeoff. Three-dimensional dataset is divided into three parts, each dimension to its own comma separated value (CSV) file. The dataset aims to provide a unique resource for researchers and practitioners in the field of sports science, human performance analysis, and activity recognition. It aims to complement the very limited or non-existent publicly available sport activity datasets.https://doi.org/10.1007/s44248-025-00019-5Multivariate time seriesOutdoor sportSport exercisesSport dataset
spellingShingle Matarmaa Jarno
A novel multivariate time series dataset of outdoor sport activities
Discover Data
Multivariate time series
Outdoor sport
Sport exercises
Sport dataset
title A novel multivariate time series dataset of outdoor sport activities
title_full A novel multivariate time series dataset of outdoor sport activities
title_fullStr A novel multivariate time series dataset of outdoor sport activities
title_full_unstemmed A novel multivariate time series dataset of outdoor sport activities
title_short A novel multivariate time series dataset of outdoor sport activities
title_sort novel multivariate time series dataset of outdoor sport activities
topic Multivariate time series
Outdoor sport
Sport exercises
Sport dataset
url https://doi.org/10.1007/s44248-025-00019-5
work_keys_str_mv AT matarmaajarno anovelmultivariatetimeseriesdatasetofoutdoorsportactivities
AT matarmaajarno novelmultivariatetimeseriesdatasetofoutdoorsportactivities