Machine learning of swimming data via wisdom of crowd and regression analysis
Every performance, in an officially sanctioned meet, by a registered USA swimmer is recorded into an online database with times dating back to 1980. For the first time, statistical analysis and machine learning methods are systematically applied to 4,022,631 swim records. In this study, we investiga...
Saved in:
Main Authors: | , , , |
---|---|
Format: | Article |
Language: | English |
Published: |
AIMS Press
2017-03-01
|
Series: | Mathematical Biosciences and Engineering |
Subjects: | |
Online Access: | https://www.aimspress.com/article/doi/10.3934/mbe.2017031 |
Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
_version_ | 1832590057216671744 |
---|---|
author | Jiang Xie Junfu Xu Celine Nie Qing Nie |
author_facet | Jiang Xie Junfu Xu Celine Nie Qing Nie |
author_sort | Jiang Xie |
collection | DOAJ |
description | Every performance, in an officially sanctioned meet, by a registered USA swimmer is recorded into an online database with times dating back to 1980. For the first time, statistical analysis and machine learning methods are systematically applied to 4,022,631 swim records. In this study, we investigate performance features for all strokes as a function of age and gender. The variances in performance of males and females for different ages and strokes were studied, and the correlations of performances for different ages were estimated using the Pearson correlation. Regression analysis show the performance trends for both males and females at different ages and suggest critical ages for peak training. Moreover, we assess twelve popular machine learning methods to predict or classify swimmer performance. Each method exhibited different strengths or weaknesses in different cases, indicating no one method could predict well for all strokes. To address this problem, we propose a new method by combining multiple inference methods to derive Wisdom of Crowd Classifier (WoCC). Our simulation experiments demonstrate that the WoCC is a consistent method with better overall prediction accuracy. Our study reveals several new age-dependent trends in swimming and provides an accurate method for classifying and predicting swimming times. |
format | Article |
id | doaj-art-74ed5af272834eba83f9097309303252 |
institution | Kabale University |
issn | 1551-0018 |
language | English |
publishDate | 2017-03-01 |
publisher | AIMS Press |
record_format | Article |
series | Mathematical Biosciences and Engineering |
spelling | doaj-art-74ed5af272834eba83f90973093032522025-01-24T02:39:37ZengAIMS PressMathematical Biosciences and Engineering1551-00182017-03-0114251152710.3934/mbe.2017031Machine learning of swimming data via wisdom of crowd and regression analysisJiang Xie0Junfu Xu1Celine Nie2Qing Nie3School of Computer Engineering and Science, Shanghai University, 99 Shangda Road, Shanghai 200444, ChinaSchool of Computer Engineering and Science, Shanghai University, 99 Shangda Road, Shanghai 200444, ChinaUniversity High School, 4771 Campus Drive, Irvine, CA 92612, USADepartment of Mathematics, Center for Mathematical and Computational Biology, University of California, Irvine, CA 92697, USAEvery performance, in an officially sanctioned meet, by a registered USA swimmer is recorded into an online database with times dating back to 1980. For the first time, statistical analysis and machine learning methods are systematically applied to 4,022,631 swim records. In this study, we investigate performance features for all strokes as a function of age and gender. The variances in performance of males and females for different ages and strokes were studied, and the correlations of performances for different ages were estimated using the Pearson correlation. Regression analysis show the performance trends for both males and females at different ages and suggest critical ages for peak training. Moreover, we assess twelve popular machine learning methods to predict or classify swimmer performance. Each method exhibited different strengths or weaknesses in different cases, indicating no one method could predict well for all strokes. To address this problem, we propose a new method by combining multiple inference methods to derive Wisdom of Crowd Classifier (WoCC). Our simulation experiments demonstrate that the WoCC is a consistent method with better overall prediction accuracy. Our study reveals several new age-dependent trends in swimming and provides an accurate method for classifying and predicting swimming times.https://www.aimspress.com/article/doi/10.3934/mbe.2017031big datastatistical analysispredictionswimming modeltime series data |
spellingShingle | Jiang Xie Junfu Xu Celine Nie Qing Nie Machine learning of swimming data via wisdom of crowd and regression analysis Mathematical Biosciences and Engineering big data statistical analysis prediction swimming model time series data |
title | Machine learning of swimming data via wisdom of crowd and regression analysis |
title_full | Machine learning of swimming data via wisdom of crowd and regression analysis |
title_fullStr | Machine learning of swimming data via wisdom of crowd and regression analysis |
title_full_unstemmed | Machine learning of swimming data via wisdom of crowd and regression analysis |
title_short | Machine learning of swimming data via wisdom of crowd and regression analysis |
title_sort | machine learning of swimming data via wisdom of crowd and regression analysis |
topic | big data statistical analysis prediction swimming model time series data |
url | https://www.aimspress.com/article/doi/10.3934/mbe.2017031 |
work_keys_str_mv | AT jiangxie machinelearningofswimmingdataviawisdomofcrowdandregressionanalysis AT junfuxu machinelearningofswimmingdataviawisdomofcrowdandregressionanalysis AT celinenie machinelearningofswimmingdataviawisdomofcrowdandregressionanalysis AT qingnie machinelearningofswimmingdataviawisdomofcrowdandregressionanalysis |