Turicchi, Jake ORCID: https://orcid.org/0000-0003-1174-813X, O'Driscoll, Ruairi ORCID: https://orcid.org/0000-0003-3995-0073, Finlayson, Graham ORCID: https://orcid.org/0000-0002-5620-2256, Duarte, Cristiana ORCID: https://orcid.org/0000-0002-6566-273X, Palmeira, A L ORCID: https://orcid.org/0000-0001-6508-0599, Larsen, Sofus C ORCID: https://orcid.org/0000-0002-0838-9378, Heitmann, Berit L ORCID: https://orcid.org/0000-0002-6809-4504 and Stubbs, R James ORCID: https://orcid.org/0000-0002-0843-9064 (2020) Data Imputation and Body Weight Variability Calculation Using Linear and Nonlinear Methods in Data Collected From Digital Smart Scales: Simulation and Validation Study. JMIR mHealth and uHealth, 8 (9). e17977.
Text
PDF - Published Version Available under License Creative Commons Attribution. |
Abstract
Background:
Body weight variability (BWV) is common in the general population and may act as a risk factor for obesity or diseases. The correct identification of these patterns may have prognostic or predictive value in clinical and research settings. With advancements in technology allowing for the frequent collection of body weight data from electronic smart scales, new opportunities to analyze and identify patterns in body weight data are available.
Objective:
This study aims to compare multiple methods of data imputation and BWV calculation using linear and nonlinear approaches
Methods:
In total, 50 participants from an ongoing weight loss maintenance study (the NoHoW study) were selected to develop the procedure. We addressed the following aspects of data analysis: cleaning, imputation, detrending, and calculation of total and local BWV. To test imputation, missing data were simulated at random and using real patterns of missingness. A total of 10 imputation strategies were tested. Next, BWV was calculated using linear and nonlinear approaches, and the effects of missing data and data imputation on these estimates were investigated.
Results:
Body weight imputation using structural modeling with Kalman smoothing or an exponentially weighted moving average provided the best agreement with observed values (root mean square error range 0.62%-0.64%). Imputation performance decreased with missingness and was similar between random and nonrandom simulations. Errors in BWV estimations from missing simulated data sets were low (2%-7% with 80% missing data or a mean of 67, SD 40.1 available body weights) compared with that of imputation strategies where errors were significantly greater, varying by imputation method.
Conclusions:
The decision to impute body weight data depends on the purpose of the analysis. Directions for the best performing imputation methods are provided. For the purpose of estimating BWV, data imputation should not be conducted. Linear and nonlinear methods of estimating BWV provide reasonably accurate estimates under high proportions (80%) of missing data.
Item Type: | Article |
---|---|
Status: | Published |
DOI: | 10.2196/17977 |
School/Department: | School of Education, Language and Psychology |
URI: | https://ray.yorksj.ac.uk/id/eprint/5980 |
University Staff: Request a correction | RaY Editors: Update this record