Many studies have shown the effectiveness of ma- chine learning in estimating psychological or physiological states using physiological data as input. However, it is ethically and physically difficult to collect a large amount of data without bias in an uncontrolled environment. Specifically, the amount of data in rare cases is especially small compared to common data. Therefore, the distribution bias may cause overfitting in machine learning. In this paper, we propose a SMOTE-based method to alleviate the distribution bias by data augmentation in the regression problem using a dataset containing time-series physiological data. The effectiveness of the proposed method was confirmed for datasets of thermal sensation and core body temperature collected in uncontrolled environments. The results show that our method improves the performance of regression models for minor cases with a bit of decline in the mean average error.
- Hiroki Yoshikawa, Akira Uchiyama, Teruo Higashino, "Time-Series Physiological Data Balancing for Regression," Proc. of the 2021 IEEE International Conference on Artificial Intelligence and Computer Applications (ICAICA 2021), Dalian, China, June 28-30, 2021. doi:https://doi.org/10.1109/ICAICA52286.2021.9498128