ECG signal reconstruction based on facial videos via combined explicit and implicit supervision
Li, Bin; Zhang, Wei; Li, Xiaobai; Fu, Hong; Xu, Feng (2023-05-03)
Li, Bin
Zhang, Wei
Li, Xiaobai
Fu, Hong
Xu, Feng
Elsevier
03.05.2023
Li, B., Zhang, W., Li, X., Fu, H., & Xu, F. (2023). ECG signal reconstruction based on facial videos via combined explicit and implicit supervision. Knowledge-Based Systems, 272, 110608. https://doi.org/10.1016/j.knosys.2023.110608
https://creativecommons.org/licenses/by-nc-nd/4.0/
© 2023. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/
https://creativecommons.org/licenses/by-nc-nd/4.0/
© 2023. This manuscript version is made available under the CC-BY-NC-ND 4.0 license https://creativecommons.org/licenses/by-nc-nd/4.0/
https://creativecommons.org/licenses/by-nc-nd/4.0/
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:oulu-202409276094
https://urn.fi/URN:NBN:fi:oulu-202409276094
Tiivistelmä
Abstract
Accurate and continuous electrocardiogram (ECG) signal measurements are of paramount importance for the prevention of cardiovascular disease. Existing physiological measurement methods require the placement of sensors on the human body, which proves uncomfortable and inconvenient during long-term monitoring. Recent research has found that heart rates can be extracted from noncontact RGB facial videos by measuring subtle color changes in the skin region. However, designing an accurate feature representation structure for physiological measurements remain a challenge. This study presents a novel end-to-end ECG signal synthesis network using short facial video sequences. We combined explicit and implicit supervision structures, which allows the network to perceive physiological-related features from unlabeled feature sets and adopts a channel and frame aggregation attention mechanism to boost the network acquisition of periodic spatiotemporally correlated features. Finally, the generated features were fed into a physiological estimator consisting of a serially connected “Generative Adversarial” module for ECG signal reconstruction. Experimental results on publicly available datasets and a new dataset collected by us demonstrate that the proposed network outperforms state-of-the-art methods in ECG signal measurements. This is a powerful tool for use in telemedicine and health monitoring.
Accurate and continuous electrocardiogram (ECG) signal measurements are of paramount importance for the prevention of cardiovascular disease. Existing physiological measurement methods require the placement of sensors on the human body, which proves uncomfortable and inconvenient during long-term monitoring. Recent research has found that heart rates can be extracted from noncontact RGB facial videos by measuring subtle color changes in the skin region. However, designing an accurate feature representation structure for physiological measurements remain a challenge. This study presents a novel end-to-end ECG signal synthesis network using short facial video sequences. We combined explicit and implicit supervision structures, which allows the network to perceive physiological-related features from unlabeled feature sets and adopts a channel and frame aggregation attention mechanism to boost the network acquisition of periodic spatiotemporally correlated features. Finally, the generated features were fed into a physiological estimator consisting of a serially connected “Generative Adversarial” module for ECG signal reconstruction. Experimental results on publicly available datasets and a new dataset collected by us demonstrate that the proposed network outperforms state-of-the-art methods in ECG signal measurements. This is a powerful tool for use in telemedicine and health monitoring.
Kokoelmat
- Avoin saatavuus [38865]