CSI2Depth: Spatio-Temporal Depth Images from Wi-Fi CSI Data via Transformer Networks and Conditional Generative Adversarial Networks
Álvarez Casado, Constantino; Lage Cañellas, Manuel; Mustaniemi, Janne; Pedone, Matteo; Silvén, Olli; Bordallo López, Miguel (2025-06-16)
Avaa tiedosto
Sisältö avataan julkiseksi: 16.06.2026
Álvarez Casado, Constantino
Lage Cañellas, Manuel
Mustaniemi, Janne
Pedone, Matteo
Silvén, Olli
Bordallo López, Miguel
Springer
16.06.2025
Álvarez Casado, C., Lage Cañellas, M., Mustaniemi, J., Pedone, M., Silvén, O., Bordallo López, M. (2025). CSI2Depth: Spatio-Temporal Depth Images from Wi-Fi CSI Data via Transformer Networks and Conditional Generative Adversarial Networks. In: Petersen, J., Dahl, V.A. (eds) Image Analysis. SCIA 2025. Lecture Notes in Computer Science, vol 15725. Springer, Cham. https://doi.org/10.1007/978-3-031-95911-0_26
https://rightsstatements.org/vocab/InC/1.0/
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
https://rightsstatements.org/vocab/InC/1.0/
© 2025 The Author(s), under exclusive license to Springer Nature Switzerland AG
https://rightsstatements.org/vocab/InC/1.0/
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:oulu-202508115255
https://urn.fi/URN:NBN:fi:oulu-202508115255
Tiivistelmä
Abstract
Depth estimation is important for 3D reconstruction, robotics, and augmented reality. It is also increasingly relevant in Integrated Sensing and Communication (ISAC), where spatial awareness supports adaptive network optimization, beamforming, and digital twins. Traditional depth and RF-based sensors provide accurate measurements but often require active transmissions or specialized hardware. In contrast, Wi-Fi Channel State Information (CSI) enables passive sensing using existing wireless infrastructure. However, classical RF models struggle with multipath propagation, diffraction, and shadowing, limiting depth estimation in complex indoor environments. This study presents a novel proof-of-concept for generating depth maps from raw 5 GHz single-input multiple-output (SIMO) CSI data. A transformer encoder extracts spatio-temporal features from CSI amplitude and phase, which are processed by a conditional generative adversarial network (cGAN) to synthesize depth maps. Unlike prior CSI-based work focused on activity recognition or point cloud estimation, depth maps provide structured outputs that are computationally efficient and compatible with computer vision techniques. Evaluated on the public MM-Fi dataset, the model captures temporal depth variations associated with scene geometry and obstructions. The results demonstrate the feasibility of passive depth estimation from CSI under stable infrastructure with adequate signal quality and antenna diversity, contributing to the development of ISAC-enabled wireless systems. The code is available at: https://github.com/Arritmic/csi2depth.
Depth estimation is important for 3D reconstruction, robotics, and augmented reality. It is also increasingly relevant in Integrated Sensing and Communication (ISAC), where spatial awareness supports adaptive network optimization, beamforming, and digital twins. Traditional depth and RF-based sensors provide accurate measurements but often require active transmissions or specialized hardware. In contrast, Wi-Fi Channel State Information (CSI) enables passive sensing using existing wireless infrastructure. However, classical RF models struggle with multipath propagation, diffraction, and shadowing, limiting depth estimation in complex indoor environments. This study presents a novel proof-of-concept for generating depth maps from raw 5 GHz single-input multiple-output (SIMO) CSI data. A transformer encoder extracts spatio-temporal features from CSI amplitude and phase, which are processed by a conditional generative adversarial network (cGAN) to synthesize depth maps. Unlike prior CSI-based work focused on activity recognition or point cloud estimation, depth maps provide structured outputs that are computationally efficient and compatible with computer vision techniques. Evaluated on the public MM-Fi dataset, the model captures temporal depth variations associated with scene geometry and obstructions. The results demonstrate the feasibility of passive depth estimation from CSI under stable infrastructure with adequate signal quality and antenna diversity, contributing to the development of ISAC-enabled wireless systems. The code is available at: https://github.com/Arritmic/csi2depth.
Kokoelmat
- Avoin saatavuus [42497]