A Review of Causal Methods for High-Dimensional Data
Berkessa, Zewude A.; Läärä, Esa; Waldmann, Patrik (2024-12-30)
Berkessa, Zewude A.
Läärä, Esa
Waldmann, Patrik
IEEE
30.12.2024
Z. A. Berkessa, E. Läärä and P. Waldmann, "A Review of Causal Methods for High-Dimensional Data," in IEEE Access, vol. 13, pp. 11892-11917, 2025, doi: 10.1109/ACCESS.2024.3524261.
https://creativecommons.org/licenses/by/4.0/
© The Author(s) 2024. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
https://creativecommons.org/licenses/by/4.0/
© The Author(s) 2024. This work is licensed under a Creative Commons Attribution 4.0 License. For more information, see https://creativecommons.org/licenses/by/4.0/.
https://creativecommons.org/licenses/by/4.0/
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:oulu-202501131133
https://urn.fi/URN:NBN:fi:oulu-202501131133
Tiivistelmä
Abstract
Causal learning from observational data is an important scientific endeavor, but the statistical and computational challenges posed by the high-dimensionality of many modern datasets are substantial. Peculiarities such as spurious correlations, endogeneity, noise accumulation, and deflated empirical covariance estimation complicate analysis. These issues may lead to confounding bias, which can be misleading when attempting to learn the true causal relationships and causal effects between variables. In this survey, we provide a comprehensive review of causal analysis and the theory behind high dimensionality. Next, we discuss the effects of high dimensionality on causal estimation methods and their corresponding solutions. Finally, we present evaluation metrics and software tools for both causal effect estimation and causal discovery.
Causal learning from observational data is an important scientific endeavor, but the statistical and computational challenges posed by the high-dimensionality of many modern datasets are substantial. Peculiarities such as spurious correlations, endogeneity, noise accumulation, and deflated empirical covariance estimation complicate analysis. These issues may lead to confounding bias, which can be misleading when attempting to learn the true causal relationships and causal effects between variables. In this survey, we provide a comprehensive review of causal analysis and the theory behind high dimensionality. Next, we discuss the effects of high dimensionality on causal estimation methods and their corresponding solutions. Finally, we present evaluation metrics and software tools for both causal effect estimation and causal discovery.
Kokoelmat
- Avoin saatavuus [38865]