GPU acceleration of average gradient method for solving partial differential equations
Puro, Touko; Pohjonen, Aarne (2025-01-13)
Puro, Touko
Pohjonen, Aarne
Linköping university electronic press
13.01.2025
Puro, T., & Pohjonen, A. (2025, January 13). GPU acceleration of average gradient method for solving partial differential equations. Proceedings of the Second SIMS EUROSIM Conference on Modelling and Simulation, SIMS EUROSIM 2024. https://doi.org/10.3384/ecp212.066
https://creativecommons.org/licenses/by/4.0/
© 2025 Touko Puro, Aarne Pohjonen. This work is licensed under a Creative Commons Attribution 4.0 International License.
https://creativecommons.org/licenses/by/4.0/
© 2025 Touko Puro, Aarne Pohjonen. This work is licensed under a Creative Commons Attribution 4.0 International License.
https://creativecommons.org/licenses/by/4.0/
Julkaisun pysyvä osoite on
https://urn.fi/URN:NBN:fi:oulu-202502211790
https://urn.fi/URN:NBN:fi:oulu-202502211790
Tiivistelmä
Abstract
Previously presented method of calculating local average gradients for solvingpartial differential equations (PDEs) is enhanced by accelerating it with graphics processingunits (GPUs) and combining a previous technique of interpolating between grid points in thecalculation of the gradients instead of using interpolation to create a denser grid.For accelerating the calculation with GPUs, we have ported the original naive Matlabimplementation to C++ and CUDA, and after optimizing the code we observe a speedupfactors more than two thousand, which is largely due to the original code not being optimized.
Previously presented method of calculating local average gradients for solvingpartial differential equations (PDEs) is enhanced by accelerating it with graphics processingunits (GPUs) and combining a previous technique of interpolating between grid points in thecalculation of the gradients instead of using interpolation to create a denser grid.For accelerating the calculation with GPUs, we have ported the original naive Matlabimplementation to C++ and CUDA, and after optimizing the code we observe a speedupfactors more than two thousand, which is largely due to the original code not being optimized.
Kokoelmat
- Avoin saatavuus [38840]