TurbSim: OpenMP parallelization in factorization and FFT #3020

andrew-platt · 2025-10-02T04:55:51Z

Needs some review and testing.

Feature or improvement description
TurbSim has been single-threaded forever. This PR adds parallelization through OpenMP and should in theory run faster as a result.

If OpenMP is enabled at compile time, parallelization for MKL will be automatically turned off. We saw this the MKL parallelization is an issue in #3018 when the MKL would try to parallelize some tiny matrix manipulation incurring massive overhead costs.

Still need to extend to the following routines:

CalcFourierCoeffs_API
CalcFourierCoeffs_General
CalcFourierCoeffs_None

Related issue, if one exists
#3018

Impacted areas of the software
TurbSim only

Additional supporting information

A PRIVATE(<allocatable_array>) in a !$OMP PARALLEL DO pragma doesn't always allocate the copy of that array. This leads to segfaults. I have messy workarounds for this. There may be a better way to do it, but I have no idea if the ROCM Flang compiler will be able to handle anything more advanced.
Using THREADPRIVATE with COPYIN works better than the original solution

Test results, if applicable
No test results should change.

- Very cludgy mess. Seems to work locally though.

The TRH_in in the SHARED was causing memory issues. Removing that makes things identical now.

No need for the inner loop.

…se THREADPRIVATE and COPYIN to create the necessary copies of TRH array in CalcFourierCoeffs_IEC instead of doing it manually

…hen not using OpenMP or explicitly setting the BLAS vendor. This should reduce the chances of slowdowns from calling the MKL with many small calculations trying to distribute it over many threads.

modules/turbsim/src/TSsubs.f90

andrew-platt

@deslaughter's changes are great - a much better method than what I proposed.

bjonkman · 2025-11-29T23:55:04Z

modules/turbsim/src/TSsubs.f90

-CONTAINS
-   SUBROUTINE Cleanup()
-
-      IF ( ALLOCATED( Dist      ) ) DEALLOCATE( Dist      )


In this subroutine, we are allocating memory, but with removing the cleanup() reoutine, we are not explicitly deallocating it anymore. I know this routine is called only once, so maybe isn't a big concern, but I've seen this cause memory leaks or allocation errors because the array is already allocated when the routine gets called a second time. Thoughts?

andrew-platt added 3 commits October 1, 2025 20:19

TurbSim: add OMP directives on Coeffs2TimeSeries

1184401

TurbSim: add OMP on CalcFourierCoeffs_IEC

5f9d7c5

- Very cludgy mess. Seems to work locally though.

TurbSim: OpenMP notes and limits on nested in Coh2H

1637f4a

andrew-platt added this to the v4.2.0 milestone Oct 2, 2025

andrew-platt requested review from deslaughter and mayankchetan October 2, 2025 04:55

andrew-platt self-assigned this Oct 2, 2025

andrew-platt added the Module: TurbSim label Oct 2, 2025

andrew-platt requested a review from bjonkman October 2, 2025 04:57

andrew-platt force-pushed the f/TurbSim_OpenMP branch from f090d87 to 1637f4a Compare October 2, 2025 05:09

andrew-platt added 2 commits October 1, 2025 23:22

TurbSim: correction to OMP directive

cd17f89

The TRH_in in the SHARED was causing memory issues. Removing that makes things identical now.

TurbSim: simplify looping in EyeCoh2H

448f1a5

No need for the inner loop.

andrew-platt mentioned this pull request Oct 2, 2025

Turbsim uses up all available threads when not compiled with OpenMP #3018

Closed

deslaughter added 2 commits November 24, 2025 17:19

Remove the _OPENMP ifdefs and let OpenMP set the number of threads. U…

5291ed3

…se THREADPRIVATE and COPYIN to create the necessary copies of TRH array in CalcFourierCoeffs_IEC instead of doing it manually

Change main CMakeLists.txt file to prefer sequential version of MKL w…

5858941

…hen not using OpenMP or explicitly setting the BLAS vendor. This should reduce the chances of slowdowns from calling the MKL with many small calculations trying to distribute it over many threads.

andrew-platt commented Nov 24, 2025

View reviewed changes

modules/turbsim/src/TSsubs.f90 Show resolved Hide resolved

andrew-platt commented Nov 25, 2025

View reviewed changes

bjonkman reviewed Nov 29, 2025

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

TurbSim: OpenMP parallelization in factorization and FFT #3020

TurbSim: OpenMP parallelization in factorization and FFT #3020

Uh oh!

andrew-platt commented Oct 2, 2025 •

edited

Loading

Uh oh!

Uh oh!

andrew-platt left a comment

Uh oh!

bjonkman Nov 29, 2025

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

TurbSim: OpenMP parallelization in factorization and FFT #3020

Are you sure you want to change the base?

TurbSim: OpenMP parallelization in factorization and FFT #3020

Uh oh!

Conversation

andrew-platt commented Oct 2, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

andrew-platt left a comment

Choose a reason for hiding this comment

Uh oh!

bjonkman Nov 29, 2025

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

andrew-platt commented Oct 2, 2025 •

edited

Loading