su2code/SU2

Problem running shape_optimization.py in parallel: error '139'

aozuzun opened this issue · 1 comment

I have the same problem. On the cluster, I can run a normal analysis (not an optimization) without any issue. When I try to run an optimization on the HPC nodes, it fails with the same error at a random stage, for example sometimes at DSN2 Deform and sometimes at DSN1 Adjoint.

error.txt

Could you suggest any solution?

Originally posted by @mardar572 in #1666 (comment)

I managed to solve the error on my own. It went away once I passed the following MCA parameters to mpirun:

mpirun --mca btl_openib_cpc_exclude rdmacm --mca mpi_leave_pinned 0 --mca btl_openib_allow_ib 1 --mca btl openib,self,vader -n $NSLOTS
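For anyone who cannot easily edit the mpirun line (for example, when a wrapper script assembles the launch command), the same settings can probably be supplied through the environment instead. This is a minimal sketch, assuming Open MPI is the MPI implementation in use. Open MPI reads any MCA parameter from an environment variable named `OMPI_MCA_<parameter>`, so the four `--mca` options above map to the exports below. I have not verified this on the cluster from this issue, and the comments describe my understanding of each parameter rather than a confirmed diagnosis.

```shell
#!/bin/sh
# Environment-variable form of the --mca options from the command above.
# Assumes Open MPI, which picks up OMPI_MCA_<name> variables at startup.

# Exclude the RDMA CM connection manager from the openib BTL.
export OMPI_MCA_btl_openib_cpc_exclude=rdmacm

# Turn off the "leave pinned" memory registration behaviour.
export OMPI_MCA_mpi_leave_pinned=0

# Allow the openib BTL to use InfiniBand devices.
export OMPI_MCA_btl_openib_allow_ib=1

# Restrict the byte transfer layers to openib, self and vader (shared memory).
export OMPI_MCA_btl=openib,self,vader

# Print what was set so the values show up in the job log.
env | grep '^OMPI_MCA_' | sort
```

Put these lines in the job script before the optimization is launched. Values given explicitly with `--mca` on the mpirun command line should still take precedence over the environment.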