cvxgrp/diffcp

_solve_adjoint_derivative_dense much slower than np.linalg.solve

zcyang opened this issue · 2 comments

diffcp is installed with openmp flags:

MARCH_NATIVE=1 OPENMP_FLAG="-fopenmp" pip install diffcp

It's at least 5 times slower than np.linalg.solve.
Eigen solve should not be much slower than np.linalg.solve.

Report here in case the code performance can be improved.

It seems diffcp is using many cores in backward when batch_size = 1 ?