Closed this issue 7 years ago · 2 comments
for assembler's work.
passed variable.
indices = [l1, l2, l3, ...] start_inx = [0, l1, l1+l2, l1+l2+l3, ...] ---> one pass to accumulate.
parallel assembly is slower.
It is never going to be computational intense.