FMInference/FlexLLMGen

Why the variable bls must be less than 20?

Opened this issue · 0 comments

It is written in the report that**"Typically, gbs is a multiple of 4, and bls is less than 20 so there are not too many choices."**Could you give the reason how to determine the limitation“bls<20”?