
AssertionError: Probabilities don't sum to 1 along axis 1

The function utils/compute_per_position_ic(ppm, background, pseudocount) raises this error. It says that the probabilities in ppm don't sum to 1 along axis 1. I think it is related to a warning I got earlier:

RuntimeWarning: invalid value encountered in true_divide
  vecs1/np.linalg.norm(vecs1, axis=1)[:,None]
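
For anyone hitting the same warning: NumPy emits it when a division produces 0/0, which is what happens in an expression like the one above if some row has zero norm. A minimal sketch, where the array values are made up for illustration and only the normalization expression comes from the warning text (newer NumPy versions word the message with "divide" rather than "true_divide"):

```python
import warnings

import numpy as np

# Made-up data: the second row is all zeros, so its norm is 0.
vecs1 = np.array([[3.0, 4.0],
                  [0.0, 0.0]])

with warnings.catch_warnings(record=True) as caught:
    warnings.simplefilter("always")
    with np.errstate(invalid="warn"):
        # Same expression as in the warning: row-wise L2 normalization.
        normed = vecs1 / np.linalg.norm(vecs1, axis=1)[:, None]

print(normed)                             # the zero row becomes NaN
print([str(w.message) for w in caught])   # the RuntimeWarning text
```

So the warning by itself only says that at least one vector was entirely zero; it does not say why.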

I'm using gradient × input as the importance scores and the gradient alone as the hypothetical scores. Does this error message imply that the importance scores are problematic?

Details attached below:

AssertionError: Probabilities don't sum to 1 along axis 1 in [[0.28571429 0.16666667 0.22619048 0.20238095

My suspicion is that there is a problem with your one-hot encoding of the sequence itself. Have you verified that, at every position in your one-hot encoded sequence, exactly one of the four channels (A, C, G, T) is 1 and the others are 0? I think you may have some positions where it’s all zeros.
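
A quick way to run that check, as a rough sketch: it assumes the one-hot array has shape (num_sequences, length, 4), and the function name `invalid_onehot_mask` is made up for this example (it is not part of modisco).

```python
import numpy as np

def invalid_onehot_mask(onehot):
    """Return a boolean (N, L) array that is True wherever a position
    is NOT a valid one-hot vector (exactly one 1, everything else 0)."""
    onehot = np.asarray(onehot)
    is_binary = np.isin(onehot, (0, 1)).all(axis=-1)
    sums_to_one = onehot.sum(axis=-1) == 1
    return ~(is_binary & sums_to_one)

# Toy data: the second sequence has an all-zero position in the middle.
seqs = np.array([
    [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1]],
    [[0, 0, 1, 0], [0, 0, 0, 0], [1, 0, 0, 0]],
], dtype=float)

bad = invalid_onehot_mask(seqs)
print("sequences with invalid positions:", np.flatnonzero(bad.any(axis=1)))
print("number of invalid positions:", int(bad.sum()))
```

If any position is flagged, averaging the one-hot rows over a set of aligned sequences gives per-position frequencies that sum to less than 1, which would match the values in the assertion message.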

Right, that was my problem. It works now that I have removed the sequences with all-zero entries.
I used the DeepSEA dataset downloaded from their official website and assumed the sequences were valid without checking.
Just a heads-up for other people who are also trying modisco on DeepSEA.
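
In case it helps, here is roughly how such sequences can be filtered out while keeping the score arrays aligned. This is only a sketch: it assumes arrays of shape (num_sequences, length, 4), and `valid_sequence_mask` and the variable names are hypothetical, not anything from modisco or DeepSEA.

```python
import numpy as np

def valid_sequence_mask(onehot):
    """True for each sequence in which every position sums to exactly 1."""
    onehot = np.asarray(onehot)
    return (onehot.sum(axis=-1) == 1).all(axis=1)

# Toy one-hot data; the second sequence contains an all-zero position.
onehot = np.array([
    [[1, 0, 0, 0], [0, 1, 0, 0], [0, 0, 0, 1]],
    [[0, 0, 1, 0], [0, 0, 0, 0], [1, 0, 0, 0]],
], dtype=float)

# Stand-in for the gradients (hypothetical scores), same shape as onehot.
hyp_scores = np.arange(onehot.size, dtype=float).reshape(onehot.shape)

keep = valid_sequence_mask(onehot)
onehot_clean = onehot[keep]
hyp_clean = hyp_scores[keep]            # drop the same rows from the scores
imp_clean = hyp_clean * onehot_clean    # gradient x input importance scores

print("kept", int(keep.sum()), "of", len(keep), "sequences")
```

Applying the same mask to the one-hot array and to every score array keeps them in the same order, so they still line up when passed on.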

Thanks for your amazing work.