fastmachinelearning/hls4ml

Problem tracing binary CNN model after recent tracing optimization

Closed this issue · 8 comments

Prerequisites

Please make sure to check off these prerequisites before submitting a bug report.

  • Test that the bug appears on the current version of the master branch. Make sure to include the commit hash of the commit you checked out.
  • Check that the issue hasn't already been reported, by checking the currently open issues.
  • If there are steps to reproduce the problem, make sure to write them down below.
  • If relevant, please include the hls4ml project files, which were created directly before and/or after the bug.

Quick summary

Error when running tracing on a binary CNN (though I do not think the fact that the network is binary is important).

Details

When I tried to run tracing on a binary CNN (from #740):

https://gist.github.com/jmitrevs/0455fde237ccdc778c619e853e64dde6

it caused the following error:

Traceback (most recent call last):
  File "/Users/jmitrevs/work/hls4ml/test/pytest/profile_binary_cnn.py", line 103, in <module>
    test_model2("Vivado", "io_parallel")
  File "/Users/jmitrevs/work/hls4ml/test/pytest/profile_binary_cnn.py", line 98, in test_model2
    keras_trace = hls4ml.model.profiling.get_ymodel_keras(model2, X)
  File "/Users/jmitrevs/work/hls4ml/hls4ml/model/profiling.py", line 592, in get_ymodel_keras
    and layer.activation.__name__ != "linear"
AttributeError: 'NoneType' object has no attribute '__name__'. Did you mean: '__ne__'?

The code seems to be new, from #863.

Steps to Reproduce

Run the file in the linked "gist" using the main branch of hls4ml.

I am including @AdrianAlan and @jmduarte because they were involved in the latest tracing updates.

Thanks for catching that. I will take a look.

There's a fix for this in #961 (that PR needs a bit more work to clean up).

I am not sure it's a complete fix. When I attempted the fairly straightforward fix of checking that it's not None, I ran into issues when the activation was a string.

We should validate the fix on this, as well.

The fix LGTM.

@jmitrevs What do you mean when you say you ran into issues? When you add, let's say, activation='relu', in L33 in your example, the output looks all right, but the affected layer is out of order. Is that what you refer to?

Maybe it does fix it. I just checked out the PR, and I get no error. I thought that it complained about `__name__` being extracted from a string activation, but I don't see that problem any more. So I think once #961 goes in we can close this.

This has been fixed with the merge of #987 (which replaced #961).