dwadden/dygiepp

parse_ace_event.py cannot be executed correctly

ws-researcher opened this issue · 6 comments

parse_ace_event.py cannot be executed correctly

Can you please provide the command that you're running to cause the error, as well as a full stack trace showing the error message?

When I execute the command 'python ./scripts/data/ace-event/parse_ace_event.py default-settings':
image
It looks like after sent.as_doc(), the id of the token has changed, but the id of the entity has not changed, causing ‘start_token = [tok for tok in sent if tok.idx == entity.start_char]’ to be empty.

I was able to run the script without error. This is a bit tricky to debug, since the ACE distribution isn't public. Let's try this:

  • Confirm that you've created a Conda environment and installed dependences as specified in the README.
  • If the script still breaks, then it must be an issue with a file in your ACE distribution that I don't have. I unfortunately can't offer support for this, but if you're able to identify the file and modify parse_ace_event.py to fix it, I'd gladly accept a PR.

Sorry I can't help more.

I also encountered this problem when I used spacy's en_core_web_md model. And the problem seemed to disappear when I used en_core_web_sm instead.

Interesting, thanks @scanf3 for pointing this out! I just updated the README and the ACE preprocessing code to explicitly use en_core_web_sm; see this pull request.

@ws-researcher, if this resolves your problems, feel free to close this issue.

Thanks for pointing this out!. @scanf3 @dwadden