danny0838/webscrapbook

A regex that used to work with scrapbook-x doesn't work with webscrapbook

grotesque opened this issue · 3 comments

Sample text
~!~happy~!~

I want to write a regex that'll do substring search between two ~!~

With scrapbook-x I used to do (~!~).*(.*app.*).*(~!~) and it used to work.

This doesn't work for webscrapbook re:(~!~).*(.*app.*).*(~!~)

What am I doing wrong?

re: is a switching command and a keyword used with it will be ignored. You should separate the command and the keyword with space(s), i.e. re: (~!~).*(.*app.*).*(~!~).

Legacy ScrapBook X still accepts the keyword used with a command that doesn't need a keyword or an unsupported command. This can cause a confusion and WebScrapBook no more does that.

P.S. Be ware that (~!~).*(.*app.*).*(~!~) is a bad regex pattern that can cause a catastrophic backtracking. Do not really use it.

Thanks. It works with the added space.

I didn't understand the meaning of switching command or keyword. Maybe you can elaborate. I think documentation should explicitly mention the space being needed to run regular expressions.

And maybe we should make it work without the space like scrapbook-x does it. A small thing like this will very likely lead to new people concluding that regex doesn't work here.

The documentation should be clear about this. A command like mc: and re: affects all following normal keywords, and thus need not and should not be used with a keyword. Other commands are used together with a keyword instead. Things like re:keyword have never been documented (even in legacy ScrapBook X), and their behavior are never guaranteed.