adjunct/removecomments should be replaced
Opened this issue · 2 comments
The code I have for handling comments is getting hairy because I forgot about single-ticks within double-quotes. For example,
6015 IF WA=0 THEN PRINT "Error: File '";WF$;"' File not found.": END
That line was getting cut off at the first single tick after the word "File". In the meantime, I've kludged the script to simply pass through any lines with double quotes.
The proper solution would be to write a lexer using flex. Technically, a Context-Free Grammar is equivalent to a regular expression, however, it should be a lot easier to understand and debug.
I did the "proper solution", mentioned above. Or at least, I think I did. It was back in 2022 so I've forgotten if there were severe hiccups. It should work on MacOS with a simple make install
.
The current version of https://github.com/hackerb9/tokenize contains a program called "tandy-decomment" which, while not complete, does remove most the comments. Typically one would use the frontend "tokenize -d" to decomment and tokenize at the same time.