jwiegley/emacs-async

Maybe we should stop forcing async-bytecomp-package-mode onto users and maintainers

tarsius opened this issue · 6 comments

And by we I mean @thierryvolpiatto and me, and anyone else who adds something like the following to some of their packages:

(and (require 'async-bytecomp nil t)
     (let ((pkgs (bound-and-true-p async-bytecomp-allowed-packages)))
       (if (consp pkgs)
           (cl-intersection '(all magit) pkgs)
         (memq pkgs '(all t))))
     (fboundp 'async-bytecomp-package-mode)
     (async-bytecomp-package-mode 1))

async-bytecomp-package-mode advices the relevant parts of package.el to use a separate emacs instance to compile packages. This makes it almost impossible that an already loaded version of some package leaks into the byte-code of the newer version when package.el builds the new version in an emacs session in which the old version has already been loaded.

Back when async-bytecomp-package-mode was created package.el did not try to prevent such leakage at all, but nowadays it tries to accomplish the same by first unloading the old version of the package. This is an improvement but this does not actually guarantee that nothing leaks, it just makes it less likely and that simply is not good enough.

Bugs that result from old versions of a package leaking into the byte-code of the new version are extremely difficult to debug. They are always heisenbugs because the code that causes the bug literally does not actually exist in any version of the package.

If one maintains a handful of not all that popular packages, then that is probably okay. You might run into this issue without realizing it and that is okay--one mystery bug per year is okay. Similarly if you maintain Emacs but not any separate packages, then you won't have to deal with this issue much either.

But if you maintain Helm or Magit, then this happens all the time. And it gets really really frustrating to deal with bugs that simply are not possible because there just isn't any code that could possibly be doing what the user claims it is doing. For every bug report you have to keep in the back of your mind that this could be yet another instance of leaked older implementations.

As far as I remember the Emacs maintainers considered such leakage mostly as a theoretical issued but as the maintainer of a very popular package I experienced it very often and so I felt ignored and neglected.

We should now avoid doing something similar to other maintainers and the users of their packages.

There have been reports for a while now about what I call mystery bugs, which may very well be caused by our kludges that are supposed to prevent (a different class of) mystery bugs. E.g. #108, magit/with-editor#85 and magit/magit#2404 (comment).

I for one plan to remove the async kludges from my package, at least for a while. Who knows, maybe the unloading kludge used by package.el now actually is good enough.

Instead I will distribute with Magit a tool for recompiling Magit and its dependencies and always ask users to run that when they report a bug that I cannot immediately reproduce and that sounds even just slightly fishy. It's a bit sad.

The removal of the async kludges from my packages is experimental. If this causes a bunch of mystery bugs, then I would have to revert.

/cc @jwiegley @Malabarba @Stebalien

Alternatively, it may be possible to configure this on a per-package basis. I.e., some form of (async-bytecomp-this-file) form that, when evaluated (not compiled):

  1. Enables a restricted version of async-bytecomp-mode that only asynchronously byte-compiles specified files.
  2. Adds the current load-file-name to this list.

I'm happy to take a stab at implementing this functionality if there's interest.

Alternatively, a local variable may work even better (not sure if my first solution works as I think bytecomp happens before load). That is, always enable a minimal version of async-bytecomp-mode that asynchronously compiles packages marked with async-bytecomp: t.

You can't mark packages like this, at least as it would be in the
*pkg.el file because package.el is rebuilding a new *pkg.el on its own
instead of using the one that is already there and made by the
maintainer. I requested long time ago this feature but was refused.
And I don't want to go the Emacs way of parsing source files headers
e.g. magit.el or helm.el to find infos about a package, source files are
there to provide code, not packaging infos.

Emacs already does this to by setting file local variables. They're used to:

  • Mark files with no-byte-compile.
  • Enable lexical scoping.

Adding a new one to force async compile would be trivial.

You have the variable async-bytecomp-allowed-packages for this which
had by default magit and helm, but have been modified to 'all to handle
all packages.

I can do that in my config. But package maintainers can't just "opt-in" without potentially clobbering the user's preferences.