plugin: Add support for closing plugins
zacps opened this issue · 61 comments
Adding support for closing plugins would give a simple way to do hot code reloading in long running applications like web servers.
This would add another method to src/plugin/plugin.go
and a new file src/plugin/dl_close.go
that would handle closing the dlhandle and destroying the go symbol map.
It would also add the handle as an un-exported unsafe pointer to the go plugin struct.
The main issue is how to signal that a plugin has been closed without breaking API compatibility. Currently the plugin has a channel called loaded
which is closed when the plugin is loaded. The simplest solution is to add another channel which is closed when the plugin is closed, however this doesn't seem very 'neat' to me.
I don't see how this could be implemented safely. Other Go code could have references to functions and symbols loaded from the plugin-would unloading the plugin make those dangling references, or delay the unloading of a plugin until all the references have been reclaimed? Neither sound workable.
Would it be possible to make close fail until the reference count of all symbols in the plugin reached 1, so that the plugin itself was the only structure that held a reference?
Alternatively you could have a different function which checked whether references to the symbols existed and if there were none close the dlhandle, leaving the destruction of the plugin object and any other references up to the program. While this is far from ideal I think with adequate caution it could be a workable solution.
Would it be possible to make close fail until the reference count of all symbols in the plugin reached 1, so that the plugin itself was the only structure that held a reference?
This is spinny and racy.
Alternatively you could have a different function which checked whether references to the symbols existed and if there were none close the dlhandle, leaving the destruction of the plugin object and any other references up to the program. While this is far from ideal I think with adequate caution it could be a workable solution.
This sounds like a finaliser, with the associated problems of finaliser non determinism.
Could you be a bit more specific about the second point?
Your proposal is to add a way to free resources associated with a plugin. If this is delegated to a finaliser it becomes non deterministic when this freeing occurs as finalisers are not guaranteed to run promptly, or indeed, at all.
I'm not suggesting that it's run automatically by the GC, or in a defer (unfortunately).
I'm suggesting that if the needs to be unloaded that the developer will keep track of references to it's symbols then when they no longer need them destroy the references then explicitly call close.
In that situation they will either forget to call close, leaving the handle open, close successfully or attempt to close when there are still references which will fail. It could possibly fail with a panic rather than a plain error as attempting the close a resource that still has a reference is a pretty big bug.
I don't see what issues this approach has beyond reliance on the developer to not make mistakes. It does introduce a need to keep track of references which is far from perfect and not something I'd like to introduce to go. However I think that it's worth having this for those that want to use it and understand the additional complexity it introduces.
What if the plugin started a goroutine?
Go as a language and a runtime environment tends to avoid providing facilities that are easy to get wrong when getting them wrong causes the program to crash or behave unpredictably. I don't see how to make this at all reliable, so to me it doesn't seem to be a good fit for Go.
goroutines would have to be handled in the same way, requiring the developer to close them.
I can understand not wanting to introduce functionality that's easy to misuse. I just think that it's better to have the ability to do it if you know what you're doing. I guess it does go against the philosophy of go though.
I took a quick look at runtime
but couldn't find where exactly the goroutine is created. Are goroutines not closed when they return?
Ok, that's what I thought.
I personally think that manual bookkeeping wouldn't be too bad. If you're writing something designed to be loaded/unloaded you're probably going to make sure you keep track of references and goroutines as is.
I'm trying to think whether there's a better way to do this. I guess it's not something other languages have had problems with.
I have another idea for how this could be solved, it's not simple though and I'm not sure whether it would make sense.
Basically I think it would be possible to implement a compile-time check that ensured there was no possibility for a dangling reference when a program was closed.
The compiler could check to see if close was called on any plugin object in the program. If it was we'd find all of the lookup calls associated with that plugin and determine whether there were any paths that could leave a live reference when or after closed was called.
I have no idea how you'd implement this or whether it'd be fast enough to be viable. Just an idea.
What about having the GC close any plugins that are no longer referenced?
That's what the JVM does. And forcing a full GC whenever Go code must be unloaded is not hard.
@davecheney Any comment on making it a compile-time check?
I don't see how it's like a finaliser, at least in the aspect of non-determinism.
It is pretty similar to rust's general approach; I don't think that's a bad thing though.
If I load a plugin foobar-v1.so that has foobar()
I could wrap it in my own functionfoobarWapper()
that increments/decrements a waitgroup on every call. Things that want to call foobar do so accessing foobarWraper atomically and calling it. Later on I load foobar-v2.so, wrap it in a new wrapper, and atomically swap it out for out for the old foobarWrapper. Now I can use the old foobarWraper's wait group to make sure no one is active and when that is the case call dlclose using cgo.
This all would depend on the user not creating goroutines, but that could be checked for at the source level if you are building plugins from user submitted code.
@zacps Maybe something like the above would work for you.
Wouldn't you have to either use code generation or lose type safety though?
Wouldn't you have to either use code generation or lose type safety though?
@zacps I'm not sure how type safety is related to this...
This all would depend on the user not creating goroutines, but that could be checked for at the source level if you are building plugins from user submitted code.
So using my approach you would write a program to vet the user code before building the plugin. Probably by having a white-list of importable packages, and searching the code to get for invocations of "go X" and any functions in white listed packages you want to also blacklist.
I second starting with an unsafe way to load and unload plugins, with the expectation that the user will sanitize code before compiling it. It's OK if attempting to unload a plugin panics when there exist dangling pointers/references, and it's also OK if the panic happens later nondeterministically when the dangling pointers/references are accessed.
I currently use code generation for templates. I have a template transpiler which transpiles text/template
templates to Pure Go (it's a little coupled with my application right now, but I could always split it off and it currently misses a few features).
It would be nice, if I could reload these templates on the fly without restarting the entire instance, something impossible without the ability to close plugins or rolling my own JIT compiler.
Here is a thought: what about having plugins export their API as a first-class object, such that the plugin gets unloaded when the API object is garbage collected?
@DemiMarie this approach is problematic, please see my comments about finalisers earlier in this thread.
Objective C/Swift use automatic reference counting. Maybe that's the approach to take (manual book keeping - but with compiler's assistance)
Indirectly related for some use cases: https://www.datadoghq.com/blog/engineering/cgo-and-python/
This whole thread seems to be discussing a non-issue
To quote the man pages on dlclose(3)
A successful return from dlclose() does not guarantee that the
symbols associated with handle are removed from the caller's address
space. In addition to references resulting from explicit dlopen()
calls, a shared object may have been implicitly loaded (and reference
counted) because of dependencies in other shared objects. Only when
all references have been released can the shared object be removed
from the address space.
So why would we argue about trying to enforce something stronger than the underlying implementation?
Golang has no way to force you to lose your reference to something, nor do I think it should. It is and always has been you (the programmer's) responsibility to not use a resource that you have explicitly asked the system to dispose of.
Its one thing for the app to know it is no longer using references to a plugin that was loaded and think its safe to call dlclose
, but what about the runtime, is the GC still going to try to scan that address space and blow up?
Would it be possible to make close fail until the reference count of all symbols in the plugin reached 1, so that the plugin itself was the only structure that held a reference?
This is spinny and racy.
@davecheney what does "spinny" mean?
Its one thing for the app to know it is no longer using references to a plugin that was loaded and think its safe to call
dlclose
, but what about the runtime, is the GC still going to try to scan that address space and blow up?
You stop all goroutines in and towards the plugin and give up all other resources. Then you call plugin close. The runtime will enforce a GC and once this is done it will call dlclose
. It shouldn't matter if this takes some time. In the meantime you can load your new plugin and use it via a new handle.
If the plugin cannot be fully GCed, dlclose
will not be called.
I do not understand why it should be impossible to resolve this issue. It would be good to see a short summary of the reasoning.
From what I've read, the desire to close plugins comes primarily from developers building a fast code reloading system. For example, I have a Go program that...
- Compiles
file_x.go
into a plugin, loads it, and calls a function loaded from the plugin. - Watches
file_x.go
for changes - When the file changes, rebuild the plugin, reload it, and call the function again.
This has two issues:
- Loading a plugin consumes memory which cannot be freed, so overall memory usage increases with each change to
file_x.go
- Plugins have identity, and it's not possible to load a plugin twice. To workaround this, it's possible to give each version of the plugin a unique identity. Instead of reloading a plugin, you load a new version with a unique identity. This feels hacky.
Regarding issue 1: perhaps the garbage collector, not the program, should be responsible for releasing the memory associated with a plugin. #28783 seems related.
It's worth calling out that if the garbage collector is responsible for freeing plugin resources, then there's no way for a program to know when those resources have been freed, and that's a good thing because it removes complexity and matches the overall design of memory management in Go.
Regarding issue 2: perhaps there's a better way to manage the identity of plugins that have many, rapid iterations. Or, maybe the existing hack is enough. Seems like we should define this problem clearly and explore solutions before we decide that implementing Plugin.Close
is the best solution for that particular issue.
I'd like to note: I think this is an important issue. I don't consider fast code reloading to be an obscure edge case. There are many projects trying to build Go interpreters and REPLs and test runners and other interesting ways to iterate quickly on Go code. "Building a better Go linker" alludes to an interesting future where Go can compile, link, and load code directly into memory; that would open some interesting doors. Many projects would benefit significantly by having a way to collect unused plugin resources.
Example reloading code: https://github.com/buchanae/go-code-reload
One solution is to forget about the plugin package and use wasm: https://medium.com/wasmer/announcing-the-fastest-webassembly-runtime-for-go-wasmer-19832d77c050
@pjebs Every project I have used plugins on, for the use case @buchanae called out above, made the move because having to keep large pools of Lua or V8 runtimes around and call them via cgo to process externally loaded per transaction logic was too slow, used lots of memory and was dangerous. What you are proposing doesn't help at all. Its just another run-time to have to try to manage that happens to run WASM.
There are pure go wasm runtimes
Oberon's implementation BlackBox has ability to unload modules and able to hotswap them. When you ask runtime to unload module - it marks it as unreachable and when there are no pointers to it's memory - it is removed. If someone asks for types from unloaded module - runtime tries lo load it again, but it can be a newer version. So it is possible to have multiple instances of module at the same time. When all links to old one are removed - there will be only one!
I think it would be killer feature for web development or long running applications.
In case anyone else had the brilliant idea to do something quick and dirty to support loading new versions of plugins, I tried:
- Building the .so file with a different package path for each reload.
- Copying the .so file to a random file on /tmp (tmpfs/memory filesystem on my machine)
- Wrapping
plugin.Open
with some cgo to cache the handle returned bydlopen
- Implementing a
Close
function that callsdlclose
with the cached value of the shared library handle. - Removing the scratch .so file in /tmp
It didn't work for me. The disk usage on the /tmp filesystem seemed to continue growing with each reload operation. It panics 🤷♂️. I may be doing something wrong, of course.
// +build linux,cgo darwin,cgo freebsd,cgo
package unsafeplugin
/*
#cgo linux LDFLAGS: -ldl
#include <dlfcn.h>
#include <limits.h>
#include <stdlib.h>
#include <stdint.h>
#include <stdio.h>
static uintptr_t pluginOpen(const char* path, char** err) {
void* h = dlopen(path, RTLD_NOW|RTLD_GLOBAL);
if (h == NULL) {
*err = (char*)dlerror();
}
return (uintptr_t)h;
}
static int pluginClose(uintptr_t h, char** err) {
int i = dlclose((void*)h);
if (i != 0) {
*err = (char*)dlerror();
}
}
static void* pluginLookup(uintptr_t h, const char* name, char** err) {
void* r = dlsym((void*)h, name);
if (r == NULL) {
*err = (char*)dlerror();
}
return r;
}
*/
import "C"
import (
"errors"
"fmt"
"path/filepath"
"plugin"
"sync"
"unsafe"
)
var (
mutex sync.Mutex
pathToDlopenHandle = make(map[*plugin.Plugin]uint64)
)
func Open(path string) (*plugin.Plugin, error) {
mutex.Lock()
defer mutex.Unlock()
realPath, err := filepath.Abs(path)
if err != nil {
return nil, err
}
p, err := plugin.Open(realPath)
if err != nil {
return nil, err
}
cPath := C.CString(realPath)
defer C.free(unsafe.Pointer(cPath))
var cErr *C.char
h := C.pluginOpen(cPath, &cErr)
if h == 0 {
return nil, errors.New(`unsafeplugin.Open("` + path + `"): ` + C.GoString(cErr))
}
fmt.Println("path", realPath, "h", h)
pathToDlopenHandle[p] = uint64(h)
return p, nil
}
func Close(p *plugin.Plugin) error {
mutex.Lock()
defer mutex.Unlock()
h, ok := pathToDlopenHandle[p]
if !ok {
return errors.New(`unsafeplugin.Close()`)
}
var cErr *C.char
// man dlclose:
// > The dynamic linker maintains reference counts for object handles,
// > so a dynamically loaded shared object is not deallocated until dlclose() has been
// > called on it as many times as dlopen() has succeeded on it.
r := C.pluginClose((C.ulong)(h), &cErr)
if r != 0 {
return errors.New(`unsafeplugin.Close(): ` + C.GoString(cErr))
}
r = C.pluginClose((C.ulong)(h), &cErr)
if r != 0 {
return errors.New(`unsafeplugin.Close(): ` + C.GoString(cErr))
}
return nil
}
give a simple way to do hot code reloading...
I developed a solution for that. It is now open source. https://github.com/edwingeng/hotswap
Hotswap provides a solution for reloading your go code without restarting your server, interrupting or blocking any ongoing procedure. Hotswap is built upon the plugin mechanism.
Major Features:
- Reload your code like a breeze
- Run different versions of a plugin in complete isolation
- Invoke an in-plugin function from its host program with Plugin.InvokeFunc()
- Expose in-plugin data and functions with PluginManager.Vault.DataBag and/or PluginManager.Vault.Extension
- Handle asynchronous jobs using the latest code with live function, live type, and live data
- Link plugins statically for easy debugging
- Expose functions to other plugins with Export()
- Depend on other plugins with Import()
When do you have plans to optimize the plugin? For example, the close function, the plugin so size
This thread is just very valid point after another. There should be no reason why plugins can be closed to hot-swap files. Especially comments #20461 (comment) and #20461 (comment).
In fact, I didn't even know golang lacked this functionality because I incorrectly assumed how fundamentally easy it would be to implement.
I just spent an entire day implementing a hot-swap mechanic for a web server that manages about 90 different plugins from hundreds of different developers. All plugins follow a shared spec that I previously helped write, ie "your plugin must close all goroutines when symbol XYZ is called" was included so I can implement this hot swap feature at a later date. Come to find out, at the end of the day that plugin.Open
will cache results based on the file and per this thread, circumventing that results in a panic. So out of the 18 hours that I've just completely wasted, and an awkward phone call I have to make to the people I promised this feature too, please consider putting this function in.
With that being said, give me at least an unsafe
way to do it. Not having this feature for the sake of "it'll be misused by those who don't know what they're doing" is a very-mega-big downer for those who do know what they're doing.
I just spent an entire day implementing a hot-swap mechanic for a web server that manages about 90 different plugins from hundreds of different developers. All plugins follow a shared spec that I previously helped write, ie "your plugin must close all goroutines when symbol XYZ is called" was included so I can implement this hot swap feature at a later date. Come to find out, at the end of the day that
plugin.Open
will cache results based on the file and per this thread, circumventing that results in a panic. So out of the 18 hours that I've just completely wasted, and an awkward phone call I have to make to the people I promised this feature too, please consider putting this function in.
Workaround: Swap to new filenames (Yup....)
Sure memory usage climbs, but there's no other option... and it's been 5 years.
Adding ControllerBus to the discussion of plugin mechanisms / structures / proof-of-concept:
https://github.com/aperturerobotics/controllerbus/tree/master/plugin
This would greatly benefit from the plugin unloading improvements mentioned above; however, I've started working on strategies using IPC messaging to cross the Process barrier rather than sharing memory in the same process. This is a workaround for the lack of strong Plugin features.
it's been a long time. is there any solution for it now?
In the time since I opened this issue I finished high school, completed a Bachelors degree in computer science & physics, and published a couple of scientific articles.
I also learnt Rust (thanks for the recommendation #20461 (comment)).
There is no change here.
looking at this too. any updates?
this is not working for golang 1.21 but for reference
https://github.com/pkujhd/godynamic
anyone taking on this issue? unloading plugin and removing memory used will be great
Nobody is working on this issue and we have no idea how to implement it. Sorry.
@ianlancetaylor From my PoV there are four options:
- Use GC and finalizers to release a plugin when there are no more references to it. This is safe, but not deterministic.
- Declare using a closed plugin to be undefined behavior (in the C sense): if you use a plugin that has been unloaded, anything at all can happen, including executing malicious code. This is deterministic, but not safe.
- Wrap each plugin function in a trampoline that checks if the plugin is still alive and panics if not. This is memory safe and deterministic, but still risks crashing the program.
- Implement something higher-level (such as hot code reloading) directly in the runtime. This would use one of the above options internally but not expose it.
Furthermore, I believe these are the only possible options for a language like Go that cannot encode lifetimes in the type system and does not have destructors. Is this accurate?
@DemiMarie Thanks. However, I'm not sure that gets at the reasons that people care about this. What do people actually want to achieve by closing a plugin? If all that they want is for future calls to the plugin to fail, then I agree that option 3 suffices. But I suspect that they want something more: I suspect that they want to release any resources that the plugin is using. And that seems much harder.
I'm not sure that gets at the reasons that people care about this. What do people actually want to achieve by closing a plugin?
They want to be able to load a new/modified revision of the (same) plugin.
I'm not sure that gets at the reasons that people care about this. What do people actually want to achieve by closing a plugin?
They want to be able to load a new/modified revision of the (same) plugin.
This has been exactly what I wanted for a long time, as well as releasing resources, but over time I realized that it wasn't necessary.
For what it's worth, I've managed to solve this in a slightly hacky method. Go is capable of running other executables. In this way, plugins are now run on a sort of plugin service, for which can be spooled up or down by the primary go executable. It's not great for intense data transfer, because I opted in for all plugin calls to be callable through a REST API, but it solved it in my case. Hopefully it can for some other developers wanting a similar scenario.
@Niko-Kk it's necessary as you said for performance.
Lots of projects use RPC with subprocesses for plugins, like hashicorp go-plugin. I wrote a RPC library for this purpose: https://github.com/aperturerobotics/starpc which also supports typescript. The approach works fine but having the plugins in-process definitely would be better for performance.
@DemiMarie Thanks. However, I'm not sure that gets at the reasons that people care about this. What do people actually want to achieve by closing a plugin? If all that they want is for future calls to the plugin to fail, then I agree that option 3 suffices. But I suspect that they want something more: I suspect that they want to release any resources that the plugin is using. And that seems much harder.
It doesn’t seem much harder to me, provided that the plugin code is written correctly. First, the runtime will check if there are any goroutines executing the plugin’s code. If there are, the plugin can’t be closed. Otherwise, all of the plugin’s entry points will be made to panic, and a Close()
function in the plugin will be called to release non-GC’d resources. Once that function returns, that the plugin code and data will never be used again, the runtime can safely unregister the plugin’s GC roots and unmap the plugin’s code and data, after which the plugin’s global variables are free to be garbage collected.
If this is too complex, a simpler approach would to for the application to drop all references to the plugin, after which the plugin will be unreachable and can be garbage collected. This is the approach used by Java, which will garbage collect classes and class loaders that are no longer reachable. To me, this is the solution that fits the Go langauge best, provided that there is a way (runtime logs?) to debug situations where something is keeping a plugin alive.
First, the runtime will check if there are any goroutines executing the plugin’s code.
I'm not sure that's enough. There might be some variable out there somewhere that is an interface value that has methods that point to the plugin's code. That means that the plugin code might be invoked at some point in the future.
The garbage collection approach is more plausible, but it's not a simple change. Normally we can treat any global variable as a garbage collection root, but a global variable in a plugin would not be a root. We would have to develop a new kind of reference, by which a reference to plugin code would keep plugin global variables alive. That said, one nice feature of the garbage collection approach is that we wouldn't need to have a way to close plugins.
From what I've read, the desire to close plugins comes primarily from developers building a fast code reloading system. For example, I have a Go program that...
- Compiles
file_x.go
into a plugin, loads it, and calls a function loaded from the plugin.- Watches
file_x.go
for changes- When the file changes, rebuild the plugin, reload it, and call the function again.
This has two issues:
- Loading a plugin consumes memory which cannot be freed, so overall memory usage increases with each change to
file_x.go
- Plugins have identity, and it's not possible to load a plugin twice. To workaround this, it's possible to give each version of the plugin a unique identity. Instead of reloading a plugin, you load a new version with a unique identity. This feels hacky.
Regarding issue 1: perhaps the garbage collector, not the program, should be responsible for releasing the memory associated with a plugin. #28783 seems related.
It's worth calling out that if the garbage collector is responsible for freeing plugin resources, then there's no way for a program to know when those resources have been freed, and that's a good thing because it removes complexity and matches the overall design of memory management in Go.
Regarding issue 2: perhaps there's a better way to manage the identity of plugins that have many, rapid iterations. Or, maybe the existing hack is enough. Seems like we should define this problem clearly and explore solutions before we decide that implementing
Plugin.Close
is the best solution for that particular issue.I'd like to note: I think this is an important issue. I don't consider fast code reloading to be an obscure edge case. There are many projects trying to build Go interpreters and REPLs and test runners and other interesting ways to iterate quickly on Go code. "Building a better Go linker" alludes to an interesting future where Go can compile, link, and load code directly into memory; that would open some interesting doors. Many projects would benefit significantly by having a way to collect unused plugin resources.
Example reloading code: https://github.com/buchanae/go-code-reload
I am having a similar problem while reloading the plugin using some solution like "issue 2" but still reloads the previous states. I guess we have to wait then ...
First, the runtime will check if there are any goroutines executing the plugin’s code.
I'm not sure that's enough. There might be some variable out there somewhere that is an interface value that has methods that point to the plugin's code. That means that the plugin code might be invoked at some point in the future.
The garbage collection approach is more plausible, but it's not a simple change. Normally we can treat any global variable as a garbage collection root, but a global variable in a plugin would not be a root. We would have to develop a new kind of reference, by which a reference to plugin code would keep plugin global variables alive. That said, one nice feature of the garbage collection approach is that we wouldn't need to have a way to close plugins.
I suspect that in practice, anyone using this approach will write a helper function that drops all references to the plugin (by setting various pointers to nil) and then forces a GC.
Azareal
Did you found any solution since ?