proposal: spec: immutable type qualifier
romshark opened this issue ยท 126 comments
This issue describes a language feature proposal to Immutable Types. It targets the current Go 1.x (> 1.11) language specification and doesn't violate the Go 1 compatibility promise. It also describes an even better approach to immutability for a hypothetical, backward-incompatible Go 2 language specification.
The linked Design Document describes the entire proposal in full detail, including the current problems, the benefits, the proposed changes, code examples and the FAQ.
Updates
- October 7th: This proposal is approaching its second revision addressing major flaws such as
const
-poisoning, verbosity,const
-keyword overloading and others.
Introduction
Immutability is a technique used to prevent mutable shared state, which is a very common source of bugs, especially in concurrent environments, and can be achieved through the concept of immutable types.
Bugs caused by mutable shared state are not only hard to find and fix, but they're also hard to even identify. Such kind of problems can be avoided by systematically limiting the mutability of certain objects in the code. But a Go 1.x developer's current approach to immutability is manual copying, which lowers runtime performance, code readability, and safety. Copying-based immutability makes code verbose, imprecise and ambiguous because the intentions of the code author are never clear. Documentation can be rather misleading and doesn't solve the problems either.
Immutable Types in Go 1.x
Immutable types can help achieve this goal more elegantly improving the safety, readability, and expressiveness of the code. They're based on 5 fundamental rules:
- I. Each and every type has an immutable counterpart.
- II. Assignments to objects of an immutable type are illegal.
- III. Calls to mutating methods (methods with a mutable receiver type) on objects of an immutable type are illegal.
- IV. Mutable types can be cast to their immutable counterparts, but not the other way around.
- V. Immutable interface methods must be implemented by a method with an immutable receiver type.
These rules can be enforced by making the compiler scan all objects of immutable types for illegal modification attempts, such as assignments and calls to mutating methods and fail the compilation. The compiler would also need to check, whether types correctly implement immutable interface methods.
To prevent breaking Go 1.x compatibility this document describes a backward-compatible approach to adding support for immutable types by overloading the const
keyword (see here for more details) to act as an immutable type qualifier.
Immutable types can be used for:
- immutable fields
- immutable methods
- immutable arguments
- immutable return values
- immutable variables
- immutable interfaces
- immutable reference types and containers
- immutable package-scope variables
Immutable Types in Go 2.x
Ideally, a safe programming language should enforce immutability by default where all types are immutable unless they're explicitly qualified as mutable because forgetting to make an object immutable is easier, than accidentally making it mutable. But this concept would require significant,
backward-incompatible language changes breaking existing Go 1.x code. Thus such an approach to immutability would only be possible in a new backward-incompatible Go 2.x language specification.
Related Proposals
This proposal is somewhat related to:
Detailed comparisons to other proposals are described in the design document, section 5..
Please feel free to file issues and pull requests, become a stargazer,
contact me directly at roman.scharkov@gmail.com and join the conversation on Slack Gophers (@romshark), the international and the russian Telegram groups, as well as the original golangbridge, reddit and hackernews posts! Thank you!
Nice document; clearly you spent a while on it. I only briefly glanced over it.
Copies are the only way to achieve immutability in Go 1.x, but copies inevitably degrade runtime performance. This dilemma encourages Go 1.x developers to either write unsafe mutable APIs when targeting optimal runtime performance or safe but slow and copy-code bloated ones.
Not exactly the only way. An alternative approach is to have an opaque type with only exported methods that provide read-only access, which is how reflect.Type
achieves immutability. The v2 protobuf reflection API also takes this approach. It has other downsides (like needing to manually create methods for each read-only operation), but pointing out that there are non-copy approaches to immutability.
@networkimprov please no $
in names, I don't think we need to start another PHP again
As far as I can tell in my initial skim, this doesn't solve the memchr
problem. In C the standard library memchr
function, which returns a pointer to the first occurence of a character in a string, is defined as char *memchar(const char *, char)
. The problem is that in terms of types, if memchr
is passed a const char *
it should return a const char *
, and if memchr
is passed a char *
it should return a char *
. That is, it should preserve the const-ness of its first argument. But there is no way to write that in C. And I don't see how to write that in your proposal.
Other comments:
- Is there any difference between this proposal and the use of
const
as a type qualifier in C, other than the logical extension to interfaces? - Look up "const-poisoning."
- Pedantically, I don't particularly like using the word "immutable" to describe the parameter to
func F(const []int)
. That slice is not immutable; all that declaration says is thatF
will not change it. This is particularly clear if you writefunc F(s1 const []int, s2 []int)
and then call it asF(s, s)
. You can't say thats1
is immutable withinF
, because ifs2
changes thens1
will change. - Do we really have to worry about immutable containers of mutable values? Yes, that comes up once in a while, but it is often enough to make it worth writing
const [] const T
?
@dsnet
getters/setters are doing just that: they copy stuff from the inner scope of the struct. You
don't want to copy everything every time, and you certainly don't want to do it manually. Writing setters, getters, cloners just for the sake of ensuring immutability is not only quite tedious but also very error-prone due to pointer aliasing, which is the scariest part actually. Copy-code tends to be rather complicated in Go, one wrong copy (like copying a pointer, or naively copying a reference type such as a slice) and you've introduced aliasing that could have terrific, non-obvious consequences. With immutable types though, having read-only aliasing is just fine because there's no mutable shared state.
Currently, the safest way of avoiding manual copying of large structs are interfaces. You could define 2 interfaces where one of them lacks the mutating methods and return interfaces from the getters only. This is an "okay" solution, but it doesn't solve internal mutability problems. In big open source projects many people are working on the code, intentions must be unambiguous, clear and precise, which they're currently not. Can you automatically ensure that the methods implementing the read-only interface do not mutate the object for sure, even after merging a pull request from an external developer who's not fully aware of your intentions? You can't! You'll have to write proper unit tests and carefully analyze each and every commit! With immutable types you declare your interface methods immutable and you can be 100% sure that any implementation of it trying to mutate the object will fail the compilation. Apart from that, interfaces aren't free, they do have a slight runtime cost due to indirection, so having an option to avoid them for performance reasons while still preserving safety is a good thing!
I always prefer to solve these kinds of problems declaratively. I declare what is mutable/immutable while the compiler does all the dirty work of making sure neither me, nor my coworkers, nor the open source contributors sending in their pull requests shoot themselves in the foot introducing bugs. Isn't this the way compiled languages should make our lives easier?
@networkimprov
As section 3. of the design document clearly states: immutability by default and explicit mutability qualification through mut
is preferable, but would only be possible in a backward-incompatible Go 2.x specification, which is not to be expected any time soon (AFAIK, the "Go 2" they're advertising is rather a Go 1.13+ because the folks at Google aren't big fans of breaking compatibility as it seems).
Naming conventions would break backward-compatibility. Old Go 1.x code could either stop compiling or fail at linting, which is unacceptable. This proposal aspires to preserve backward-compatibility at all cost. There's also a somewhat related question in the FAQ by the way.
You may want to read Russ' evaluation of a read-only slices proposal. It contains a lot of the issues that this proposal should grapple with.
getters/setters are doing just that: they copy stuff from the inner scope of the struct.
But they don't have to. If the inner field is a composite type, the getter can return an opaque type that internally holds a pointer to the composite type and only provides exported read-only getter methods.
Writing setters, getters, cloners just for the sake of ensuring immutability is not only quite tedious but also very error-prone due to pointer aliasing, which is the scariest part actually
I've written several immutable APIs in this way. I absolutely agree that it is tedious, but I personally don't think it was "very error-prone" from the perspective of the API author. Pointer aliasing is not inherently the problem; it is problematic if a pointer to a non-opaque type leaks to the public API. However, I find it relatively straight-forward to review the public API and reason that it doesn't violate immutability.
Can you automatically ensure that the methods implementing the read-only interface do not mutate the object for sure, even after merging a pull request from an external developer who's not fully aware of your intentions? You can't!
Since read-only APIs are usually just getters, they are not terribly complicated such that you would accidentally mutate the object (e.g., it is not hard to review this and reason it is read-only).
In big open source projects many people are working on the code, intentions must be unambiguous, clear and precise, which they're currently not.
An opaque read-only API does make the intention clear. The lack of any setter methods is a clear signal that the user should not (and cannot) mutate anything.
Apart from that, interfaces aren't free, they do have a slight runtime cost due to indirection, so having an option to avoid them for performance reasons
Interfaces are one such implementation, but it doesn't have to be. It can be a concrete type too:
type MutableStruct struct {
Field int
...
}
type ImmutableStruct struct { p *MutableStruct }
func (p ImmutableStruct) GetField() int { return p.p.Field }
There is practically no runtime cost to this as the compiler can inline all the getters as if they were nested field accesses (or slice indexes, map lookups, etc).
I am bringing the technique up not as an end-all alternative to your proposal, but more so to counter the claim that "copies are the only way to achieve immutability ... [which] degrades runtime performance". It is a legitimate approach taken today to address this problem, which the proposal seems to gloss over.
I agree that there are disadvantages to opaque APIs with read-only getters (especially with regard to their tediousness and perhaps the lack of implicit casting), but I think it would help the case of a proposal trying to add immutability to acknowledge techniques done today to work around the problem and show that the benefit of adding immutability outweighs the cost (e.g., complexity in type system and the "const poisoning" mentioned earlier).
Naming conventions would break backward-compatibility.
Above I suggested a go-vet switch to support a naming convention. Such a convention would be optional, permanently. As would a mut
keyword. Lots of folks don't want to code that way.
There has been plenty of discussion about const
-ness over the years, yet the two priorities for Go2 are error handling & generics, and code for them presumably won't land for a couple years (there is no defined schedule as yet).
I love how well done the proposal is. Thank you for your hard work on this.
I have just one thought that I would like to share:
It is stated in the proposal that we can't cast an immutable var to a mutable one, as we could break the immutability. The opposite, casting a mutable var to an immutable one, can be done, which makes sense. However, in this case, we can break immutability. I know that you talk about this in section 4.5. How are constants different from immutable types?, but this could lead to very subtle and unexpected situations.
For example, let's say we have this Slice type:
type Slice []int
func (s *Slice) Add(elem const int) {
*s = append(*s, elem)
}
func (s const Slice) ImmutableVersion() const [] const int {
return s
}
And then we use it like this:
slice := Slice{1,2,3,4}
immutableVersion := slice.ImmutableVersion()
// Now immutableVersion = {1,2,3,4}
slice.Add(5)
// Now immutableVersion = {1,2,3,4,5} It has changed
This behavior could be unexpected and lead to confusion, as you were guaranteed by the type system that the var immutableVersion
was immutable.
This can be even worse with slices as, if the capacity is exceeded, append
will allocate a new underlying array, what means that the immutableVersion
won't be changed. So we don't really know if/when the immutableVersion
will change.
This won't happen if the method ImmutableVersion()
returns a copy.
Don't get me wrong! I love the proposal. I think it is the best one I have seen for immutability and I would like it to come true as soon as possible.
I just wanted to know the general opinion about the case I have posted.
Thanks!
In 2.6. Immutable Interface Methods I am not sure I understand why enforcement of mutability on the interface is important. This seems more like an implementation detail and could severely limit the usefulness of interfaces if abused too much. The answer in 4.7 doesn't make much sense to me.
The difference between C-style const
and the proposed const
Is there any difference between this proposal and the use of const as a type qualifier in C, other than the logical extension to interfaces?
There is! The const
in C is just confusing while the const
in this proposal always targets the type on the right:
goal | Go | C |
---|---|---|
reassignable pointer to writable T | * T |
ะข * |
reassignable pointer to read-only T | * const T |
T const * |
read-only pointer to writable T | const * T |
T * const |
read-only pointer to read-only T | const * const T |
T const * const |
In fact, C-style const
is so confusing that const char *
is the exact same as char const *
. You also can cast const
to non-const
in C, which you can't in this proposal. Please do not compare the const
from C with the proposed const
for Go, we don't wanna do it the horrible C-way, but rather learn from its mistakes!
Const-Poisoning
If by "const-poisoning" you mean the ability to cast immutable types to their mutable counterparts then I've got good news for you: C-style const poisoning is impossible with this proposal.
IV. Mutable types can be cast to their immutable counterparts, but not the other way around.
Terminology
Pedantically, I don't particularly like using the word "immutable"
This proposal is not about functional-programming-style "immutable objects", it's about "immutable types". Immutable objects remain immutable after they're initialized while immutable types are types you can't perform mutations on. "Types" and "objects" are obviously not the same and this proposal doesn't propose immutable objects.
Immutable Reference Types
Do we really have to worry about immutable containers of mutable values? Yes, that comes up once in a while, but it is often enough to make it worth writing
const [] const T
?
Reference types such as pointers, slices and maps shall be no exception in the concept of immutable types (slices and maps are reference types. Yes, they're implemented by a struct but to us users they're opaque). Section 5.2.1. describes why transitive immutability is to be avoided. Basically, it makes the entire concept of immutable types useless when the developer faces a slightly more complex situation like when a reference, such as a pointer, must point to an exact mutable object. But it's the complex situations the developers need the compiler's help most! Transitive immutability will force the developer to throw immutable types out the window because they limit his/her expressiveness making it totally useless.
@ianlancetaylor's 2010 blog post on const: https://www.airs.com/blog/archives/428
He describes const
as compiler-enforced documentation, except for variable definition where it directs the compiler to use read-only memory. (And he discusses const poisoning.)
But there is another way that const can affect generated code; a const function argument can be passed by reference instead of by value, so that a compound object (Go struct or array) isn't copied onto the stack. (Go maps and slice contents are already passed by reference.)
I'd like to see Go support that kind of const.
@romshark Const poisoning refers to what happens when you add const
to one function, and then you have to add const
to every function that it calls, and then you have to add const
to every function that those functions call, etc.
Then sometimes you discover that you have to change some lower function to not use const
, for perfectly valid reasons, and const poisoning refers to the problem of removing const
from the entire call chain.
These aren't made up problems, they are real issues that arise in practice in large programs.
Also, let me ask the comparison with C again: is there any difference between the use of const
in this proposal and the use of const
in C, other than syntax?
@ianlancetaylor
This is the reason Section 2.12. even exists. It's definitely true that once you've got immutable types you need to use them everywhere. This, essentially, is the price you pay for having predictability, clarity, and safety. It's kind of a stopper for the Go 1.x proposal, I agree, but in Go 2.x this must be done with immutable types by default to avoid having ambiguity from the very start and not end up with the Go 1.x problem of having to fix all libraries including the standard one.
I honestly can't imagine what "perfectly valid reasons" you need to have to, for example, make any of the lower functions called by strings.Join()
not use const
for the slice of strings you pass to a
because a
should be guaranteed to not be touched in any way neither by strings.Join()
nor by any of the functions up the stack. And since immut -> mut
casting is inherently forbidden I see no problems here. Once you provide a contract (API) - you either support it or you break it, not silently violate it!
Can you give us an example of when we'd suddenly discover that we actually needed mutable inputs and thus have to "remove immutability"?
There is no semantic difference between the C-style const
and the proposed const
. The proposed const
is a better execution of the C version, but in the end, they both serve a similar purpose:
- make the intentions of the code author clear and reliable.
- protect the code from undesired and unexpected mutations.
Problematic cases happen in large, complex, programs, so there are no small examples. In terms of your proposal, the kind of thing that happens is that you start passing a map around, and there is no reason to change it, so you mark it const
everywhere. Let's say it's a map of names to codes or something. Then later you realize that the names sometimes change, but you can only discover that deep in the call hierarchy. So you have to add some code there to change the map, and you have to remove const
from all the callers. Obviously that is easy to nit pick, it's just an example, and, yes, I've seen this kind of thing happen in real code. In fact in C++ this is where most people reach for const_cast
.
That aside, I note that you haven't replied to my memchr
comment. In that regard you might want to read through #22876, which tries to address that problem through "permission genericity."
@ianlancetaylor
Yes, that is a problem indeed and the "permission genericity" concept proposed by Jonathan Amsterdam in #22876 does look promising (I wonder how I missed that). I'll give it a thought, it probably makes sense to integrate the concept of immutability genericity into this proposal as well.
I appreciate the thought that went into this proposal, however I think immutability is mostly an academic concern. Seeing how complex this proposal is, I'd like to hear of some experience reports where accidental mutation actually caused serious problems in a large Go code base. In my experience, accidental mutation is a relatively rare cause of bugs in programming. Therefore I think it does not warrant the troubles of having to use const constantly, or having to constantly worry about const correctness.
I appreciate the thought that went into this proposal, however I think immutability is mostly an academic concern. Seeing how complex this proposal is, I'd like to hear of some experience reports where accidental mutation actually caused serious problems in a large Go code base. In my experience, accidental mutation is a relatively rare cause of bugs in programming. Therefore I think it does not warrant the troubles of having to use const constantly, or having to constantly worry about const correctness.
I am more than reluctant to introduce more complex solution than the problem you're trying to solve. Go is a simple language to a certain extent, we don't need to copy other language just to make some swing.
I haven't seen any comments from those who've given a ๐, so here's my input.
Personally I don't like the idea behind const
types. It reminds me of C's const
which, IMO, was a disaster that complicated C's otherwise simple type system.
I am aware that it brings in a lot of safety, but it comes at the cost of a lot of readability. I also understand that sometimes sacrifices to readability need to be made to increase safety, but I'm not sure if this is one of them.
In fact in C++ this is where most people reach for
const_cast
.
In the C++ codebases I've maintained, the vast majority of uses of const_cast
were in order to overload const
and non-const
member functions.
That technique is recommended in Effective C++ (and in this StackOverflow answer), but disrecommended in the isocpp.org core guidelines.
That seems to support the theory that const
-parametricity is sufficient to avoid the need for such casts.
I didn't mean to imply it was the only time people used const_cast
.
That does raise the problem of migration paths, though: there are widespread APIs today, such as io.Writer
that accept read-write slices, and a naive approach to const
in the type system would require at least one of:
- workarounds at every call site, such as conversions through
unsafe.Pointer
; - escape hatches for calling existing (Go 1) APIs, such as (unsound) bivariant subtyping; or
- large-scale updates to ~all existing packages.
In contrast, a dynamic analysis (such as the one in #22048, possibly only enforced when the race detector is enabled) would only affect existing APIs that actually perform unexpected writes.
I'm not sure why you're bringing up io.Writer
, @bcmills.
Mutable variables can safely be accepted as const
and the API documentation for io.Writer
itself attempts to convey its immutability requirements.
Languages that get this right start with immutable by default and do a lot of work behind the scenes to make everything seamless and need some syntax sugar to make it easy to use. I love immutability, but I don't think it'd really fit in to Go well.
I do want to provide my position on one of @ianlancetaylor's points, though.
[T]he memchr problem. In C the standard library
memchr
function, which returns a pointer to the first occurence of a character in a string, is defined aschar *memchar(const char *, char)
. The problem is that in terms of types, ifmemchr
is passed aconst char *
it should return aconst char *
, and ifmemchr
is passed achar *
it should return achar *
. That is, it should preserve the const-ness of its first argument.
Let's say A
is const char *
and B
is char *
.
memchr(A, B)
is fine.
memchr(A, A)
and memchr(B, B)
should both be type errors. Each should need an explicit step to transition to/from an immutable copy: something like memchr(A, mutable_copy_of(A))
and memchr(immutable_copy_of(B), B)
.
This is the same as if this were a Go func with the signature func memchr(string, []byte) []byte
where A
is a string
and B
is a []byte
. You can't just call memchr(A, A)
and have it be the same as string(memchr(A, []byte(A)))
.
This inevitably leads to needing (at least) two versions of everything to avoid copying everything dozens of time or not being able to use (im)mutable versions of types because a needed dependency made the choice for you.
Sometimes generics could help with that by letting you write multiple versions simultaneously, but that means making a lot of things generic that otherwise would not need to be, essentially swapping "const poisoning" with "generics poisoning".
Having freeze
/unfreeze
/isFrozen
sidesteps this but adds a dynamic axis to a static type system. Instead of having two versions of each function you have one version that needs to cope with both at runtime.
@romshark I also had bad experience with C++ const poisoning, and it is not due to "bad design/API" but mere new features/requirements added, that need to modify something deep in the call chain, that nobody though should be modified before. And usually these features are needed for "yesterday", I did the only possible thing - const_cast
right on the spot. So @ianlancetaylor concerns are very valid. One thought on all this const correctness thing: We have to declare that if something is marked as const in the current call chain, doesn't mean that it will not change by other part of the program running concurrently where they are not const types. It is just that current call chain can't change it ( unless you remove the const with something along the lines of const_cast
). Yeah, const correctness it is not a simple beast.
@jimmyfrasche I don't quite grasp your comments on memchr
. I probably didn't describe the issue properly. In Go terms, a similar problem arises with bytes.Split
. Suppose we want to add a const
qualifer to bytes.Split
. We might write
func Split(s, sep [] const byte) [][]byte
After all, Split
does not modify the contents of s
or sep
, it just returns slices of s
. But we can't write that function, because the slices of s
will have type [] const byte
. So instead we write
func Split(s, sep [] const byte) [][] const byte
But now we can't use the function if we plan to modify the resulting slices, because even though we pass in a normal []byte
we get back [][] const byte
when we really want [][]byte
.
That is, the memchr
problem is that we want the const
qualifier on the result to be there or not depending on whether the qualifier is there or not on the first input argument. But the type system doesn't give us a way to write that.
@LeonineKing1199
I'm not sure why you're bringing up io.Writer, @bcmills.
Mutable variables can safely be accepted as const and the API documentation for
io.Writer
itself attempts to convey its immutability requirements.
io.Writer
today accepts non-const
slices.
If you attempt to make the argument to io.Writer
a const []byte
in the interface definition, you will break implementations of Write
. that call non-const
functions with that argument (perhaps because they are erroneous, or perhaps because those methods are actually const
but not labeled as such yet).
On the other hand, if you do not make the argument to io.Writer
const
, you will break anyone attempting to call Write
on a const []byte
: even though the documentation says that the slice is logically read-only, the type system does not reflect that fact.
So what you're left with is one of four options:
- Update the entire world to use
const
atomically. (This is essentially impossible.) - Try to build a directed acyclic call graph, and update from the leaves inward. (Likely also impossible.)
- Start with an unenforced
const
, and try to update the world (without backsliding!) until you can turn on enforcement. (Technically plausible but unlikely to complete in finite time.) - Leave a ~permanent loophole (such as
const_cast
, or a โGo 1โ personality that ignoresconst
) in the language, leave the existing Go 1 APIs as-is, and rely on the Go 2 migration process to apply the loophole as needed.
@ianlancetaylor I get your point but I don't think that's a reasonable thing to expect to come baked into an immutability proposal.
What you're asking for is being able to say func Split(s, sep T) []T
where T
can be either []byte
or []const byte
. That's an issue for generics to solve not obscure rules about implicit casting and quantum function signatures.
If you attempt to make the argument to io.Writer a const []byte in the interface definition, you will break implementations of Write. that call non-const functions with that argument (perhaps because they are erroneous, or perhaps because those methods are actually const but not labeled as such yet).
Imo, this is a feature and not a bug. The API docs stress the importance of not mutating the input slice. If functions break this it means a couple of things. One, people will have to update their functions to use const slices which sucks but I think is acceptable. Two, it'll catch genuine breaks in the invariants laid out by io.Writer
.
In terms of preserving backwards compatibility, this would be most easily accomplished in C++ via function overloading (i.e. const
vs non-const
overloads).
Since Go doesn't have that option and prides itself on being opinionated, I think its best bet is to not make io.Writer
take a const
slice but Go can still add const
to the language so new libraries going forward can encode immutability at the type system level.
@romshark it seems clear to me that Go won't admit a const
qualifier, despite all the happy thumbs on your well-crafted document.
Can you come at this from a different angle? Maybe an optional mutability indicator that stops at the package boundary, so that a mutable-aware package can call APIs or be called by programs which lack mutability awareness.
It's not the wholistic system you wanted, but it might draw less anti-aircraft fire :-)
@jimmyfrasche That is a reasonable point, but after all the issue of "the qualifier of the result type ought to be inherited from the qualifier of the input type" really does come up in fairly simple examples. It's a mark against this proposal that it can't handle that case. It would be nice to not have to reach for generics when there are only two possible type arguments. We know from long experience that there are problems with the const
qualifier in C; we shouldn't repeat them merely because they are familiar.
@ianlancetaylor The C/C++ approach is bad. It is not an example to follow. I wouldn't even consider what it does immutability, really. It gives you some minor guarantees but it's ultimately an exercise in const correctness for the sake of const correctness.
You don't really get immutability unless you force copying between values of T
and immutable T
and disallowing writes to values of the latter (unsafe
shenanigans notwithstanding).
My point is that mutable T
and immutable T
are different typesโequally as different as any two distinct types.
They have a lot of similarities, surely, but it's superficial. These types are also superficially similar:
type T struct { X, Y int }
type CT struct { X, Y int }
but a function that accepts and returns T
wouldn't be expected to just work with CT
. You need to introduce a type parameter if you want to write a signature like that. If for no reason other than to keep the relation between the types clear!
The functional languages generally get immutability right. If you want to explore immutability you should look at the various ML/Miranda descendants, not at C or C++. They make all of it work by creating value semantics with a lot of sleight of hand and many heap pointers and many more optimizations to make up for all the duplication and copying and pointer chasing, though. I'm not sure it would work well with Go. I suspect it could not be made to work with Go. That doesn't make me happy because I'd love to have immutable types.
Go is not a functional programming language and I really don't think we should introduce an update which will break every single Go program in any update...
If we should be learning (what to do AND what not to do) about immutability from any programming languages, we should be from programming languages that fill the same purpose that Go does, such as Rust, C-family, etc.
I'm currently trying to solve the qualifier-verbosity problem. Check out Go 2: mutability qualification propagation #20, this idea is to be discussed.
Go is not a functional programming language and I really don't think we should introduce an update which will break every single Go program in any update...
You can add a language feature without mandating an update to existing APIs.
You can add a language feature without mandating an update to existing APIs.
But then we come into the adoption problem like before. We physically can't change the io.Writer
signature without breaking every existing implementation of io.Writer
. Especially if we "learn from " and adopt an "immutability by default" feature, which really would break every current Go program.
I would like to mention however that the memchr
and bytes.Split
problems can be easily resolved by returning indices rather than slices. This is a bit more verbose and doesn't really solve the problem but ๐คทโโ๏ธ
But then we come into the adoption problem like before.
This isn't a real problem. No one has to do anything. Just because Go has goto
doesn't mean we're forced to use it.
If people don't want to adopt immutability, they don't have to.
True, but in this particular issue, the proposal is to make immutability the default in Go 2, which would mean everyone is forced to used it and everyone will have to rewrite their Go 1 programs...
Avoid both const
and incompatibility of mut
:
romshark/Go-1-2-Proposal---Immutability#23
I suggest we should introduce Annotation, knows as compile tags in go, for marking the immutable definations.
- for interface
// here we go
// +immutable
type IMutIfaces {
Func()
}
// now we got the immutability funcs embedded
type Ifaces {
IMutIfaces
AnotherFunc()
// Func() // This SHOULD NOT compile, since we embedded an immutable Func early.
}
- for func
type My struct {
name string
}
// here we go
// +immutable
func (my *My) Name() string {
return my.name
}
type You struct {
My
name string
}
func (you *You) YourName() string {
return you.name
}
// This SHOULD NOT compile, too.
//func (you *You) Name() string {
// return you.name
//}
- for field
We DO NOT need this. You should use un-exported principle if you do not want any one changing it somewhere.
I really don't think that comments should affect the specification of the language, we'd probably want a separate construct for this
Go already has some significant comments. Although it may not have been a good idea to include those...
Technically speaking - the Go language itself does not have significant comments, but the go build
tool will analyze comments to see if it should compile it or not, the go generate
tool will analyze comments to generate code, etc. But these are only tools, and the Go language itself does not recognize build/generation tags.
In terms of @ tags, maybe if they're different than other languages, the typical styles don't fit with Go IMO. Maybe something like type string = []byte@immut
?
The type string = []byte@immut
is as same as the
@immutable
type string = []byte
here. The syntax former is a little stranger to myself.
The issue is that it isn't the same though. That annotation relates to the defined type string
rather than the aliased type.
You're saying "I'm defining an immutable type, string, which is the same as []byte" rather than "I'm defining a type, string, which is the same as an immutable []byte"
Also, that syntax cannot be inlined. So I can make a variable like var list []string@immut
which cannot be done with the prefix-based syntax (remember that the immutability is a property of the type, not the variable, so the tag should be on the type)
@mcspring
Annotations are not only ugly, they also target symbols instead of types and imply transitive immutability, which is not what we want because they'd make mixed-mutability types such as immut * mut T
and immut [][] mut *T
impossible which makes the whole concept limiting and rather useless.
This proposal is about the immutability of types and mutability qualifiers are language keywords used in type expressions.
Think about immutable types as read-only interfaces to potentially shared memory.
Guys, are you sure you want to pollute the language with what is in the FP languages domain? Can't we just make something not intrusive, and analyze later with go vet
.
Re "default immutable", would it make sense to add some sort of a keyword such that you can change the default for a specific file to being immutable? That would be backwards-compatible, but would allow people who want to use it an easy way to opt-in.
@gwd great idea! I filed an issue for that concept in the design doc repo:
romshark/Go-1-2-Proposal---Immutability#23 - Mutability qualifier limited to package
@gwd the overloading of const
proposed in the first revision of this document is now obsolete and we do indeed consider opt-in per-package solutions in the second revision that's currently a WiP.
I'm not yet sure whether the "everything is immutable by default in immutablity-aware packages" strategy is the way to Go because Go was and most likely will remain mutable by default. Immutability by default in immutability-aware packages might cause a lot of confusion (actually I'm not yet sure whether it will or not, but I tend to assume that it will).
The (probably) less confusing way would be to still keep everything mutable by default but make immutability-aware packages provide the mut
and immut
keywords which are not part of the older language specification. You then could enable those keywords using the //go:immutable
compiler flag and when you want a function argument to be immutable for example you'd declare it as func PointerMatrix(a immut [][]*T)
.
When an old immutability-unaware package imports a newer immutability-aware package the immut
and mut
keywords are just ignored, but the compiler could still throw a warning when old code incorrectly uses newer code, like when you're mutating immutable global variables in immutability-unaware packages or similar.
EDIT: see the comment below
But you really ought to try a default-immutable proposal first to see if it flies :-)
@networkimprov imagine the following case (immutability is by default)
we have a new package b which has immutability enabled:
//go:immutable
package b
var C string = "immutable"
we have an old package a which is immutability-unaware:
package a
import "b"
func Func() {
b.C = "new value" // dangerous!
}
Since a is not aware of immutability - the above code will compile, but might throw a warning like:
.a.go:6:6 WARNING: mutating variable of an immutable type immut string
The problem is that it's not obvious for a regular Go programmer, who's not yet familiar with the concept of immutable types, that b.C
must not be mutated! b.C
looks just like a regular mutable string variable. If the code was not immutable by default, like this:
//go:immutable
package b
var C immut string = "immutable"
...then it'd be less confiusing because you'd take the immut
qualifier into account
However, I'm not 100% sure whether or not this is truely confusing, maybe it's not if there's a warning in both the linter and the compiler?
Well if you import a //go:immutable
package, you're writing new code. If you prefer your code not be default-immutable, let keyword immut
tag references to data from the immutable package. (immut
would not apply to local-package data, nor within a default-immutable package.)
// NOTE: revised in a comment below
package a
import "b"
func f() {
v := b.C // OK, copied
p immut := &b.C // OK
var x immut *T // OK
x = p // OK
b.C = T{} // no
*p = T{} // no
p = new(T) // no
x immut := T{} // no
x immut := new(T) // no
}
@networkimprov I just realized, that "immutability by default" might be dangerous! If we elude immut
then older compilers will recognize b.C
as a regular mutable string, which is not good at all! Chances are high you're gonna get yourself in trouble because the author of b
assumed b.C
will never be mutated. Mutating b.C
might produce silent bugs which compile perfectly on older compilers.
It's better to make the immut
keyword obligatory because older compilers will fail as immut
is an unknown keyword from their point of view.
So, finally, it should rather look like this:
//go:immutable
// package b is taking advantage of the new immutability qualifiers
package b
type T struct {
Name immut string
}
var C immut T = T{
Name: "global",
}
// package a doesn't accept the "immut" and "mut" keywords
// but respects immutability of imported packages on newer compilers
package a
import "b"
func f() {
/* local instance */
l := b.T{} // l is of type "mut b.T"
l.Name = "foo" // ERROR: illegal mutation of "immut string"
/* copy */
v := b.C // v is of type "mut b.T"
v.Name = "foo" // ERROR: illegal mutation of "immut string"
v = b.T{Name: "x"} // fine, v is mutable
v.Name = "bar" // ERROR: illegal mutation of "immut string"
/* direct access */
b.C = b.T{} // ERROR: illegal mutation of "immut T"
b.C.Name = "" // ERROR: illegal mutation
/* intermediate pointer */
p := &b.C // p is of type "mut * immut b.T"
p.Name = "baz" // ERROR: illegal mutation of "immut string"
*p = b.T{} // ERROR: illegal mutation of "immut T"
p = new(T) // fine, because the pointer is mutable ("mut * immut b.T")
/* casting */
var x *b.T = p // ERROR: illegal casting "mut * immut b.T" to "mut *b.T"
}
- compiling with an older compiler, which doesn't have a clue about
//go:immutable
: will fail - compiling with a newer compiler, which supports immutable packages: will fail due to illegal mutations and illegal casting
If //go:immutable
doesn't stop a compiler that doesn't recognize it, then maybe
package b
immut
package b immut // alternatively
@networkimprov I like the oneline syntax, but if we scale it it can look bad:
package b immut nogc smthElse
-
I personally think that a compiler flag like
//go:immutable
is okay for a language experiment while a proposal changing the package declaration syntax will almost certainly be rejected for obvious reasons (it's exceptional, it's kinda weird if other parameters get introduced, etc.). -
Also, people are used to a Go where everything's mutable by default and will probably revolt when
var C T
from a newer package andvar C T
from an older one are different types even though they look exactly the same.
If we keep Go mutable by default and just introduce the immut
and mut
keywords through the //go:immutable
immutability experiment, then we can drop the flag in the future, when immut
and mut
become an inherent part of the language specification. We won't however be able to drop the flag if we make those packages immutable by default because if we elude the flag later then old untouched code won't compile!
Comments should never be used to indicate language changes. Go uses comments to give hints to the compiler that you want something to be treated in a certain way, but they should never cause something to fail to compile. (They're also used with cgo, but that's a whole separate thing)
Either way, from a design standpoint, a comment should not affect the language itself in any way. If we do this it'd be much better with package b immut
specifier or similar.
package b immut nogc
looks kinda confusing, hard to read, especially if we consider adding more in the future. Argument-like parenthesis would look better IMHO:
package b (immut, nogc)
Also immut
could be shortened down to mut
for "mutability qualification".
Back to immutable-mutable package integration. I suggested immut
to tag references in a mutable package to data from an immutable package; immut
would not apply to local-package data, nor copies of immutable data.
@romshark then suggested immutable behavior in a default-mutable package, which I'd expect to cause the confusion he's concerned about, in #27975 (comment).
I think the following rules are both consistent and safe:
package m // default mutable
import "i" // default immutable
func f() {
var x * immut i.T // OK
p := &i.D // OK
x = p // OK
v := i.D // OK, copied
v = i.T{} // OK
v.a = 1 // OK
*p = i.T{} // no
p = new(i.T) // no
i.D = i.T{} // no
i.D.a = 1 // no
var x immut i.T // no
}
And we really ought to seek a blessing from the Go Gods to continue this discussion, because we might be wasting our fingerwork :-)
@ianlancetaylor @robpike @griesemer @rsc, any thoughts on package-specific immutability?
More above, and here: romshark/Go-1-2-Proposal---Immutability#23
The laws of immutability must be respected even in the scope of regular immutability-unaware packages, otherwise immutability qualification doesn't solve any problem but creates a whole bunch of new ones when seemingly immutable stuff gets mutated silently from places you'd never expect.
And here's why:
A copy isn't always a deep copy because of pointer aliasing.
package i (immut)
type T {
S string
F immut *T
}
func (r *T) PotentiallyMutate() {}
func (r immut *T) ReadOnly() {}
var G T = T{
F: &T{},
}
package main
import "i"
func main() {
g := i.G // Shallow copy
g.F = nil // ERR: illegal assignment
g.F.F = nil // ERR: illegal assignment
}
i.G.F
is only shallowly copied, thus mutating g.F.F
will mutate G
causing all sorts of trouble, because nobody ever expected G
to be mutated from the outside! Making a deep-copy of i.G.F
is also not an option for 2 reasons:
- we'll decouple the two. Any state changes like a mutation in
i.G.F.S
will not reflect ong
- it's too expensive for recursive structures, especially if we do it over and over again at runtime
Even when we create our own independent instance in the scope of an immutability-unaware package we still must obey the rules:
package main
import "i"
func main() {
m := i.T{F: &i.T{}}
m.ReadOnly() // OK
m.PotentiallyMutate() // OK, because m is mutable
m.F = nil // ERR: illegal assignment
m.F.PotentiallyMutate() // ERR: mutating method on immutable m.F
m.F.ReadOnly() // OK
}
Why? because we didn't make i.T.F
immutable for no reason! There's a reason why we did it and we cannot allow main
to illegaly abuse i.T
in a way the author of i
never intended. Also implementing the "when something originates from an immutable package then obey the rules, otherwise don't"-logic would unnecessarily complicate the compiler.
What must be illegal is: using the immut
keyword in a non-immut-package. The immut
and mut
keywords are only available in immutability-aware packages for backward-compatibility reasons.
var x * immut i.T
The above snippet should not compile in package main
. It could only compile in package main (immut)
It'd sure be great to get feedback from actual Go-team members, but I'm yet to publish the second revision of the design document covering already mentioned criticism:
- const poisoning (immutability genericity),
- verbosity (MQP - mutability qualification propagation),
- const overloading (per-package immutability)
- immutability by default?
As soon as I've covered these issues I'll document them, publish the second revision and ask the Go-team for further feedback. I just don't like wasting anyone's time with ill-conceived ideas and immutability isn't on their hot-fix list as it seems.
@networkimprov Personally I'm not a fan of having the language change on a per-package basis. That's typically the wrong granularity.
@ianlancetaylor currently I know of only 3 possible ways:
- introduce the
mut
andimmut
keywords, potentially breaking any code that uses these names for other symbols keeping Go mutable by default.
- pro: easiest
- con: not so great for backward-compatibility, some older code might break due to name collisions
- con: no experimental phase
- introduce the new keywords on a per-package basis with a new package-arguments syntax:
package p (immut)
- con: requires a new package declaration syntax
- con: no experimental phase
- con: cannot be canceled as easily as 3.
- introduce the new keywords on a per-package basis with a temporary
//go:immut
compiler-flag and call it the "Go immutability experiment". Remove the flag in the future and get to the same situation as 1. but give people time to migrate
- pro: experimental feature with potential to become part of the language
- con: introduces a new kind of compiler flag which rules the package scope and must be present in each file of the package
- If we want immutability by default then 2. and 3. are the only options (except that we can't remove the flag over time, because the code from older Go packages and the newer ones - where everything's immutable by default - is naturally incompatible)
- the
check
keyword proposal takes the 1. approach if I'm not mistaking, so it wouldn't be the only language feature to get introduced this way
We can add new keywords in new language versions where needed. See the discussion in #28221.
Immutability can today already be achieved by using structs with purely value based semantics, avoiding pointers altogether. For example:
package main
import (
"fmt"
)
type CannotMutate struct {
i int
s string
}
func NewCannotMutate(i int, s string) CannotMutate {
return CannotMutate{i, s}
}
func DoesNotMutate(data CannotMutate) CannotMutate {
data.i++
return data
}
func (cnm CannotMutate) String() string {
return fmt.Sprintf("CannotMutate: %d %s", cnm.i, cnm.s)
}
var hiddenState CannotMutate = CannotMutate{42, "Can't touch this"}
func State() CannotMutate {
return hiddenState
}
func main() {
data1 := NewCannotMutate(7, "hello")
fmt.Printf("Data 1: %s\n", data1.String())
data2 := DoesNotMutate(data1)
fmt.Printf("Data 1: %s\n", data1.String())
fmt.Printf("Data 2: %s\n", data2.String())
data3 := State();
fmt.Printf("Data 3: %s\n", data3.String())
fmt.Printf("hiddenState: %s\n", hiddenState.String())
data4 := DoesNotMutate(data3)
fmt.Printf("Data 3: %s\n", data3.String())
fmt.Printf("Data 4: %s\n", data4.String())
fmt.Printf("hiddenState: %s\n", hiddenState.String())
}
Try it here: (https://play.golang.org/p/BJiDNdxk9N_t)
As long as you stick to this style, external packages cannot mutate your package's state, nor can they mutate your data's contents.
An immut
keyword brings a lot of trouble without much additional benefit beyond sticking to a value base API as I show above. Furthermore, seeing the new rules for Go language change proposals (https://blog.golang.org/go2-here-we-come), immutability by default is definitely out of the question, since it would require a full rewrite of almost /all/ existing go code.
Such immutable API is practical and has been used in the wild frequently, see this github search:
https://github.com/search?q=immutable+go
@beoran I'm sorry, but have you even read the problems section? Please read it, it's worth your time :) There were also a lot of discussions you should take into account.
If you still don't want to take your time reading then consider this: how do you want to safely handle slices
then? Just hope the code does what the comments say? Copy the hell out of every slice? This proposal is about compile-time guarantees and clear definitions of APIs, again, please read it!
Immutability-by-default was one of the possible ways, and no, it would not require rewriting any existing code if it's enabled on a per-package basis! I now move away from it though for backward-compatibility and historical reasons. We don't want per-package immutability as it turns out, so we can't use immut-by-def. Also, Go1 is mutable by default, making Go2 the opposite would generate a lot of confusion. Go2 will probably have to remain mutable by default and just provide the mut
and immut
keywords.
@beoran pointers and copies are not interchangeable, there's a reason we have them and there's sometimes a reason to use them in APIs. But it's not only public APIs, it's also package-internal code that often needs to be protected, consider the following example:
Not just public APIs, it's compiler-enforced documentation & clear intentions
package a
type Object struct {
// uniqueID must remain immutable once the Object is created
uniqueID string
}
func NewObject() Object {
return Object{
uniqueID: generateUniqueID(),
}
}
/* lots of code here */
How do you make sure Object.uniqueID
is never changed after the Object
instance is created? You'll have to make sure you never allow something like obj.uniqueID = "foo"
to ever happen in the scope of your package, which might be big & open source, and a lot of people could be working on it pushing their pull requests.
Instead of relying on smart humans (which is an anti-pattern in software engineering), I propose this:
type Object struct {
// Save to be exported
UniqueID immut string
}
You just can't make this mistake, ever. You'll never get faulty pull requests because it won't compile if anyone messed it up.
Slices
// Find guarantees to never write to a
func Find(a []string, s string) int {
return second(a, s)
}
// somewhere in another package, promises not to change b,
// because third promised it as well (sometimes in the past)
func second(b []string, _ string) int {
return third(b)
}
// somewhere in a third package
func third(a []string) {
a[0] = "whoops" // third(a []string) never guaranteed to never change a
}
How do you make sure a
is never changed by Find
? And by "never" I really mean never, not even over time and many many iterations. Writing a comment won't help, a comment is not enforced, it's a pure claim but code might change over time (bugs might get introduced). If we assume those functions are maintained by 3 different people then chances of silently introduced bugs become very high!
Do you really want to write wrappers & interfaces for each and every slice everywhere? Sounds like tons of boilerplate and trouble which could've been avoided with immut
without a single line of additional code:
func third(a immut []string)
- There's no way the maintainer of
third
can break his promise - There's no way the maintainer of
second
can introduce a silent bug becausethird
decided to change its behavior over time because a change froma immut []string
toa []string
is a breaking one.
Copies, copies, copies, boilerplate, boilerplate, boilerplate
Consider the following example:
type T struct {
internal []uint64
}
// Get returns a reference to the internal slice
func (t *T) Get() []uint64 {
return t.internal // shallow copy return
}
We can't do that, we want to avoid T.internal
from being randomly mutated from the outside, it's unexported for a reason, so we'll have to copy it:
// GetCopy returns a copy to the internal slice
func (t *T) GetCopy() []uint64 {
cp := make([]uint64, len(t.internal))
copy(cp, t.internal)
return cp
}
But copying is expensive, how can we optimize our API? Well.. we could implement manual iterating:
// Len returns the length of the internal slice
func (t *T) Len() int {
return len(t.internal)
}
// At returns an item from the internal slice given its index
func (t *T) At(index int) uint64 {
return t.internal[index]
}
Let's look at the benchmark results
goos: linux
goarch: amd64
pkg: test
BenchmarkGet-12 1000000 1326 ns/op
BenchmarkGetCopy-12 300000 5610 ns/op
BenchmarkIndex-12 1000000 2227 ns/op
PASS
ok test 5.338s
https://play.golang.org/p/NNcnq8DhWJV
The iteration interface is certainly much better than copying, but it's still not as fast as it could be, so... should we make internal
exported and rely on our users to never mutate it? It's dangerous, but it's fast, it's a dilema!
A dilemma that could've been solved with:
func (t *T) Get() immut []uint64 {
return t.internal
}
Still want your boilerplate?
I could go on and on... and actually I've already covered all this (and more) in countless discussions (I hope I did). It's all about software engineering, which is what programming becomes when you add time and other programmers.
I think your first example is an example of a bad API design. If it is critical that Object is immutable, then it should be implemented in its own package 'a/object'. This has the additional benefit that it makes testing and code review easier.
As for the slice example, a slice is in essence a pointer value, and so it should not be used at all if immutability is desired. The use of a slice is a hasty optimization here.
Yes, to do immutability in Go now, we need to write a lot of boilerplate. That's a common complaint against go, look at how we don't have enums or other similar conveniences. Generics will probably solve this problem once they get implemented. But for now, go generate, or immutable data structure libraries are feasible and sufficient.
I can agree that as the compiler stands now, a value based API is less performant than a pointer based one. I consider this a compiler and runtime optimization problem. Many functional language compilers only have pure value based semantics and optimize this splendidly. Go should also do that, but that is a different issue.
As I see it the main benefit of the immut keyword would be performance. The immut keyword would tell the compiler to pass data by pointer or slice, but to also give the passed argument value semantics. While I see the appeal of that, I feel it doesn't weigh up against the added complexity of the language.
Particularly, the function argument and result covariance problem leads to having to either having write everything function twice, once with mutable and once with immutable arguments and results, or having to do mutable/immutable casts all over the place, much like with const in C.
I think I can see where you are coming from, but my experience with immutability is different from yours. I think we will have to agree to disagree, and let the go authors weigh the arguments and decide.
Nice document; clearly you spent a while on it. I only briefly glanced over it.
Copies are the only way to achieve immutability in Go 1.x, but copies inevitably degrade runtime performance. This dilemma encourages Go 1.x developers to either write unsafe mutable APIs when targeting optimal runtime performance or safe but slow and copy-code bloated ones.
Not exactly the only way. An alternative approach is to have an opaque type with only exported methods that provide read-only access, which is how
reflect.Type
achieves immutability. The v2 protobuf reflection API also takes this approach. It has other downsides (like needing to manually create methods for each read-only operation), but pointing out that there are non-copy approaches to immutability.
Well, I guess this does not quite work for nested protos
I would like to point out one more use of immutable type: as map keys.
Among all the comparable types available in go, there are only two variable-length types: interface{}
and string
, where string
is essentially an immutable []byte
. The only way to use variable-length slices of arbitrary value type as a map key is to convert them into string
. Immutable types allow us to replace the byte
in string
with arbitrary value types and use pointers instead of value types in map keys.
I would like to point out one more use of immutable type: as map keys.
While this proposal uses the term "immutable", it is more accurately describing a "read-only" view of formerly mutable values.
On the other hand, map keys need true immutability, which is a more restrictive model. Not only do we need the map key to be "read-only", but we also need to guarantee globally that there are no mutable views of that value (lest the value changes due to remote side-effects, causing all sorts of buggy map behavior).
Currently, slice/map/function values can't be used as map keys is not because they are not immutable, it is just because different people have different views on how these values should be compared. To avoid the confusions caused by different views, Go forbids comparing them.
Just wanted to say that this would be an amazing addition to Golang, and mention that Microsoft is in the races with their new open-source Project Verona language focused on memory safety (and is highly influenced by RUST).
I definitely would prefer immutable by default as Golang is function first and it's difficult to enforce developers to declare runtime constants as readonly
or immutible
. I personally really like how Rust does it, let mut ...
.
I would be curious to see a meta-analysis on immutable defaults with regard to the following topics:
- Would declaring
mut
in every mutable variable declaration cause more code bloat than accessors across current popular repos? - Would introducing
mut
cause a cognitive burden on understanding the language?
@pbarker That's how the Go compiler would act if:
- the Go type system was immutable by default
- Go would allow safe implicit casting (safe only)
- Go would allow mixed-mutability types
https://gist.github.com/romshark/5d4650d837c1d87ef237e68ca1408280
IMHO we'd get easier debugging and reading by trading off:
- language simplicity (e.g. most people will struggle to understand the difference between
mut *T
,* mut T
andmut * mut T
, etc.) - the ease of writing
- and probably the ease of refactoring too (making code more reliable usually also makes it less flexible)
how about const
? oh wait ๐คฃ
something that works with :=
is probably necessary, so maybe:
const a, var b, const c := get3Results();
๐คฃ
HONESTLY, I would say something like OCaml would be nice, like a quote after the variable:
a', b, c' := get3Results();
if the variable has ' after it, it's immutable...it's also important to do this in function parameters (JS and TypeScript do not have this to my knowledge):
func HasImmutableParams(a string, b' bool, c int){
// b is immutable
}
how about
const
?
We already came to the conclusion that the const
keyword shouldn't be overloaded.
I would say something like OCaml would be nice, like a quote after the variable:
a', b, c' := get3Results();if the variable has ' after it, it's immutable...it's also important to do this in function parameters
This proposal is about read-only types, not about immutable variables. I already explained why I find the type-based approach better than the variable-based one.
I don't like the idea of immut
and mut
keywords.
Why not use the existing 'readonly' type-syntax that already applies to channels?
(<-
)
IE.
//You can assign read/write values to readonly types.
//(like with channels)
var readonly <-int = 5
readonly = 2
//However, if the type is a pointer type or struct, slice or map
//then it is a compiler error to mutate/write the underlying value.
var slice <-[]int = []int{1, 2, 3}
fmt.Println(slice) //Allowed
slice[0] = 2 //COMPILE ERROR
var pointer <-*int = new(int)
pointer = new(int) //allowed
*pointer = 3 //COMPILE ERROR
type Something struct { Value <-*int }
var thing Something
thing.Value = new(int) //allowed
*thing.Value = 3 //COMPILE ERROR
var readOnlyThing <-Something
//allowed
readOnlyThing = Something{
Value: new(int),
}
readOnlyThing.Value = new(int) //COMPILE ERROR
@Splizard what if you want an immutable slice of pointers to mutable objects though?
IMHO, <-
syntax looks very confusing:
/* Immutable slice of pointers to mutable objects */
<-[]* T<-
immut []* mut T
/* Mutable pointer to an immutable object */
*<- <-T
mut * immut T
* <-T
* immut T
@romshark <-
indicates that the semantic region of memory for a value is read only.
For a slice, this is the elements of that slice.
For a pointer, this is the value being pointed to.
For a map this is the values inside that map.
For a struct, this is the fields of that struct.
Etc
However, any pointers inside immutable types are still ordinary typed pointers. <-
isn't recursive and needs to be added to any type with pointer semantics in order for the underlying value of that type to be made immutable.
//ie this is allowed
var slice = <-[]*int{new(int)}
*slice[0] = 3
pointer := slice[0]
*pointer = 5
//but this is not allowed
slice[0] = new(int) //COMPILE ERROR
//in order to prevent the underlying value of
//the pointers from being
//modified then you need to create a
//read only slice with read only int pointers.
var safeslice = <-[]<-*int{new(int)}
*safeslice[0] = 3 //COMPILE ERROR
@Splizard we're going to run into issues because of the existing read-only and write-only channel declaration syntax.
According to this proposal, the following statement declares an immutable channel:
c := make(immut chan int, 1)
var r immut <-chan int = c
var w immut chan<- int = c
c = nil // Illegal assignment on read-only type
r = nil // Illegal assignment on read-only type
w = nil // Illegal assignment on read-only type
There's an obvious semantic conflict in case of the <-chan
and chan<-
syntax since var c <-chan int
would declare a mutable variable. Alternative syntax like var c <-<-chan int
would be very confusing and far from ideal IMHO.
Specifying mutability on the variable like this: var <-c <-chan int
is not supported by this proposal since I propose to define mutability on the data types.
To avoid overly verbose declarations such as var s immut chan immut * immut T
, this issue suggest propagating mutability qualification in the type definition: var s immut chan *T
.
@romshark
I'm confused.
This proposal is about read-only types, not about immutable variables. I already explained why I find the type-based approach better than the variable-based one.
Why is assigning nil to a variable with type immut chan int
illegal?
For example, strings are immutable in Go and this is completely valid:
c := string("hello")
c = ""
I mean a string
is almost an alias of <-[]byte
.
I don't see a semantic conflict with channels, they already have a read-only 'immutable' type equivalent. <-<- chan int
is the same type as <-chan int
because the type is already read only.
You don't have to like this notation but I don't see why there should be multiple ways in Go to declare a type to be read-only/immutable.
@Splizard While I like your proposed syntax for its consistency, it produces an ambiguity:
var chanOfSlices chan<-[]Type = nil
Is this (chan<-)([]Type)
or chan (<-[]Type)
?
Why is assigning nil to a variable with type
immut chan int
illegal?
Take a look at this example:
type S struct {
ID immut int
Name string
}
s := S{42, "foo"}
s.ID = 43 // no!
s.Name = "bar" // fine
s = S{43, "bar"} // fine
var s2 immut S = S{42, "foo"}
s2.ID = 43 // no!
s2.Name = "bar" // no!
s2 = S{43, "bar"} // no!
immut
makes s2
read-only.
For example, strings are immutable in Go and this is completely valid.
I mean astring
is almost an alias of<-[]byte
.
I initially picked the wrong title for this proposal. We're rather talking about read-only, not immutable types.
A variable or field of type string
is still assignable/writable, the underlying data, however, is read-only. const string
is somewhat closer to immut string
with the only difference being that contstants must be known at compile-time while immut
types are just read-only once they're initialized.
@deanveloper
No, this ambiguity already exists in Go, try creating a channel of read-only channels.
You need to use parentheses.
chan (<-chan int)
vs chan <-chan int
@romshark
I suppose I should create a new proposal, as I reject the idea that a type can restrict a variable from being reassigned. Variables are reassignable by definition. What s2 = S{43, "bar"} // no!
shows, is that immut
changes the meaning of the variable and it is no longer a variable, it is a const
pointer to a read-only type. IE const s2 <-*S = &S{43, "bar"}
@Splizard If immut
wouldn't apply to variables then we'd lose a crucial feature of this proposal because a very common use case would be package-level read-only variables:
package something
var ErrSomethingWentWrong immut error = errors.New("something went wrong")
var Options = immut []string {"foo", "bar", "baz"}
var Dict = immut map[string]string {"foo": "bar", "baz": "faz"}
var DefaultLogger immut *Log = &Log{os.Stdout}
// etc.
Variables are reassignable by definition
No, not necessarily. Functional programming languages usually don't even feature reassignment. A variable is a placeholder for values unknown at compile-time.
it is a
const
pointer to a read-only type,
A constant's value must be known at compile time. A variable of immut
type isn't required to be known at compile-time.
Hi,
I am in favor of adding only *const T
which is a pointer type that can only read target data, no writing. This does not mean target data is constant, just that target cannot be changed by read-only pointer. No immutability for regular data (including pointers). Declarations are like
var x int
p := &x // regular pointer
r := &const x // read-only pointer
var s *const int
s = &x // ok
p = r // forbidden without a cast
r = p // ok
// they should have different reflection kinds. design document does not talk about reflection.
*p++
*r++ // forbidden
var z struct {
A int
B string
}
r := &const z
r.B += "abc" // forbidden
func (r *const Object) Some() { // read-only method of type
Thanks..
@jfcg 3 issues:
- How do you define read-only package-scope variables?
- How do you define read-only struct fields?
- How do you make maps and slices read-only?
If those issues remain unanswered then this should be a different proposal because it serves a different purpose.
* How do you define read-only package-scope variables? * How do you define read-only struct fields?
expose them with read-only pointers
* How do you make maps and slices read-only?
similarly for slices, []const T
could mean read-only slice, cannot be used to write to underlying data.
for maps, maybe map[K]const T
could mean the same thing. In both cases expansion/deletion/overwrite could be forbidden.
Or, slice expansion could be allowed if there is capacity. For map, map[const K]const T
could mean not-expandbale as well.
@jfcg If, and only if *const int
is similar to const int
in the sense that it cannot be mutated and assigned to then it'd be an interesting alternative to consider.
However, I see a potential problem as it'll lead to pointer abuse and cause increased number of allocations and hence generate more pressure on the garbage collector since objects referred to by a pointer escaping the function scope will be allocated on the heap. This contradicts one of the goals of this proposal: improve performance by avoiding unnecessary copying.
Also, it'd be impossible to have mixed-mutability types, which, I fear might lead to problems.
@jfcg If, and only if
*const int
is similar toconst int
in the sense that it cannot be mutated and assigned to then it'd be an interesting alternative to consider.However, I see a potential problem as it'll lead to pointer abuse and cause increased number of allocations and hence generate more pressure on the garbage collector since objects referred to by a pointer escaping the function scope will be allocated on the heap. This contradicts one of the goals of this proposal: improve performance by avoiding unnecessary copying.
Also, it'd be impossible to have mixed-mutability types, which, I fear this might lead to problems.
I think an example is better:
package mypkg
type ConfVar struct {
A int
B string
}
var pkgVar ConfVar
func PkgVar() *const ConfVar {
return &const pkgVar
}
So, there is no global read-only pointer to mess with and no allocation. mypkg
can modify pkgVar
and outsiders have fast & read-only access to that package-level variable by one call to PkgVar()
. You can write similar methods for allowing read-only access to private struct fields. Can you elaborate what you mean with examples?
@jfcg let's say you want a factory that returns objects with an immutable field:
type Object struct { ID *const uint64 }
type Factory struct { counter uint64 }
func (f *Factory) NewObject() *Object {
f.counter++
id := f.counter // Will be allocated
return &Object {ID: &const id} // Pointer escapes function scope
}
This approach leads to a performance penalty.
The read-only type approach doesn't:
type Object struct { ID immut uint64 }
type Factory struct { counter uint64 }
func (f *Factory) NewObject() *Object {
f.counter++
return &Object {ID: f.counter} // No pointers, no allocations
}
This is an oversimplified example. I know that we could just return *const
instead, but we might want the object to be mutable with only the Object.ID
field being immutable.
package mypkg type ConfVar struct { A int B string } var pkgVar ConfVar func PkgVar() *const ConfVar { return &const pkgVar }
Second issue: pkgVar
remains mutable within the package, which might not be desirable.
package mypkg
type ConfVar struct {
A int
B string
}
var PkgVar = immut ConfVar{
A: 42,
B: "foo",
}
Read-only types would guaranteemypkg.PkgVar
to be immutable since it cannot be written to from both within and from outside of the package.
type Object struct { ID immut uint64 } type Factory struct { counter uint64 } func (f *Factory) NewObject() *Object { f.counter++ return &Object {ID: f.counter} // No pointers, no allocations }This is an oversimplified example. I know that we could just return
*const
instead, but we might want the object to be mutable with only theObject.ID
field being immutable.
This case (or similar ones) does not even need read-only pointers, let alone full blown immutability:
type Object struct {
id uint64
Data string
}
func (o *Object) ID() uint64 {
return o.id
}
Methods are perfectly fine and even they can be optimized out by compilers.
Second issue:
pkgVar
remains mutable within the package, which might not be desirable.
It is actually by design (for example). Only the package can update its confguration variables, sounds pretty straightforward to me.
package mypkg type ConfVar struct { A int B string } var PkgVar = immut ConfVar{ A: 42, B: "foo", }Read-only types would guarantee
mypkg.PkgVar
to be immutable since it cannot be written to from both within and from outside of the package.
What advantages do always-immutable variables have over typed constants?
Can you think of a real example where full blown immutability delivers undisputed advantage that struct methods, functions and read-only pointers cannot?
Was this issue addressed by someone from the core language team? I noticed that this is a 3 years old issue and in some way I would like to know if it is worth it to keep me hyped for constant custom datatypes. Having something readonly sometimes makes a huge difference in programming and also helps you at debugging very much.