We need a process for handling registries, APIs and other 'enumerations'

Question

We need a process for handling registries, APIs and other 'enumerations'

Closed this issue 4 years ago · 127 comments

Currently mixed into #79, but the 'Registry' question is much more limited and we should consider attacking it independently.

The AB has a discussion page on Registries at https://www.w3.org/wiki/Registries

Answer 1 · 2018-02-21T17:52:46.000Z

In general there are other items: vocabularies, accessibility mappings that could fit with registries as well.

Answer 2 · 2018-02-21T17:53:53.000Z

Leaving in the AB hands for now, they can point out what the process consequences are when they work it out.

Answer 3 · 2018-03-14T04:28:40.000Z

https://www.w3.org/wiki/Registries is a discussion

Answer 4 · 2018-03-14T04:29:48.000Z

"registries" suggests the rather narrow IANA-like operation, and the requests include API etc.

Answer 5 · 2018-03-16T21:26:32.000Z

improved discussion at https://www.w3.org/wiki/Repositories

Answer 6 · 2018-03-19T17:17:40.000Z

Is there a license that applies to the items in the registry?

Answer 7 · 2018-03-19T17:24:57.000Z

On Mar 19, 2018, at 10:17 , Virginia Fournier ***@***.***> wrote: Which IP rules would apply to the items in the registry?

If you look at the analysis, a lot depends on what the status of the code-points is vis-a-vis obligation to recognize or implement. MP4, for example, merely documents to avoid collision and duplication; so the IPR requirements are those of the referenced spec. (or company). Some IANA code-points are effectively mandatory — so the IPR requirement would need to be W3C or W3C-conformant, i.e. RF. If people think this covers the landscape, we can look into formulating the rules, boilerplate text, and so on.

Answer 8 · 2018-07-26T16:52:41.000Z

An example is the Media Source Extensions Byte Stream Format Registry, which maintains a mapping between MIME-type/subtype pairs and byte stream format specifications for use with MSE.

Answer 9 · 2018-07-27T07:37:22.000Z

Another example: the EPSG registry, which is the authoritative source of coordinate reference system definitions. The OGC maintains a shadow copy of this registry.

Answer 10 · 2018-07-27T15:43:36.000Z

Here's yet another example: the Open Metadata Registry, which has some very well thought-out features for versioning and change control.

Answer 11 · 2018-07-27T16:59:03.000Z

and another

https://www.w3.org/2011/07/regreq.html

Answer 12 · 2018-07-30T09:40:56.000Z

The stats registry in https://w3.github.io/webrtc-stats/ is one example of a spec that tries to perform the job of a registry. Somewhat doubtful - we still want the stats' behavior documented and published too.

Answer 13 · 2018-07-30T18:29:29.000Z

Note also

https://www.w3.org/2011/07/regreq.html

TTML uses:

https://www.w3.org/wiki/TTML/RoleRegistry
https://www.w3.org/wiki/TTML/ItemNameRegistry

And perhaps more importantly the media type registry with short form
profile designators for TTML is at:

https://www.w3.org/TR/ttml-profile-registry/

there is an XPointer registry:
https://www.w3.org/2005/04/xpointer-schemes/
It has an ad-hoc script for adding entries:
https://www.w3.org/2005/04/xpointer-schemes/0register
We don't know if that is just supposed to deposit email in someone's mbox
and whether that person knows of their mandate.

Answer 14 · 2018-10-24T12:18:53.000Z

I'd like to mention the UK Government Registers project - https://www.registers.service.gov.uk/

It provides an authoritative - and non-revocable - list of "things".

See user stories at https://gds.blog.gov.uk/2015/09/01/registers-authoritative-lists-you-can-trust/ and https://gds.blog.gov.uk/2015/10/13/the-characteristics-of-a-register/

Answer 15 · 2019-03-12T16:12:59.000Z

Looks like webauth has some, see w3c/webauthn#1177

Answer 16 · 2019-03-14T17:26:14.000Z

in case anybody is interested to read a bit about how registries are usually managed, what is managed, and how to do it, https://tools.ietf.org/html/draft-wilde-registries might be interesting to read. it's expired but still up to date, and if anybody has feedback or input for that document, i would be delighted to hear about that.

Answer 17 · 2019-03-14T17:31:02.000Z

as a second possibly useful resource, https://github.com/dret/RegMan/blob/master/W3C.md has a list of current W3C specs that ideally should use registries, but use a variety of other ways to do it because W3C doesn't currently support registries.
there are probably more specs where having a registry would have been beneficial, but people did not do it because there currently is no culture or support for doing it at the W3C.

Answer 18 · 2019-04-23T08:36:52.000Z

i have also posted this over at #79, but this here may be the more specific issue: i just updated the draft of my "the use of registries" document, and i have added a list of w3c specifications that currently are using some shape or form of a "registry", but are doing so in a rather ad-hoc session because there is no process or even guidance (afaict). here is a direct link to the section listing W3C specifications: https://tools.ietf.org/html/draft-wilde-registries-02#appendix-A

Answer 19 · 2019-05-11T00:15:25.000Z

Initial suggested requirements at https://www.w3.org/wiki/Repositories#Recommendation

Answer 20 · 2019-05-12T21:39:06.000Z

On 2019-05-10 17:15, David Singer wrote: Initial suggested requirements at https://www.w3.org/wiki/Repositories#Recommendation

what's the best way to comment/discuss a variety of the documented requirements? i just want to make sure this is done in a way that works best. https://tools.ietf.org/html/draft-wilde-registries has a bit more elaborate model of registries. it could be one idea to simply use that one, and i'd be more than happy to adjust where it doesn't seem to fit.

Answer 21 · 2019-05-12T22:45:25.000Z

comments here in the issue are fine

Answer 22 · 2019-05-12T23:11:23.000Z

"A registry is a table that documents logically independent 'atoms'; conceptually a table with independent rows" -> it also is a set of rules how to manage that table. often the table also has columns with specific meaning, such as timestamps, author info, or a "deprecated" label.
"Registries are purely documentational and contain no requirements." -> in most cases they do. most require the meaning of entries to be kept stable over time. most have additional requirements in terms of adding/changing/deprecating/removing entries.
" hosted in a way that preserves history (e.g. Git, a Wiki)" -> it's rather different to keep an edit history of a web page that represents a registry (such as a wiki), or to keep a history of the actual registry changes. it may make sense to require the latter, so that for example machinery can be built around managing changes. in this day and age, one might even go as far as requiring an API that makes both the content as well as the history machine-readable.
in the "The registry:" section, maybe add the API, a published/machine-readable history of the registry actions, possibly provide an outline of standard columns (value, description, timestamp, deprecation, reference to definition, ...)
it might make sense to provide a "menu" of registry policies that w3c groups can choose from, so that they don't have to write their own. they may be allowed to write their own, but in most cases, they's probably happily choose from a small set of standard choices.

Answer 23 · 2019-05-12T23:14:27.000Z

to be honest, i am a bit confused by the title of this issue: "We need a process for handling registries, APIs and other 'enumerations'"
i completely get the need for registries. how do APIs factor into this, and how are they related or something similar to registries?

Answer 24 · 2019-05-13T15:57:30.000Z

We have a few APIs at the W3C that are 'mapping' APIs, and every time there is a new feature in the spec. they are mapping, a new matching 'API' has to be inserted in the API registry, as I understand. But I'd like someone to confirm (I don't manage such an API myself).

"A registry is a table that documents logically independent 'atoms'; conceptually a table with independent rows" -> it also is a set of rules how to manage that table. often the table also has columns with specific meaning, such as timestamps, author info, or a "deprecated" label.

Thank you, added "** the rules for values (logically, the column values) in each entry (e.g. uniqueness, matching to a value from some other specification or registry, etc.)"

"Registries are purely documentational and contain no requirements." -> in most cases they do. most require the meaning of entries to be kept stable over time. most have additional requirements in terms of adding/changing/deprecating/removing entries.

Right, and we're saying that those rules are in the document, not (solely) in the registry, so that the rules get the review to the level required for the referencing document.

" hosted in a way that preserves history (e.g. Git, a Wiki)" -> it's rather different to keep an edit history of a web page that represents a registry (such as a wiki), or to keep a history of the actual registry changes. it may make sense to require the latter, so that for example machinery can be built around managing changes. in this day and age, one might even go as far as requiring an API that makes both the content as well as the history machine-readable.

Most Wikis I know keep version history, and obviously Git does. Do we need to establish that a record is kept directly of requests, as well?

in the "The registry:" section, maybe add the API, a published/machine-readable history of the registry actions, possibly provide an outline of standard columns (value, description, timestamp, deprecation, reference to definition, ...)

add what API?
added "* managed such that registration history (requests and actions) are archived (e.g. a W3C mailing list archive, pull request history, etc.)"

it might make sense to provide a "menu" of registry policies that w3c groups can choose from, so that they don't have to write their own. they may be allowed to write their own, but in most cases, they's probably happily choose from a small set of standard choices.

yes, maybe, but at this level I am trying to tease out what the rules and recommendations are.

Answer 25 · 2019-05-13T21:21:10.000Z

On 2019-05-13 08:57, David Singer wrote: We have a few APIs at the W3C that are 'mapping' APIs, and every time there is a new feature in the spec. they are mapping, a new matching 'API' has to be inserted in the API registry, as I understand. But I'd like someone to confirm (I don't manage such an API myself).

i am still not quite understanding, but it seems to be sufficiently different from registries to maybe need a different support mechanism?

* "A registry is a table that documents logically independent 'atoms'; conceptually a table with independent rows" -> it also is a set of rules how to manage that table. often the table also has columns with specific meaning, such as timestamps, author info, or a "deprecated" label. Thank you, added "** the rules for values (logically, the column values) in each entry (e.g. uniqueness, matching to a value from some other specification or registry, etc.)"

with rules i was more referring to the way how entries are added, updated, and removed, and how those rules may directly or directly result in populating the table cells.

* "Registries are purely documentational and contain no requirements." -> in most cases they do. most require the meaning of entries to be kept stable over time. most have additional requirements in terms of adding/changing/deprecating/removing entries. Right, and we're saying that those rules are in the document, not (solely) in the registry, so that the rules get the review to the level required for the referencing document.

true. but the point is: if there is a requirement for some process, you cannot just go ahead and change the document/registry. you have to follow the process, wherever/however that is defined.

* " hosted in a way that preserves history (e.g. Git, a Wiki)" -> it's rather different to keep an edit history of a web page that represents a registry (such as a wiki), or to keep a history of the actual registry changes. it may make sense to require the latter, so that for example machinery can be built around managing changes. in this day and age, one might even go as far as requiring an API that makes both the content as well as the history machine-readable. Most Wikis I know keep version history, and obviously Git does. Do we need to establish that a record is kept directly of requests, as well?

i think so, personally. i wouldn't be interested in the edit history of the publishing mechanism of the registry. i'd much rather have a clear and well-defined history of the relevant changes to the registry *content* (as opposed to the registry representation).

* in the "The registry:" section, maybe add the API, a published/machine-readable history of the registry actions, possibly provide an outline of standard columns (value, description, timestamp, deprecation, reference to definition, ...) add what API?

the API for a registry. given where we are today, i think it really would make sense to say that at least each registry must be accessible through a well-defined API, and ideally its history should be part of that history as well. i think it would be ok to have that as a read-only API (i.e., no registry updates through the API), but it would be a wasted opportunity to not make newly designed registries machine-readable.

added "* managed such that registration history (requests and actions) are archived (e.g. a W3C mailing list archive, pull request history, etc.)"

good, and potentially/ideally not just in a way that's accessible for humans.

* it might make sense to provide a "menu" of registry policies that w3c groups can choose from, so that they don't have to write their own. they may be allowed to write their own, but in most cases, they's probably happily choose from a small set of standard choices. yes, maybe, but at this level I am trying to tease out what the rules and recommendations are.

ok. then we can table this until things develop a bit further.

Answer 26 · 2019-05-21T20:36:30.000Z

Straw man proposal:

Registries are developed through the normal standards track, and are published on /TR just like other technical reports.
Registries are defined as part of a specification that defines an extensible/updatable table of items. A single REC-track specification can contain have multiple of these (i.e. zero or more).
The section defining the registry must a) state that it is a Registry per the W3C Process, b) define the fields of its table of items, c) define the method and criteria by which changes are proposed and incorporated.
Changes to the registry other than adding/removing/updating entries in the registry go through the normal specification change process.
However, adding/removing/updating entries in the registry can be done through a lightweight process similar to how we handle editorial changes.

Answer 27 · 2019-05-21T21:16:35.000Z

Hm, I think Elika's suggestions at #168 (comment) and mine at #168 (comment) are very similar...

Answer 28 · 2019-05-21T21:22:34.000Z

I think this is a good starting point and agree with all but possibly the first point. I wonder about the use case of a CG owning a registry .... Or maybe it’s OK To insist that if a group wants an authoritative registry in /TR, they need to go through at least a streamlined version of Charter to ensure there is consensus to let them.

Answer 29 · 2019-05-21T21:46:10.000Z

@michaelchampion Community Groups can publish Community Group Reports of whatever they want; afaict if they want to make a registry of stuff there's no reason they can't do so right now.

Answer 30 · 2019-05-21T21:49:06.000Z

@dwsinger I think the main difference is that mine just inlines the registry into the /TR publication so that the questions of where and how it's hosted and what archiving and update mechanisms are involved are already solved. ;)

Answer 31 · 2019-05-21T21:56:50.000Z

Update mechanics are important, and this is something that is better defined in @dwsinger's proposal although it would probably help to have some concrete working examples.

Answer 32 · 2019-05-21T22:15:27.000Z

I think the only difference is minor, and we should merge when we resolve this:

I propose a 'W3C Registries' page which links to all registries managed under the policy, and they in turn link back to their enclosing document (Rec, CG Report, Note, whatever);
Elika proposes that the Registries be in /TR

Answer 33 · 2019-05-22T05:38:04.000Z

A working registry that versions updates of each data element in the registry can be seen here. Notice also that each element has a status, such as "added" "published" "deprecated". This registry is already being used for large metadata sets.

I think that the broad definition of registries that seems to be being used here is going to be an impediment to development. Registration of ontologies will have different requirements from registration of datasets and from registration of data elements. I see a mixture of assumptions in the answers and it seems we are not always talking about the same thing. Can we narrow / split our field so that the discussion is more focused?

Answer 34 · 2019-05-22T11:41:19.000Z

I +1'ed to a few of @fantasai's comments which I believe are similar to some of @dwsinger's.

Items going into the registry should have been discussed as part of the document's progression through the Rec Track; so by the time a document is ready to move to the final stages of the Process the adding of items to the registry should have already been discussed. Therefore a lightweight method of adding items to a registry sounds sensible.

Answer 35 · 2019-05-22T12:43:47.000Z

Registries are defined as part of a specification that defines an extensible/updatable table of items.

@fantasai are you suggesting that some subsections of Recs are actually inlined data sourced from other sources which can be updated in-place? So I might look at a spec one day, and see it say some text, and then the next day it shows something different?

Inlining seems both convenient for quick reading and also inconvenient because the updatable content might not be super-obviously subject to change. If someone references a dated version of a spec containing a registry they probably don't expect its contents to change. Requiring that a link is traversed in order to find the "current" version of a registry avoids this.

On balance I'd tend to avoid inlining registry content within Recs for this reason.

Another feature of a registry that would be useful is a change notification service, analogous to (or actually?) an RSS feed for that registry, that can be subscribed to so interested parties can make their own updates, e.g. to implementations, in a timely way.

Answer 36 · 2019-05-22T12:47:32.000Z

On balance I'd tend to avoid inlining registry content within Recs for this reason.

I agree with @nigelmegitt that inlining registry content could lead to some difficulties. I think registries need to be easily updatable.

Answer 37 · 2019-05-22T13:05:17.000Z

On 2019-05-21 22:36, fantasai wrote: Straw man proposal: * Registries are developed through the normal standards track, and are published on /TR just like other technical reports.

does that refer to the spec establishing the registry, or the registry itself? IETF has RFC 8126 which defines how to establish registries, and an RFC one will be published in the regular RFC stream. the registry itself though is a different thing and available elsewhere. it seems that this setup makes sense, but i am not sure whether that's what you mean here.

* Registries are defined as part of a specification that defines an extensible/updatable table of items. A single REC-track specification can contain have multiple of these (i.e. zero or more).

that's what RFC 8126 defines in the IETF space. it would probably take an equivalent REC to specify how a REC has to establish a registry.

* The section defining the registry must a) state that it is a Registry per the W3C Process, b) define the fields of its table of items, c) define the method and criteria by which changes are proposed and incorporated.

https://tools.ietf.org/html/draft-wilde-registries-02#section-6 has a few more things that might be worth considering.

* Changes to the registry other than adding/removing/updating entries in the registry go through the normal specification change process.

that's an interesting one. i am not sure IETF even has a process for changing a registry. once it is established, it is supposed to work as specified.

* However, adding/removing/updating entries in the registry can be done through a lightweight process similar to how we handle editorial changes.

it might be good to keep a history of the registry, or even a history of individual entries. additionally, it might be a good idea to make sure that there is an API for a registry, i.e. that there is a way how to get a machine-readable version of at least the registry contents, and potentially the history as well.

Answer 38 · 2019-05-22T13:12:18.000Z

On 2019-05-21 23:49, fantasai wrote: @dwsinger <https://github.com/dwsinger> I think the main difference is that mine just inlines the registry into the /TR publication so that the questions of where and how it's hosted and what archiving and update mechanisms are involved are already solved. ;)

now i understand: the registry *is* a TR, not just the document establishing it. just as food for thought: IETF has 2000+ registries (with of course far more values in them), with quite a number of updates happening. looking at this, maybe treating every update of a registry as something that triggers a published TR update might become relatively noisy. at a more abstract level, i would argue that specifications and registries are different things, and it makes sense to manage and publish them separately. but that's probably mostly a matter of opinion.

Answer 39 · 2019-05-22T13:15:03.000Z

On 2019-05-21 23:56, chaals wrote: Update mechanics are important, and this is something that is better defined in @dwsinger's proposal <https://www.w3.org/wiki/Repositories#Recommendation> although it would probably help to have some concrete working examples.

for a more extensive list of things to consider, https://tools.ietf.org/html/draft-wilde-registries-02#section-6 has some food for thought. typically, registries operate in a framework given by general registry mechanics (which are established by a spec defining the registry mechanism). for example, a specific registry can define whether it allows to remove values, or not.

Answer 40 · 2019-05-22T13:19:56.000Z

On 2019-05-22 13:41, thisNatasha wrote: Items going into the registry should have been discussed as part of the document's progression through the Rec Track; so by the time a document is ready to move to the final stages of the Process the adding of items to the registry should have already been discussed. Therefore a lightweight method of adding items to a registry sounds sensible.

it's not a given that registries will only allow updates made through the W3C TR process. it probably should be up to the individual registry to define what hoops update requests have to jump through.

Answer 41 · 2019-05-22T13:25:26.000Z

On 2019-05-22 14:43, Nigel Megitt wrote: @fantasai <https://github.com/fantasai> are you suggesting that some subsections of Recs are actually inlined data sourced from other sources which can be updated in-place? So I might look at a spec one day, and see it say some text, and then the next day it shows something different?

i think the suggestion is that the registry itself *is* a REC that is always republished when it is updated.

Inlining seems both convenient for quick reading and also inconvenient because the updatable content might not be super-obviously subject to change. If someone references a dated version of a spec containing a registry they probably don't expect its contents to change. Requiring that a link is traversed in order to find the "current" version of a registry avoids this.

+1; to me a registry is something different than a spec.

Another feature of a registry that would be useful is a change notification service, analogous to (or actually?) an RSS feed for that registry, that can be subscribed to so interested parties can make their own updates, e.g. to implementations, in a timely way.

+1, that's the idea of an API for registries and their updates, and Atom might be a good choice there. it might be worthwhile to think about the problem of traffic volume, though. if you end up having implementations that constantly pull the feed, that might create some issues for popular registries.

Answer 42 · 2019-05-22T13:27:30.000Z

On 2019-05-22 14:47, Tzviya wrote: I agree with @nigelmegitt <https://github.com/nigelmegitt> that inlining registry content could lead to some difficulties. I think registries need to be easily updatable.

once again just pointing to IETF here: in their model, the RFC establishing the registry often also specifies the initial contents. the current content then is available in the registry only.

Answer 43 · 2019-05-22T13:52:55.000Z

You may be interested in the work the UK Government is doing with Registers.
https://gov.uk/registers

If I've understood this discussion correctly, this could be a model for registries.

For example, there is a canonical register of every country the UK Government recognises - https://www.registers.service.gov.uk/registers/country - it also includes countries which no longer exist, and metadata about them. Each register is maintained by a named owner, and they commit to regularly updating them. They're also cryptographically signed so that end users can be assured that they have not been compromised.

There's more detail at https://www.registers.service.gov.uk/about/characteristics-of-a-register - and I'm happy to link anyone up to the team which looks after them.

(I'm not the GOVUK rep any more - but still maintain an interest.)

Answer 44 · 2019-05-22T15:06:45.000Z

@edent wrote

Each register is maintained by a named owner, and they commit to regularly updating them. They're also cryptographically signed so that end users can be assured that they have not been compromised.

I think that captures the essence of a "registry" -- there is some specific owner, presumably chosen based on qualifications, who is accountable for keep the registry up to date and accurate, and a verification mechanism to ensure that updates are actually made by the owner.

Answer 45 · 2019-05-22T18:19:21.000Z

@fantasai are you suggesting that some subsections of Recs are actually inlined data sourced from other sources which can be updated in-place?

Yes, effectively. Not in-place as in changing a dated publication, but in-place as in the "latest version" of the spec always includes the latest copy of the registry data.

Inlining seems both convenient for quick reading and also inconvenient because the updatable content might not be super-obviously subject to change.

A registry needs to be clearly labelled as such, to opt in that data to the registry-update process. As I said, “The section defining the registry must a) state that it is a Registry per the W3C Process, b) define the fields of its table of items, c) define the method and criteria by which changes are proposed and incorporated.”

If someone references a dated version of a spec containing a registry they probably don't expect its contents to change. Requiring that a link is traversed in order to find the "current" version of a registry avoids this.

The dated version of a spec won't change. Only the undated one. Each change to the registry is an updated publication, just like any editorial change to the spec is an updated publication.

Another feature of a registry that would be useful is a change notification service, analogous to (or actually?) an RSS feed for that registry, that can be subscribed to so interested parties can make their own updates, e.g. to implementations, in a timely way.

I believe there is already an RSS feed for /TR documents. W3C might consider having per-spec RSS feeds as well. If there's a need for some more specialized data service, the WG can provide one however is convenient for the users and maintainers. There are plenty of websites out there serving copies of the ISO language codes and Unicode tables, for example--not every copy has to be served by ISO.

I think registries need to be easily updatable.

The group in charge of maintaining the registry should set up appropriate tooling, e.g. pulling data from a GH-hosted TSV once a day and, if there are changes, building it into a spec, and publishing it with Echidna. Or whatever. The registry can be inlined in the spec as a table, and/or served as a separate TSV, JSON, or other file data in the publication directory, same as other support materials like images and examples and indexes.

it might be worthwhile to think about the problem of traffic volume, though. if you end up having implementations that constantly pull the feed, that might create some issues for popular registries.

Scaling up hosting is an issue no matter what process we decide on, but ultimately we just need to host the official copy. W3C can faciliate serving copies of the data off of someone else's more robust server if needed (using appropriate data formats / APIs) and even offer a w3.org URL for the /TR documents to link to for high-traffic pulls, interesting data queries, and the like.

Hosting directly on /TR quickly and neatly solves the questions of what format, where to host, whether to design and build some new system for serving the data, and how to handle issues like archiving, longevity, branding, reputation, and authority. The issues of convenience and speed can be solved with mirrors.

now i understand: the registry is a TR, not just the document establishing it. just as food for thought: IETF has 2000+ registries (with of course far more values in them), with quite a number of updates happening. looking at this, maybe treating every update of a registry as something that triggers a published TR update might become relatively noisy.

We already have specs which are updated almost daily. This wouldn't be a new problem. And it would not be necessary to announce the registry updates. :) Announcements should be reserved for substantive changes to the framework of the registry, or if the WG particularly wants to announce some set of changes.

Importantly, we are already publishing registries like AAM through the spec publication process. This proposal just streamlines it so that these become practical to maintain.

Answer 46 · 2019-05-23T05:28:48.000Z

I think an other important question is where does stability come from. In the case of specifications, it comes from having multiple implementations that demonstrably match the spec, with the market relying on them so much that changing is generally not practical. Because of that, we don't really need a rule saying that updates to a REC must be compatible with the previous publication of this REC.

For registries there's no such thing. In the general case, there is no expectation that each entry in a registry will necessarily have multiple implementations. There are also different kind of expectations on different registries:

some can be updated and changed, you just need to not be reckless about it, kind of like specs
some should be append only
some should allow appending and deprecating, but never removing or changing
Flexible initially, then append only (or append / deprecate) after certain maturity criteria have been met

At which level do we want to enforce that:

trust the consensus of the WG and its chairs to not do anything silly
require each registry to have an "updating policy" section, which has to be followed (publication is denied if it isn't followed, it's valid ground for formal objections, etc)
Charters must have an update policy for each registry the WG hosts
The Process dictates what can be done

I kind of prefer something along the lines of 2, so that it's enforceable while still letting us account for the diversity of needs for different registries, but that raises the question of what the rules for changing the "updating policy" are. It feels that this is something that should be hard to do, but not necessarily impossible.

Answer 47 · 2019-05-23T12:58:54.000Z

On 2019-05-23 07:28, Florian Rivoal wrote: I think an other important question is where does stability come from. In the case of specifications, it comes from having multiple implementations that demonstrably match the spec, with the market relying on them so much that changing is generally not practical. Because of that, we don't really need a rule saying that updates to a REC must be compatible with the previous publication of this REC.

so that's about the REC establishing the registry, right? in the registry model, you don't update the REC. in the REC, you simply refer to the registry and say that whatever value space is managed there is the one that people should look at. the important aspect is that registry entries need to be stable, too (i.e., no updates change change the meaning of existing entries, and mostly just additions anyway).

For registries there's no such thing. In the general case, there is no expectation that each entry in a registry will necessarily have multiple implementations.

i don't quite follow. what's the "implementation of an entry"? let's take a simple example of a language tag registry. the entry "en" may not be recognized/supported by all implementations using languages, but those that do need to be able to depend on the fact that this value will always identify the english language.

There are also different kind of expectations on different registries: * some can be updated and changed, you just need to not be reckless about it, kind of like specs

in terms of breaking changes? i would disagree.

* some should be append only

yes, that's one policy.

* some should allow appending and deprecating, but never removing or changing

yes, that's another popular policy.

* Flexible initially, then append only (or append / deprecate) after certain maturity criteria have been met

some have "experimental" status for entries, but to me that's a slippery slope because it's hard to say how that should work for implementations.

At which level do we want to enforce that: 1. trust the consensus of the WG and its chairs to not do anything silly

that would be when establishing the registry (and its initial contents)

2. require each registry to have an "updating policy" section, which has to be followed (publication is denied if it isn't followed, it's valid ground for formal objections, etc)

that would be a very good idea.

3. Charters must have an update policy for each registry the WG hosts

i am not quite sure what this means.

4. The Process dictates what can be done

the question is whether defining policies is open, or you define a closed set of policies registries can choose from.

I kind of prefer something along the lines of 2, so that it's enforceable while still letting us account for the diversity of needs for different registries, but that raises the question of what the rules for changing the "updating policy" are. It feels that this is something that should be hard to do, but not necessarily impossible.

i am not sure it would be good to allow updates of the update policy. that would essentially undermine the idea that people can depend on how a registry evolves. they might depend on entry semantics being stable, and then the registry changes its policy that there can be breaking changes to updates. that would break implementations depending on the initially defined policy.

Answer 48 · 2019-05-23T13:16:18.000Z

i don't quite follow. what's the "implementation of an entry"?

That would depend on the type of registry. For some things (a list of languages), speaking of implementation doesn't make sense. For others it might: a list of video codec is a list of things that can be implemented. But there probably wouldn't be an expectation that all UAs implement all video formats. The registry could be used by a capability discovering API, and so it would be expected that many entries would not be implemented by many implementation.

Regardless, the point is, for REC, something qualifies when it has 2 implementations (roughly speaking). For registries, that doesn't work.

some can be updated and changed, you just need to not be reckless
about it, kind of like specs
in terms of breaking changes? i would disagree.

Again, that would depend on the type of registry. The example given earlier by @edent of the list of countries recognized by the UK government does change and update existing entries, and that's perfectly reasonable for that registry. The same policy would not be reasonable for a list of codecs

Charters must have an update policy for each registry the WG hosts
i am not quite sure what this means.

It means that if several policies are possible for maintaining a registry, it's not the working group who decides which one they will follow, but the Advisory Committee when they create (or update) the working group.

i am not sure it would be good to allow updates of the update policy.

I am quite sure that it would be bad to allow that to happen lightly. But you can never completely ban it: in the worse case, people can start a completely separate registry with the same information and a different policy. This is a human endavor, and mistakes will be made. So when we realize that we made a mistake in the update policy of a particular registry, it would be good if we had the option to fix it. So I think having some kind of hard-but-possible to change path for updating the update policy would be probably a good thing.

Answer 49 · 2019-05-23T13:54:59.000Z

Not in-place as in changing a dated publication, but in-place as in the "latest version" of the spec always includes the latest copy of the registry data.
...
The dated version of a spec won't change. Only the undated one.

@fantasai I'm struggling to understand this: isn't every "latest copy" an alias to a dated version? In your proposal, does an update to a registry automagically generate a new dated version of the Rec that references it, and then update the "latest" link to point to the updated Rec?

W3C might consider having per-spec RSS feeds as well. If there's a need for some more specialized data service, the WG can provide one however is convenient for the users and maintainers.

+1

require each registry to have an "updating policy" section, which has to be followed (publication is denied if it isn't followed, it's valid ground for formal objections, etc)

@frivoal : +1 to option 2.

@edent thank you for the links, that's a really useful page. Text from that page:

each register has a named owner called a ‘custodian’.

In the case of W3C I think it would be reasonable to assign custodianship to a group as an alternative to an individual.

Answer 50 · 2019-05-24T21:37:10.000Z

I have read the comments above and tried my hardest to incorporate them into a revised Wiki text at https://www.w3.org/wiki/Repositories. It would probably help me more if people had more specific edits (or wholesale replacement text)...

Answer 51 · 2019-05-25T09:24:34.000Z

On 2019-05-24 23:37, David Singer wrote: I have read the comments above and tried my hardest to incorporate them into a revised Wiki text at https://www.w3.org/wiki/Repositories. It would probably help me more if people had more specific edits (or wholesale replacement text)...

thanks for making these changes. i'd like to contribute, but it seems like my account does not allow me to edit the wiki. are you interested in diffs/comments, and/or is there a way to get access to the wiki?

Answer 52 · 2019-05-26T10:08:58.000Z

On 2019-05-24 23:37, David Singer wrote: I have read the comments above and tried my hardest to incorporate them into a revised Wiki text at https://www.w3.org/wiki/Repositories. It would probably help me more if people had more specific edits (or wholesale replacement text)...

generally speaking, i am wondering whether it wouldn't make sense to start a document that outlines how registries at the W3C are working and supported. it would be the equivalent of RFC 8126. it would cover things such as: - how specs have to create registries - how registries are working in terms of value management - how registries are working in terms of update policies - how W3C manages the registry as part of the spec lifecycle it might even have a section (or might refer to https://tools.ietf.org/html/draft-wilde-registries-02) explaining why spec writers might choose to manage aspects of their specs in registries, and what popular design patterns look like. i am not sure whether that document would be a NOTE or on the REC track. but it seems that regardless of the details, if W3C decides that going forward, supporting registries would improve the process, then such a document will be required.

Answer 53 · 2019-05-28T18:43:15.000Z

@frivoal

At which level do we want to enforce that:
trust the consensus of the WG and its chairs to not do anything silly

Under my proposal, any changes to elements in the registry fall under this policy, because I'm equating them with the current procedures for editorial changes under the Process.

require each registry to have an "updating policy" section, which has to be followed (publication is denied if it isn't followed, it's valid ground for formal objections, etc)

Yes, in my proposal this is one of the requirements to declare a Registry.

Charters must have an update policy for each registry the WG hosts

I do not propose a requirement for this. A Charter could if it wanted to, though.

I kind of prefer something along the lines of 2, so that it's enforceable while still letting us account for the diversity of needs for different registries, but that raises the question of what the rules for changing the "updating policy" are. It feels that this is something that should be hard to do, but not necessarily impossible.

Under my proposal this would be a substantive change to the spec, just like any other substantive change.

@nigelmegitt

I'm struggling to understand this: isn't every "latest copy" an alias to a dated version? In your proposal, does an update to a registry automagically generate a new dated version of the Rec that references it, and then update the "latest" link to point to the updated Rec?

Exactly. Just like any other edit to a spec.

Answer 54 · 2019-05-29T08:15:52.000Z

@fantasai

I'm struggling to understand this: isn't every "latest copy" an alias to a dated version? In your proposal, does an update to a registry automagically generate a new dated version of the Rec that references it, and then update the "latest" link to point to the updated Rec?

Exactly. Just like any other edit to a spec.

Ah I see, that seems like a non-goal or possibly even undesired for registries in the uses I've seen for it.

Rather than wanting to automagically update a Rec it seems more common for folk to want to update a different document that's referenced by the Rec. I suspect the underlying reason is usually to avoid process delays and for the group maintaining the registry to feel empowered to make quick changes. It could be that an alternative way to meet those needs would be acceptable even if it involves updating a Rec, but I don't have any evidence for that one way or the other.

However one disbenefit of your proposed approach is that external standards organisations often prefer that normative references point to dated versions of specifications. Anyone doing that would never get the updated registry entries. That might be a good or a bad thing, depending on the registry concerned, I guess.

Answer 55 · 2019-05-29T09:48:19.000Z

On 2019-05-29 10:15, Nigel Megitt wrote: @fantasai <https://github.com/fantasai> Exactly. Just like any other edit to a spec. Ah I see, that seems like a non-goal or possibly even undesired for registries in the uses I've seen for it.

+100. one of the goals of registries is to decouple the value space in the registry from the spec establishing it. in some cases, a registry may be very tightly coupled to the spec, and the registry mechanism may just be a different way of "managing spec updates". in this case, the above approach may be a good fit. in other cases (and there are many examples in the existing registries out in the wild), the registry is not so much an inherent part of the spec that established it, it just happened to be established as part of the spec. in those cases, it seems that treating the registry through some inclusion process would not be a good way of taking advantage of the general idea of registries.

Rather than wanting to automagically update a Rec it seems more common for folk to want to update a different document that's referenced by the Rec. I suspect the underlying reason is usually to avoid process delays and for the group maintaining the registry to feel empowered to make quick changes. It could be that an alternative way to meet those needs would be acceptable even if it involves updating a Rec, but I don't have any evidence for that one way or the other.

there are many examples where the spec remains unchanged when the registry is updated. having this improved spec stability often is one of the main goals behind using registries: they are not so much a publication mechanism to regulate spec updates, but more a specification design pattern to separate updates to a spec from updates to a specific value space used in the spec. as somebody implementing a spec, you can safely ignore registry updates, unless you are curious about what was added to the registry value space. the same is not true for spec updates. this pattern improves loose coupling and is at least in the IETF space one of the main motivations for using a registry.

Answer 56 · 2019-05-29T09:50:12.000Z

In "Background discussions", "Accessibility API Mappings" (AAM) is listed as a general example of a register. Though the AAM are a prime case for being registers, I think it's too specific a term to be included on that list. Suggest using "Mappings" as the general term, with the AAM as the illustrative example in the subsequent paragraph.

In the "Reference requirements" section it says that a register must be referenced by at least one W3C document. What is the process in the, albeit unlikely, event that a register is no longer referenced? The obvious thing would be to make it obsolete per the current Process perhaps?

I prefer @dwsinger's suggestion of having a registers page, as opposed to including registers on /TR. They're different beasts, and combining them on /TR seems likely to confuse.

I also agree with concerns raised by @nigelmegitt and @tzviya, that pulling a register inline into an otherwise stable Recommendation is likely to be problematic, particularly when a Recommendation is used as a legal reference.

Lastly, is there a tipping point at which a table in one specification should transition into an independent register? When that table is referenced by one or more other specifications for example?

Answer 57 · 2019-05-29T10:04:49.000Z

On 2019-05-29 11:50, Léonie Watson wrote: In the "Reference requirements" section it says that a register must be referenced by at least one W3C document. What is the process in the, albeit unlikely, event that a register is no longer referenced? The obvious thing would be to make it obsolete per the current Process perhaps?

it would be problematic to judge the utility of registries solely by the W3C specs referencing them. the beauty of registries is that they are openly accessible and managed value spaces, so once it is out there, you don't know who is using it. maybe you could mark these as "not currently used in a W3C spec, not actively managed anymore", but other than that, it would be good to keep it around. all of this would only apply if the spec establishing a registry would get unpublished, right?

I prefer @dwsinger <https://github.com/dwsinger>'s suggestion of having a registers page, as opposed to including registers on /TR. They're different beasts, and combining them on /TR seems likely to confuse.

+1, specs and registries per design are different things.

Lastly, is there a tipping point at which a table in one specification should transition into an independent register? When that table is referenced by one or more other specifications for example?

i'd say most importantly when that table is likely to evolve in ways that do not affect the stable core of the spec, so that you can keep the spec stable and still have a process for extending the table. that's the kind of "spec design good practice" that would be important to establish across WGs. https://tools.ietf.org/html/draft-wilde-registries-02#section-4 says something about this, but i am sure it could be improved. my suggestion would be to come to an initial design of the registry model for W3C, and to then reach out to WGs (maybe starting with the ones listed in https://tools.ietf.org/html/draft-wilde-registries-02#appendix-A) and do two things: - explain why registries might be a useful design pattern to start to use in W3C specs and what good practices look like. - ask for feedback how they like the initial proposal. TPAC might be a good place to do this, if there is an opportunity to engage with WGs or even have a townhall style event about the general idea and the initial proposal.

Answer 58 · 2019-05-30T00:38:50.000Z

Lastly, is there a tipping point at which a table in one specification should transition into an independent register? When that table is referenced by one or more other specifications for example?

I think the easy answer to this is that the WG should think about the trade-offs:

a table in the spec. can only be updated by following the process for updating a spec.; a register can have a lighter-weight, more rapid-response, mechanism;
updating a spec. means getting consensus of the group that owns the spec.; a register can have lighter-weight admission/approval criteria (if desired)

So, if a table is only rarely updated and then only by WG consensus, it might not benefit from being a registry. If anyone can be allowed to request new entries, and the admission criteria are reasonably easy to meet, then a registry may be preferred.

Answer 59 · 2019-05-30T00:40:36.000Z

my suggestion would be to come to an initial design of the registry model for W3C,

That's exactly what I hope I have in the Wiki, but people seem to be not commenting on or noticing it...

Answer 60 · 2019-05-30T09:39:39.000Z

On 2019-05-30 02:40, David Singer wrote: my suggestion would be to come to an initial design of the registry model for W3C, That's exactly what I hope I have in the Wiki, but people seem to be not commenting on or noticing it...

i would comment on it if i could. i'd love to contribute, but i do not have edit access to that wiki. maybe we can use something that's more open so that anybody can contribute?

Answer 61 · 2019-06-05T22:36:52.000Z

Comments are welcome here, on this issue, or in new Process issues.

Answer 62 · 2019-06-11T12:27:04.000Z

in other cases (and there are many examples in the existing registries out in the wild), the registry is not so much an inherent part of the spec that established it, it just happened to be established as part of the spec. in those cases, it seems that treating the registry through some inclusion process would not be a good way of taking advantage of the general idea of registries.

To be clear, I'm not arguing that registries can't be split out into their own /TR report with their own shortname and nothing but the rules around what the registry is about and the format and updating rules of each entry. Just that we should re-use the same publication and review mechanisms for things on /TR as much as possible (with some modifications to ease the updating of values in the registry), since publishing there

works reasonably well and is established already
addresses all the archiving concerns wrt stability of URIs across time and provision of historical data
also solves issues around finding and referencing (since there are both dated URLs and latest URLs)
does not overly-constrain the presentation of registries, since the spec editors can decide how they are formatted in the HTML and also post data files in however many convenient formats they want together with the publication

The one thing /TR is not good at is doing interesting queries against a large registry of values, but the types of queries one might want to do will vary by the registry so that is better handled by external services (which could be informatively linked from the /TR report) than by creating some new standardized service.

Answer 63 · 2019-06-13T01:13:23.000Z

I think what distinguishes registries from standards development is that the purpose of registries is effective sharing of information/data, not consensus or other types of agreement. That said, there isn't a clear line here; in some cases there might be some level of agreement desired, but still a desire to use a registry process that's not designed for having agreement. In those intermediate cases a registry process might still be appropriate if precise definitions for eligibility can be written (e.g., "the value is defined by a specification at organizations A or B in states X or Y").

I tend to think the key tension with registries is between:

making registration easy enough that people (at least those aware of the registry) don't use unregistered values in the wild, and
ensuring the registry has the information needed by its users (such as knowing how to find the information needed about the values, or the information needed to avoid duplicate registrations).

Costs to updating a registry that don't help with the second point still increase the risk of the first, so it's important to keep unrelated costs of update low. (I think this is also closely related to what I think distinguishes a registry from standards development: the purpose is effective sharing of information/data, and not consensus or other types of agreement.)

I think it's also important in the W3C context that:

a registry can still be updated after the Working Group that created it has been closed
the W3C continues maintaining registries if the maintainer disappears, which probably requires that W3C know what registries it has

Also, for what it's worth, a few recent examples of registries being developed at W3C, from a 2017 email from the TAG to the AB, include:

permissions registry
Budget API (which needs to not collide with the permissions registry)
Card Network Identifiers
Feature Policy

and an older example is the XPointer Registry.

Answer 64 · 2019-06-13T18:35:10.000Z

On 2019-06-12 18:13, L. David Baron wrote: I think what distinguishes registries from standards development is that the purpose of registries is effective sharing of information/data, /not/ consensus or other types of agreement.

that entirely depends on the registry policy. i'd say that in most registries you see in the wild there is at least some lightweight review/approval process. it would be a good idea to have some flexibility for policies for the w3c registries, so that each group can decide according to their needs. but generally speaking i agree that the registry idea is much more narrow than a general specification concept, and specifically focused on managing an evolving value space. this allows anybody using that value space (including the spec that established it) can remain stable and decide if and when they are going to take value space updates into account.

That said, there isn't a clear line here; in some cases there might be some level of agreement desired, but still a desire to use a registry process that's not designed for having agreement. In those intermediate cases a registry process /might/ still be appropriate if precise definitions for eligibility can be written (e.g., "the value is defined by a specification at organizations A or B in states X or Y").

as mentioned in https://tools.ietf.org/html/draft-wilde-registries-02#section-3.3, in most cases, there will be some process in place. that's not required, but seems to be what most registries end up doing.

I tend to think the key tension with registries is between: * making registration easy enough that people (at least those aware of the registry) don't use unregistered values in the wild, and

this is definitely an issue, and not one that can be perfectly solved. that's why WHATWG starting forking IETF registries (which may have been the worst possible way to address the problem).

* ensuring the registry has the information needed by its users (such as knowing how to find the information needed about the values, or the information needed to avoid duplicate registrations).

that's what very lightweight processes often ensure: we're not checking whether your addition makes sense to us, but we're checking that you at least provide documentation so that others can find out what you're meaning.

Costs to updating a registry that don't help with the second point still increase the risk of the first, so it's important to keep unrelated costs of update low. (I think this is also closely related to what I think distinguishes a registry from standards development: the purpose is effective sharing of information/data, and /not/ consensus or other types of agreement.)

that again depends on the registry. for example, if you have very limited value spaces (0-255, for example), you absolutely must manage them responsibly and probably with quite a bit of scrutiny.

I think it's also important in the W3C context that: * a registry can still be updated after the Working Group that created it has been closed

that's possible if the process required can still be executed, right? if the process requires review by WG, that will not be possible. so that might be a good constraint for the registry process: it cannot be bound to anything that by definition has a limited lifetime.

* the W3C continues maintaining registries if the maintainer disappears, which probably requires that W3C know what registries it has

if "maintaining" means it's available/accessible, absolutely yes. but if the process for updates doesn't work anymore, it will become a historical registry that will not change anymore.

…

Answer 65 · 2019-06-13T19:23:55.000Z

I agree with many of your points. However, in response to:

that again depends on the registry. for example, if you have very
limited value spaces (0-255, for example), you absolutely must manage
them responsibly and probably with quite a bit of scrutiny.

I should clarify that what I'm trying to say here is that from the perspective of designing a process for registries, I don't think cases like that should be seen as use cases for a registries process since those cases should probably use a standards process that involves that higher level of scrutiny from the community.

Answer 66 · 2019-06-13T22:14:17.000Z

On 2019-06-13 12:23, L. David Baron wrote: that again depends on the registry. for example, if you have very limited value spaces (0-255, for example), you absolutely must manage them responsibly and probably with quite a bit of scrutiny. I should clarify that what I'm trying to say here is that from the perspective of designing a process for registries, I don't think cases like that should be seen as use cases for a registries process since those cases should probably use a standards process that involves that higher level of scrutiny from the community.

thanks for the clarification. this is definitely something that will need to be taken into account for the final design. there is a continuous spectrum of registration processes between managing a tiny value space with lots of inspection and review on the one end, and managing one where registration is pretty much unconstrained by any process at the other end. most cases i have seen in practice are somewhere in the middle. just looking at the examples in https://tools.ietf.org/html/draft-wilde-registries-02#appendix-A you will probably already find many or all of the current candidates being somewhere along this spectrum and not on either end. i think it's important to think of registries as a pattern of writing and managing standards and their evolution. the goal typically is to avoid updating standards by "outsourcing" those parts that are known to evolve. how this evolution is constrained/managed is a different matter. i think it might be useful to start establishing and supporting this pattern at w3c. it will be most useful if it is applicable in a variety of situations. allowing a spectrum of registry management policies is an important factor to make the pattern more widely applicable.

…

Answer 67 · 2019-06-26T11:50:10.000Z

Based on https://www.w3.org/wiki/Registries#Recommendation, the discussion here, as well as the https://www.w3.org/wiki/Maintainable_Standards#Registries, @fantasai and I (mostly her) have drafted possible Process-text to implement registries, using today's process as the starting point.

The changes largely fall into two categories:

Defining what a registry, and a change to a registry, are. This is agnostic to the existing REC Track vs evergreen vs any other track we may eventually design.
minimal tweaks to the REC track to allow for registry updates without triggering transition calls or other overhead-heavy process

You can preview the document with the changes incorporated here:
https://w3c.github.io/w3process/registries/

Or in diff form here:
https://services.w3.org/htmldiff?doc1=https%3A%2F%2Fw3c.github.io%2Fw3process%2F&doc2=https%3A%2F%2Fw3c.github.io%2Fw3process%2Fregistries

The changes are in:

6.2.5 on classes of changes
6.3 which defines registries
6.5.1 for no-overhead revisions to a CR for registry updates
6.6 for allowing PR without implementation of all entries in a registry
6.8.2.3 for allowing no-overhead revisions to a REC for registry updates

This is provided to help discussion on the basis of concrete text, not as a take-it-or-leave it offer.

Answer 68 · 2019-06-26T13:38:47.000Z

On 2019-06-26 07:50, Florian Rivoal wrote: * Defining what a registry, and a change to a registry, are. This is agnostic to the existing REC Track vs evergreen vs any other track we may eventually design.

the text suggests that a registry only exists as an embedded table, and not as a type resource. to me, that's not really what a registry is, it mostly sounds like a table of values as an integral part of the spec, and some rules about how that part of the spec can be updated (as opposed to the rest of the spec, i assume). given that definition, i am assuming that a document just trying to establish a registry would go through the usual lifecycle? if so, what does that lifecycle mean, i.e. what is a WD and what is a CR and so forth when we're looking at a "registry document"? a more traditional definition of a registry would define it as a resource type itself which is created and managed according to given rules, but that exists as a resource for itself. we have had some discussions about these differences in various threads, and it seems that the text in the current draft takes a position on one end of the design spectrum (with certain constraints and side-effects).

* minimal tweaks to the REC track to allow for registry updates without triggering transition calls or other overhead-heavy process

apart from the concerns raised already, given these updates, i am wondering how/if registries can be tracked, and how people interested in registry status and evolution can track a registry itself. it seems that the current proposal is to treat "registry only" updates of a document as a general but more lightweight update. if a registry were to be treated as a standalone resource and referenced from or transcluded into a spec, implementers trying to track the registry would have an easier time following and understanding registry evolution (if they are so inclined). this would follow the general design idea behind registries which often is the attempt to better separate evolution of a spec itself, and evolution of the value spaces created/managed/referenced by that spec.

…

Answer 69 · 2019-06-26T14:03:38.000Z

@dret remarks

the text suggests that a registry only exists as an embedded table, and not as a type resource.

I am sympathetic to this point and believe the sentence

A technical report may contain one or more such registries , either alone or in addition to other normative content.

is meant to say that Recommendations may contain Registries and also that Registries may be in Technical Reports that are not themselves Recommendations.

The intent (as I understand it) of the Florian/Elika proposal is to permit Recommendations to be easily updated when the update(s) are confined to the section identified as a Registry.

Answer 70 · 2019-06-26T14:17:40.000Z

Similar point to @dret's #168 (comment)

Copied across from https://lists.w3.org/Archives/Public/public-w3process/2019Jun/0027.html to bring the conversations back here:

Thanks for this, there’s one feature of this proposal which I expect to cause friction:

This change envisages that registry content is included in RECs, and enforces that updates to registries are made by updating their containing REC. That in turn means that any dated reference to the REC will become outdated by a registry change when no other change has been made.

I’ve noted previously that this pattern does not fit with common usage of registries, where the registry content is referenced by the REC, which therefore does not need to change at all to accommodate changes in value.

The obvious get-out to address this would be to make a REC that only contains a registry, and reference that normatively from another REC. Introducing that as a pattern could work; we should be aware that it imposes a much higher bar for publication of registry content than has been set until now, where registries can take the form of a WG Note, a wiki page etc. I would expect some degree of push-back against that imposition on that basis.

Answer 71 · 2019-06-26T14:33:07.000Z

I do not interpret this as forcing Registries to only be in Recommendations. If this remains a point of confusion then we should be explicit that Registries may be maintained in other ways, including outside of /TR. The general qualities of registries enumerated in Registry requirements should apply wherever the registry is located.

Answer 72 · 2019-06-26T14:36:56.000Z

@swickr if the Registries are permitted not to be in Recommendations then that does indeed need to be clarified.

Answer 73 · 2019-06-26T17:46:43.000Z

On 2019-06-26 10:03, Ralph Swick wrote: is meant to say that Recommendations may contain Registries and also that Registries may be in Technical Reports that are not themselves Recommendations. The intent (as I understand it) of the Florian/Elika proposal is to permit Recommendations to be easily updated when the update(s) are confined to the section identified as a Registry.

sure, that's one possible way to go. the question is whether that is the best way to approach robust support of registries. IETF has 2000+ registries of various shapes and sizes, and i believe that they benefit a lot from not treating each registry as an RFC. i think it would be a good discussion to talk about the goals and benefits this group is after when talking about registries, and then start considering options that best benefits those goals. i do understand that for some cases, the "embedded and always a TR" model of a registry looks good. i think in other cases this might not be quite as true. but we're probably all bringing assumptions to the table about why registries are created, what they contain, how they are defined, and how they are managed.

Answer 74 · 2019-06-26T17:47:12.000Z

I attempted to document the comments made in today's call in this wiki page change which allows for registries to be data sets captured in various forms.

Answer 75 · 2019-06-26T17:56:29.000Z

On 2019-06-26 10:33, Ralph Swick wrote: I do not interpret this as forcing Registries to /only/ be in Recommendations. If this remains a point of confusion then we should be explicit that Registries /may/ be maintained in other ways, including outside of /TR. The general qualities of registries enumerated in Registry requirements <https://www.w3.org/wiki/Registries#Registry_requirements> should apply wherever the registry is located.

in that case it would be important to say waht that other possible form is, how it works, and where it is. also, it might be better to have registries only working one way and not in multiple ones, so that all registries can be found and used and updated and subscribed to in the same way.

Answer 76 · 2019-06-26T21:29:30.000Z

@nigelmegitt

I attempted to document the comments made in today's call in this wiki page change which allows for registries to be data sets captured in various forms.

Nice. I clarified to say that they may be represented as an HTML document, CSV, etc., not that they are such a thing: the registry is the data, not the representation.

@dret

i do understand that for some cases, the "embedded and always a TR" model of a registry looks good. i think in other cases this might not be quite as true

I would be interested in examples for which having a canonical representation in an HTML document (in addition to any other convenient representations) is fundamentally incompatible with the use case.

also, it might be better to have registries only working one way and not in multiple ones, so that all registries can be found and used and updated and subscribed to in the same way.

Agreed. Which is why I proposed publishing them on /TR :)

Answer 77 · 2019-06-26T22:41:18.000Z

On 2019-06-26 17:29, fantasai wrote: @dret <https://github.com/dret> i do understand that for some cases, the "embedded and always a TR" model of a registry looks good. i think in other cases this might not be quite as true I would be interested in examples for which having a canonical representation in an HTML document (in addition to any other convenient representations) is fundamentally incompatible with the use case.

is that a trick question? of course one can choose any structured representation. i think the important questions are not around "can i represent registry contents as HTML". it's about whether registries are embedded, are TRs, and how updates and management and notifications are managed. some more considerations are listed in https://tools.ietf.org/html/draft-wilde-registries-02.

also, it might be better to have registries only working one way and not in multiple ones, so that all registries can be found and used and updated and subscribed to in the same way. Agreed. Which is why I proposed publishing them on /TR :)

that response is quoted out of context.

Answer 78 · 2019-06-27T00:43:58.000Z

I have absolutely no idea how I would represent something like mp4ra.org (github: https://github.com/mp4ra/mp4ra.github.io) as a 'document' which is 'published' on /TR. It boggles the mind. There are multiple pages, built from a database of CSV files by a github build script.

Answer 79 · 2019-06-27T02:15:33.000Z

What if the /TR entry was a document that linked to a data table inside a spec, SQL query, or whatever? Sort of like the Readme file at the top level of a GitHub repo?

Not advocating, just brainstorming.

Answer 80 · 2019-06-27T02:22:10.000Z

Matches one of the things I think should be possible. As long as it keeps history, is backed up, etc. see the wiki ;-)

…

Sent from my iPhone

On Jun 26, 2019, at 7:15 PM, Michael Champion ***@***.***> wrote: What if the /TR entry was a document that linked to a data table inside a spec, SQL query, or whatever? Sort of like the Readme file at the top level of a GitHub repo? Not advocating, just brainstorming. — You are receiving this because you were mentioned. Reply to this email directly, view it on GitHub, or mute the thread.

Answer 81 · 2019-06-27T02:39:46.000Z

On 2019-06-26 22:15, Michael Champion wrote: What if the /TR entry was a document that linked to a data table inside a spec, SQL query, or whatever? Sort of like the Readme file at the top level of a GitHub repo?

if the TR entry is just an empty shell with a link, that may be an indication that publishing it as TR is a bit like maslow's hammer? to me, one of the advantages of a process would be that registries would be uniform in a variety of ways, such as representations, where to find them, how they define/announce their management, and how developers can find histories and get update notifications. this may be hard to do in a coherent way when the link to the actual registry contents can point to all kinds of things. the more there is a unified way to access registries (ideally an API, and maybe that's just an atom feed of updates doing it all event log style), the easier it will be to build machinery around this. while i wouldn't necessarily expect (and certainly wouldn't hope to see) lots of runtime access, it would be super useful to enable design time access: imagine you run the build pipeline of a project, and one of the things it does is telling you that for some of the data types that you're using, new values have become available in the registry. that would be relatively easy to build with a unified model, and laborsome to the point where nobody might bother when each registry has its own bespoke model of how it operates and communicates with the outside world.

Answer 82 · 2019-06-27T20:05:36.000Z

@dwsinger I may be missing something, but that site says it's managing the registration of "code points", yes? And there's a list of all the things that have been registered at http://mp4ra.org/#/atoms yes? So you would copy out those tables into a document that includes (roughly speaking) the contents of http://mp4ra.org/#/request , mark it as a W3C Registry, and publish that document on /TR. If you prefer to split it out into multiple pages, you can also do that: we can have multi-page documents on /TR. You can also include, in the publication folder, copies of the CSV files (which you link to from the document so people can find them). And presumably you'd automate the whole process so that as soon as a commit goes through in the GH repo, everything gets rebuilt and posted to /TR via Echidna, same way certain WGs automate republishing of their specs on /TR.

@dret

is that a trick question?

No, it's a serious one. You keep arguing that an HTML document is insufficient to represent a registry.

to me, one of the advantages of a process would be that registries would be uniform in a variety of ways, such as representations, where to find them, how they define/announce their management, and how developers can find histories and get update notifications. this may be hard to do in a coherent way when the link to the actual registry contents can point to all kinds of things.

Publishing in /TR answers all of these questions. They are found on /TR. The data is located in that publication, not on an external server, so that it is archived and as reliably available as w3.org itself. It defines and announces its management in that same document. Developers can find histories the same way they find the history of any other publication: through the dated version links. Update notifications via Atom/RSS feeds for specs, if they are not already available, should be simple to set up. These answers would be consistent for all W3C Registries.

Additional machinery around accessing and querying individual registry tables can be set up, but that's a question of improving tooling for convenience, and needn't be a prerequisite for establishing or using a registry: we already have all the basics covered on /TR.

Answer 83 · 2019-06-28T01:15:01.000Z

On 2019-06-27 16:05, fantasai wrote: @dwsinger <https://github.com/dwsinger> I may be missing something, but that site says it's managing the registration of "code points", yes? And there's a list of all the things that have been registered at http://mp4ra.org/#/atoms yes? So you would copy out those tables into a document that includes (roughly speaking) the contents of http://mp4ra.org/#/request , mark it as a W3C Registry, and publish that document on /TR. If you prefer to split it out into multiple pages, you can also do that: we can have multi-page documents on /TR. You can also include, in the publication folder, copies of the CSV files (which you link to from the document so people can find them). And presumably you'd automate the whole process so that as soon as a commit goes through in the GH repo, everything gets rebuilt and posted to /TR via Echidna, same way certain WGs automate republishing of their specs on /TR.

i don't think anybody doubting that it could be done somehow. the more interesting discussion whether doing it that way is a good solution to the problem.

@dret <https://github.com/dret> is that a trick question? No, it's a serious one. You keep arguing that an HTML document is insufficient to represent a registry.

i don't think i ever said that, and it's not what i am thinking. the more interesting questions are around good/useful representations, and even more questions of resource management.

to me, one of the advantages of a process would be that registries would be uniform in a variety of ways, such as representations, where to find them, how they define/announce their management, and how developers can find histories and get update notifications. this may be hard to do in a coherent way when the link to the actual registry contents can point to all kinds of things. Publishing in /TR answers all of these questions.

it's posibble one answer.

They are found on /TR.

that's different from having a registry page where you simply find all registries.

The data is located in that publication, not on an external server, so that it is archived and as reliably available as w3.org itself.

i am confident whatever backup service is used to backup w3.org would be able to back up registry data in any shape or form as well, regardless of the path on the server.

It defines and announces its management in that same document.

sure. any publication of a registry can do that, i guess.

Developers can find histories the same way they find the history of any other publication: through the dated version links.

to find out about changes in the differences you have to do a diff? to automate you have to monitor that TR, then retrieve the previous version, than do a diff, and then parse the diff's HTML into the actual change?

Update notifications via Atom/RSS feeds for specs, if they are not already available, should be simple to set up. These answers would be consistent for all W3C Registries.

if a TR contains three embedded registries, each update of any registry triggers a TR publication, right? would the proposed update tell me about the actual change, and not just about the fact that "something has changed somewhere"?

Additional machinery around accessing and querying individual registry tables can be set up, but that's a question of improving tooling for convenience, and needn't be a prerequisite for establishing or using a registry:

that wasn't what i was saying. i said others should be able to do it, and it would be good to think about how to make it as easy as possible.

we already have all the basics covered on /TR.

i don't quite grasp you utter confidence that TR is all that is ever needed. it's fine to discuss what might be useful and what may be optional when it comes to a flexible and future-proof way of managing registries for w3c specs, but there seems to be little room for discussion here.

Answer 84 · 2019-06-28T03:39:00.000Z

whether doing it that way is a good solution to the problem.

There are two kinds of good solutions. Good solutions in the abstract, were we to design things from first principles without regards for how difficult they are to roll out, and good solutions in practice, considering what we are likely to achieve in a reasonable amount of time. Just because we can easily do something doesn't necessarily make it good, but if something is good and doable in practice, that's a strong candidate.

Publishing in /TR answers all of these questions.

It's posibble one answer.

The claim isn't that TR with some tweaks is the only reasonable way we could do this, but that it is a way, and that it is a way we can easily roll out, given that we already have most of it, both on the rules side and on the tooling side.

They are found on /TR.
that's different from having a registry page where you simply find all
registries.

https://www.w3.org/TR/ lists everything on TR. https://www.w3.org/TR/?tag=css lists the subset of that that has anything to do with CSS. We could easily set up https://www.w3.org/TR/?tag=registry that would give you all TR entries that are or contain registries.

to find out about changes in the differences you have to do a diff?

You can:

look at the changes sections of the document
use https://services.w3.org/htmldiff
do a source diff
go find out from the document's headers where the source is maintained and look at the version history there

would the proposed update tell me about the actual change

If it's not already set up (maybe it is, but I cannot find it), it should be easy to set up an RSS feed for each technical report (spec, document, call them what you want) on TR, where each entry contains a copy of the abstract of the document and the changes section.

i don't quite grasp you utter confidence that TR is all that is ever needed.

I don't think the point is that TR is the only way we could ever solve that problem, but:

Used right, it does seem to solve all the use cases we've said we wanted to solve
If we want to enable this process soon (in the next 6 months rather than in the next 5 years), choosing a way that needs minor additions over what we already have seems better than a way that needs us to write everything from scratch.

Answer 85 · 2019-06-28T04:25:29.000Z

@dwsinger

Matches one of the things I think should be possible. As long as it keeps history, is backed up, etc. see the wiki ;-)

I'm getting the sense that you would like the process to be the abstract requirements, so that any process/tooling combination that fulfills them all would be valid to use. Is that right?

I think this is misguided, as it doesn't actually solve the problem, and just passes it down to people who want to maintain a registry. Each person who wants to maintain a registry then has to come up with an actual process/tooling for doing so, check with the Team whether their particular instantiation fulfills all the requirements. (Do we need to define the process for checking that the process/tooling is acceptable by the meta process?)

We need to put in the process a particular instantiation of the principles in the wiki, not the abstract principles themselves. Not "you can use anything you want, as longs as it maintains history, and has properties foo and bar", but "use this; it maintains history, and has properties foo and bar". Otherwise we're not writing a process for registries, but a dictionary definition of registries.

(or perhaps that's what you meant too, but I was becoming unsure).

As for mp4ra, I don't see what the complexity is either. For sure it is large, but I don't see what aspect of it is in conflict with the proposed process.

As far as I can tell, this is just a bunch of tables.
The http://mp4ra.org/ website presents these tables in a number of sub-sections (http://mp4ra.org/#/atoms, http://mp4ra.org/#/brands), but specs can have subsections has well, served from different URLs if we want to (https://www.w3.org/TR/CSS22/selector.html vs https://www.w3.org/TR/CSS22/colors.html)
The source is maintained in csv files, with build scripts that generate the registry site from them, but so what? @fantasai's proposal does not dictate in any way what tooling you use to build your registry. We can teach bikeshed to read csv files if we want to.
A number of table entries cross-reference eachother with hyperlinks. So what? We can do that in HTML, and tools like bikeshed make that pretty convenient.
It has a search function at http://mp4ra.org/#/search. This one's a little more fuzzy, but:
- The Process doesn't forbid us from including a similar JS-powered search function into a spec. PubRules might, but that's a question for PubRules, not for the Process.
- Arguably, the search function isn't part of the registry itself, it's just a service provided that uses the registry as its input. If, for example https://www.w3.org/TR/uievents-code/ was a registry (and even if it isn't), nobody would prevent us from writing https://www.uievents-code.org, and from having a /search page in there if we find it convenient.

Answer 86 · 2019-06-28T09:49:04.000Z

to find out about changes in the differences you have to do a diff?

In case folk aren't aware, there's a whole discipline (sometimes called "Master Data Management") that deals with managing reference data sets and their evolution over time, as well as managing the combination of different sources of the same data when it is unclear which is authoritative.

One approach that is often taken is never to delete a data point, instead marking the validity of each point with some time range.

This really helps:

to answer queries like "what would the answer have been if I'd have asked on 1st June 2019?" and
to highlight re-use of data points that had a different meaning historically, so that a reasoned decision can be made about whether that re-use is a good idea or not.
additions can be published with a "becomes valid" date in the future to allow for planned changes to be synchronised.

This is important when we think about the proposal for publishing the registry data sets as HTML documents. We could avoid the need for using a diff tool or looking at a change set by requiring this validity data to be included on each data point, and then as a standard template, including a filtering option so that any arbitrary version of the data set can be presented without having to go through additional tools.

(I am not claiming to be an expert on master data management, my knowledge is an artefact of a previous job!)

Answer 87 · 2019-06-28T10:04:19.000Z

@nigelmegitt The kind of tooling you describe seems useful, but I don't think they need to be built into the registry. They operate on the content of the registry, so as long as the content of the registry has an agreed upon automatically processable format, and that revisions in the registry are dated, that kind of tool can be built.

Registries have been described as an urgent need. I think we should be careful not to overengineer what we're doing. The core idea of a registry is quite simple. Quoting the wiki:

A registry is a data set that documents logically independent 'atoms'; conceptually a table with independent rows, and rules for the values in the columns

Now, on top of that, lots of things can be built. And maybe some of the things that can be built should be built by w3c to make it easier to work with registries. But in the end, the registry is still a (set of) table(s) with rules on what goes in there and how to update them, and version history. We need to get that part right, and the rest can be built on top.

Answer 88 · 2019-06-28T12:52:25.000Z

On 2019-06-27 23:39, Florian Rivoal wrote: There are two kinds of good solutions. Good solutions in the abstract, were we to design things from first principles without regards for how difficult they are to roll out, and good solutions in practice, considering what we are likely to achieve in a reasonable amount of time. Just because we can easily do something doesn't necessarily make it good, but if something is good and doable in practice, that's a strong candidate.

sure, we're in agreement on this one. i still cannot shake the feeling that some of the questions of rather common scenarios (multiple registries defined by a spec, how to manage and find updates to which registry with which values) are repeatedly glossed over.

The claim isn't that TR with some tweaks is the only reasonable way we could do this, but that it is /a/ way, and that it is a way we can easily roll out, given that we already have most of it, both on the rules side and on the tooling side.

yes, it is /a/ way. for the rules, i think the purpose of this is to figure out the rules for registries. there are things we can build on, there's https://tools.ietf.org/html/draft-wilde-registries-02 which i am willing to extend as needed, which would allow us to focus on why and what w3c wants to do with registries, instead of starting with the how. i am not sure that shoehorning all that's needed to support registries in a robust way into the TR process necessarily would be an easier or more future-proof than saying that specs and registries are two different things, and are handled in two different ways. but then again i don't know the internals of TR tooling.

https://www.w3.org/TR/ lists everything on TR. https://www.w3.org/TR/?tag=css lists the subset of that that has anything to do with CSS. We could easily set up https://www.w3.org/TR/?tag=registry that would give you all TR entries that are or contain registries.

i'd also have https://www.w3.org/TR/?exclude-tag=registry to make sure i don't see registry updates? as somebody who has followed TR/ for 20 years, i am concerned about the utility of this page. it already has suffered a bit under the pressure of daily updates of specs where it has become hard for people to separate meaningful updates from "another commit just happened" kind of updates. putting even more into the TR/ firehose may not help to keep it useful for those who want to see meaningful changes, instead of seeing every single thing that happens somewhere in a repo. so that's maybe just me venting about the loss of utility of TR. but that's definitely also a concern i am having. again, i totally agree that it is /possible/ to treat registries as TRs. i am still unclear about the unbridled enthusiasm to pick that one solution before we talk more about the why and what of registries.

to find out about changes in the differences you have to do a diff? You can: * look at the changes sections of the document

so that would need to be a new requirement for the "section as registry" approach then, right? always needs to clearly document the change into the changes section, and in a way that tooling can easily extract that.

* use https://services.w3.org/htmldiff * do a source diff * go find out from the document's headers where the source is maintained and look at the version history there

again, i agree that it is /possible/ to find the change. is it easy, though?

would the proposed update tell me about the actual change If it's not already set up (maybe it is, but I cannot find it), it should be easy to set up an RSS feed for each /technical report/ (spec, document, call them what you want) on TR, where each entry contains a copy of the abstract of the document and the changes section.

unless you have a strict schema for how changes are communicated in the changes section, this would still be hard for tooling to build on.

i don't quite grasp you utter confidence that TR is all that is ever needed. I don't think the point is that TR is the only way we could ever solve that problem, but: * Used right, it does seem to solve all the use cases we've said we wanted to solve

if the definition of "solving" is "it is not impossible to do it this was", then yes. but that's not the discussion we should be having.

* If we want to enable this process soon (in the next 6 months rather than in the next 5 years), choosing a way that needs minor additions over what we already have seems better than a way that needs us to write everything from scratch.

totally agreed. but i honestly don't think that anybody tries to be super heavyweight here. it's just odd to start talking about solution first instead of first discussing the problem in a bit more depth. when you look at the IANA solution, in terms of tooling, it's actually super-simple. most work there went into thinking about why registries are important, what specs should be able to do, and then setting up the simplest possible solution for that. i think a similar path to arrive at a solution might be a good idea for w3c.

Answer 89 · 2019-06-28T13:05:40.000Z

Now, on top of that, lots of things can be built.

@frivoal some things are very hard to add later. Especially for registries, if we think that managing the lifecycle of entities in a registry is important, then one cost of adding that later is that there will probably be a loss of data quality.

If the representation of the data set happens to be a document managed under an archived space, such as /TR, then it may be possible to compute this data later, with some hope of accuracy. If there is only API access then it would be extremely difficult.

Answer 90 · 2019-06-28T19:10:33.000Z

There have been a variety of use cases for "registries" presented. Not all of these require the strict data history management and specialized atomic APIs that others do. If we insist on a system that has those requirements, then it becomes harder to use for the cases that don't need it. One of the benefits of defining it through /TR is flexibility.

I'd like to point out that Unicode has, in effect, a lot of registries about its code points detailing various properties of characters and the like. They are officially published as text files, because that is a stable and easily parseable format. Other services such as https://unicode.org/cldr/utility/ and http://www.fileformat.info/info/unicode/ wrap tooling around that, providing useful ways to look at the data. Implementations import the data files regularly. But these interfaces to the data are not the canonical publication of that data.

One of the primary purposes of publishing an official registry through W3C rather than on a private server somewhere is to have a canonical publication that's basic enough to be readable and archival and importable and reusable. Hosting data somewhere else than w3.org doesn't satisfy this. Interfaces like the mp4ra.org query system are nice, but query systems are not fundamental. We need to solve the fundamental requirement: to provide the official data in a way that is consistent and continuous. Everything else is just incremental improvement in tooling.

I'm not saying we shouldn't build improved tooling. But if we need to build an entirely new system that has the same stability and consistency guarantees as /TR as a prerequisite for solving this problem, then we're not going to get anywhere soon. And we'll either have to shoehorn anything that doesn't quite fit into that specialized system to match its inputs and outputs, or be unable to handle it as a registry.

Answer 91 · 2019-06-28T19:23:33.000Z

On 2019-06-28 15:10, fantasai wrote: I'd like to point out that Unicode has, in effect, a lot of registries about its code points detailing various properties of characters and the like. They are officially published as text files, because that is a stable and easily parseable format. Other services such as https://unicode.org/cldr/utility/ and http://www.fileformat.info/info/unicode/ wrap tooling around that, providing useful ways to look at the data. Implementations import the data files regularly. But these interfaces to the data are not the canonical publication of that data.

couple of points here: - unicode is one spec (afaict) and not an org putting out a substantial stream of various specs. - the unicode spec itself is not published as a character database and vice versa. - having a unified and simple format for registries indeed is powerful, which is why the spec publishes the character database info someplace else (not embedded in the spec), and in a different and reusable form.

One of the primary purposes of publishing an official registry through W3C rather than on a private server somewhere is to have a canonical publication that's basic enough to be readable and archival and importable and reusable. Hosting data somewhere else than w3.org doesn't satisfy this. Interfaces like the mp4ra.org query system are nice, but query systems are not fundamental. We need to solve the fundamental requirement: to provide the official data in a way that is consistent and continuous. Everything else is just incremental improvement in tooling.

from what i have read so far, there seems to be no disagreement around any of these points. but again, i think we're discussing solutions before we have discussed the problem and what we want a solution to do and to be good at.

Answer 92 · 2019-07-04T14:08:08.000Z

I'm getting the sense that you would like the process to be the abstract requirements, so that any process/tooling combination that fulfills them all would be valid to use. Is that right?

I think this is misguided, as it doesn't actually solve the problem, and just passes it down to people who want to maintain a registry. Each person who wants to maintain a registry then has to come up with an actual process/tooling for doing so, check with the Team whether their particular instantiation fulfills all the requirements. (Do we need to define the process for checking that the process/tooling is acceptable by the meta process?)

We need to put in the process a particular instantiation of the principles in the wiki, not the abstract principles themselves. Not "you can use anything you want, as longs as it maintains history, and has properties foo and bar", but "use this; it maintains history, and has properties foo and bar". Otherwise we're not writing a process for registries, but a dictionary definition of registries.

OK, so, yes, I want to agree on what the rules are, and write them in the (relatively hard to change) process.

Yes, while we're learning, I want to leave as much flexibility as we can so that we learn as much as possible. I do not wish to have over-constraining rules. Part of this is the humility that I might not have realized a valid use case or useful solution.

Yes, I want to agree on the rules that we need before we dive into solutions that satisfy those rules. I thought you were disagreeing about the rules; instead, you want to bless your preferred solution (and implicitly deprecate others').

I am completely supportive of developing one or more concrete sets of infrastructure that satisfy the rules.

I would like to make it possible that existing quasi-registries could become, with small amounts of effort, Registries as defined and prescribed by the W3C process. So, for example, a Wiki could host a registry as long as the defining document and the registry have the right material.

I think we will need some guidance documents and tools, that can be much more flexibly handled than the formally approved process. One such guidance could be "how to manage a registry AS a section in a document on /TR." (Basically, say something like "this section constitutes a Registry [[as defined in the process]] and is updated according to the update process for Registries. Updates of the this section -- the Registry -- can occur without a change of name, version, or publication date, of the document.")

Answer 93 · 2019-07-04T18:29:51.000Z

On 2019-07-04 16:08, David Singer wrote: OK, so, yes, I want to agree on what the rules are, and write them in the (relatively hard to change) process.

i like the idea of first trying to say *what* is being done, before determining *how* it is done. i think so far we went from a very fuzzy idea with no definitions for solution quality to a solution. and then the argument mostly went that the solution technically can do all the things necessary. we have only a fuzzy idea *what* we want to do, and no discussion on what a good solution looks like, so that makes it hard to have meaningful discussions around solution quality.

Yes, while we're learning, I want to leave as much flexibility as we can so that we learn as much as possible. I do not wish to have over-constraining rules. Part of this is the humility that I might not have realized a valid use case or useful solution.

while i agree in principle, we should also acknowledge that there are many examples out there that we can learn from. most of the ones i am aware of have a number of similarities that might be good starting points (and none of the ones i am aware of publish and process registries in the same way they publish their specifications).

Yes, I want to agree on the rules that we need before we dive into solutions that satisfy those rules. I thought you were disagreeing about the rules; instead, you want to bless your preferred solution (and implicitly deprecate others').

good idea: first talk about the *what* before jumping to a *how*.

I am completely supportive of developing one or more concrete sets of infrastructure that satisfy the rules.

i think one should be sufficient.

I would like to make it possible that existing quasi-registries could become, with small amounts of effort, Registries as defined and prescribed by the W3C process. So, for example, a Wiki could host a registry as long as the defining document and the registry have the right material.

i can imagine w3c blessing certain solutions, but in my mind, ideally there should be one that's easy enough to use so that there is no need for allowing fragmentation. i'd like the process to say things not just about registry management itself but also ease of use for registry users (i.e., developers working with specs and not writing them). when they have to use/write different tooling because different registries represent content and publish updates in different ways, we risk diminishing the value of a w3c-wide registry approach. after all, https://tools.ietf.org/html/draft-wilde-registries-02#appendix-A does list existing specs "setting up their own registries", and one benefit of w3c supporting registries should be that these registries are published uniformly (in the same way as w3c specs are published uniformly as well).

I think we will need some guidance documents and tools, that can be much more flexibly handled than the formally approved process. One such guidance could be "how to manage a registry AS a section in a document on /TR." (Basically, say something like "this section constitutes a Registry [[as defined in the process]] and is updated according to the update process for Registries. Updates of the this section -- the Registry -- can occur without a change of name, version, or publication date, of the document.")

i am still not a fan of the "section as a registry" approach, but that's just my preference. as soon as we have some quality attributes defined we can figure out whether that approach qualifies as a good solution or not. ideally, we don't reverse engineer quality attributes from the solutions that we might individually might champion, but instead start from "how can we help w3c spec authors to improve spec quality", and see where this takes us.

Answer 94 · 2019-07-23T07:32:58.000Z

For completeness, I wanted to add this inventory of W3C related registries:

The W3C Credentials Community (https://w3c-ccg.github.io) has been maintaining several registries (at https://github.com/w3c-ccg) because we have been around for a long time over the existence of multiple WGs, we do not terminate or expire, and are quite active (weekly meetings with 20+ people and many different companies).

We are informally are using this work process for them: https://lists.w3.org/Archives/Public/public-credentials/2017Dec/0020.html but the plan is to formalize this, have our community approve it formally, and move our final registries process to here: https://github.com/w3c-ccg/registries-process

We initially inherited some cryptography related registries from the Web Payments, Verifiable Credential WG, and JSON-LD WGs as these groups were not chartered to do crypto:

https://github.com/w3c-ccg/ld-cryptosuite-registry

We have been asked by the existing W3C Verifiable Claims Working Group chairs to maintain these Verifiable Credentials related registries, as the WG will hopefully soon be complete and they need someone to maintain it after that WG winds down.

This is the first registry for evolving Decentralized Identifier specification, which will hopefully soon be an official WG. I anticipate there will be more added, and since these need to be long-lived, the intent is that they will stay in the W3C-CCG.

https://github.com/w3c-ccg/did-method-registry

-- Christopher Allen — co-chair W3C Credentials CG

cc: @jandrieu, @kimdhamilton, @msporny, @burnburn, @stonematt

Answer 95 · 2019-07-23T22:55:51.000Z

I agree that abstract rules without any clue on how to meet them are probably unhelpful as a way to get things going. But I also feel that as we learn how to manage registries, we should set the rules such that they express only what we must have, and leave as much latitude as possible for learning, modes of working, and so on. I am particularly keen that it should be as easy as possible for groups with "proto registries" to make the changes needed to become a Registry. So if they are already inline, minor edits to the document; if they already in a Wiki, minor edits to the document and Wiki, and so on.

So I suggest we write a crisp process-like section that expresses the rules and eschews verbosity and examples; but support it with a guidelines document that we can update, clarify, use to provide examples, and so on, that helps people get going.

I did those edits in the Wiki page.

Answer 96 · 2019-07-24T02:48:01.000Z

@dwsinger Random wiki systems don't have the same longevity and archival support that /TR does. If a group wants to use a wiki as the intake system and automatically copy to /TR, that's fine, but I don't think wikis are sufficient for something that has normative Recommendation-type status at W3C.

Answer 97 · 2019-07-24T03:42:09.000Z

Registries don't have normative specification status. Not everything is a Rec. you know. You seem fixated on something magical about /TR? If Wikis don't meet the requirement of being archived (we know they maintain history), then we either need to fix that or not use them.

Answer 98 · 2019-07-25T08:11:37.000Z

Hi Dave,

On Wed, 24 Jul 2019 05:42:11 +0200, David Singer ***@***.***> wrote: Not everything is a Rec. you know. You seem fixated on something magical about /TR?

That's not a very good phrasing of a technical comment on the public record. Might be worth a further edit to clarify your core point. cheers

…

-- Using Opera's mail client: http://www.opera.com/mail/

Answer 99 · 2019-07-30T23:13:25.000Z

W3C just published a FPWD (https://www.w3.org/TR/2019/WD-timing-entrytypes-registry-20190723/) that ideally should be a registry but currently is published as a WD. i have added that one to the list of W3C specs that should be registries in the draft about "The Use of Registries", and published a new version: draft: https://tools.ietf.org/html/draft-wilde-registries-03 list: https://tools.ietf.org/html/draft-wilde-registries-03#appendix-A

Answer 100 · 2019-07-30T23:16:00.000Z

On 2019-07-23 00:33, Christopher Allen wrote: For completeness, I wanted to add this inventory of W3C registries:

that's an interesting list! should any of these be added to the list maintained at https://tools.ietf.org/html/draft-wilde-registries-03#appendix-A, or are these more detached from the "official" W3C publication track? if you think they should be added, please feel free to raise an issue in the draft repo, or ideally add them yourself and submit a PR. thanks!