Allow user-defined patterns

Question

Closed this issue 8 years ago · 6 comments

For example, to require subject lines consisting of a semver level, one or more repository-specific components (separated by commas or comma-space pairs and ending with a colon), and a non-empty sentence-case message, all of which must be separated by spaces:

"commitplease": {
  // Braces like http://tools.ietf.org/html/rfc6570 URI Template expressions
  // Expression syntax like https://tools.ietf.org/html/rfc5234#section-3.6 ABNF repetition
  // …with introduction of optional separator definition in ECMAScript RegExp syntax
  "subjectPattern": "{semver-level} {1* ,[\\x20]? components}: {message}",

  // Variables are usually arrays of acceptable values
  "semver-level": [ "major", "minor", "patch" ],
  "components": [ "Build", "Test", "Core", "Legacy" ],

  // Variables can also be ECMAScript regular expressions in ESTree format
  // https://github.com/estree/estree/blob/master/spec.md#regexpliteral
  "message": {
    "regex": { "pattern": "^[^a-z\\s].*" },

    // …plus an optional human-readable description
    "description": "non-empty and sentence case (initial letter capitalized)"
  }
}
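
To make the proposal concrete, here is a minimal sketch of how such a configuration might be compiled into a single RegExp. Everything in it is an assumption for illustration: `compileSubjectPattern`, `variableSource`, and `escapeLiteral` are hypothetical helpers, not part of commitplease, and only the `{name}` and `{min* separator name}` forms (with `min` of at least 1) are handled.

```javascript
// Hypothetical sketch: none of these helpers exist in commitplease.
const config = {
  subjectPattern: "{semver-level} {1* ,[\\x20]? components}: {message}",
  "semver-level": ["major", "minor", "patch"],
  components: ["Build", "Test", "Core", "Legacy"],
  message: { regex: { pattern: "^[^a-z\\s].*" } },
};

// Escape literal text so it can be embedded in a RegExp source.
function escapeLiteral(text) {
  return text.replace(/[.*+?^${}()|[\]\\]/g, "\\$&");
}

// Turn one variable definition into a RegExp source fragment.
function variableSource(definition) {
  if (Array.isArray(definition)) {
    return "(?:" + definition.map(escapeLiteral).join("|") + ")";
  }
  // Regex variables: drop a leading ^ and trailing $ so the fragment can be embedded.
  const source = definition.regex.pattern.replace(/^\^/, "").replace(/\$$/, "");
  return "(?:" + source + ")";
}

// Expand "{name}" and "{min* separator name}" expressions.
// Assumption: only the open-ended "min*" repetition with min >= 1 is supported.
function compileSubjectPattern(cfg) {
  const expression = /\{(?:(\d+)\*\s+(\S+)\s+)?([\w-]+)\}/g;
  let source = "";
  let last = 0;
  for (const match of cfg.subjectPattern.matchAll(expression)) {
    const [whole, min, separator, name] = match;
    // Text between expressions is matched literally.
    source += escapeLiteral(cfg.subjectPattern.slice(last, match.index));
    const item = variableSource(cfg[name]);
    if (min === undefined) {
      source += item;
    } else {
      // One item, then (separator + item) repeated at least min - 1 times.
      source += item + "(?:" + separator + item + "){" + (Number(min) - 1) + ",}";
    }
    last = match.index + whole.length;
  }
  source += escapeLiteral(cfg.subjectPattern.slice(last));
  return new RegExp("^" + source + "$");
}

const subjectRegExp = compileSubjectPattern(config);
console.log(subjectRegExp.test("minor Build, Test: Fix flaky watcher"));
console.log(subjectRegExp.test("minor build: fix flaky watcher"));
```

With this sketch, the first subject passes and the second is rejected, but a single compiled RegExp can only report pass or fail. It cannot say which part of the subject was wrong.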

Another possibility is dropping the RegExp/variable expansion options in favor of pure RFC 5234 + RFC 7405 ABNF.

Answer 1 · 2015-11-02T17:38:45.000Z

Note that for the jQuery use case we might not want to prevent contributors from committing if they don't include the semver level, since this may be a barrier to contributing, unless we have a commit template they can just modify.

Answer 2 · 2015-11-02T17:43:17.000Z

I agree, but consider that a separate issue. Ideally, we'll get to a point where things like commit message formatting are purely an issue for maintainers.

Answer 3 · 2015-11-02T17:45:40.000Z

Sounds good, just wanted to have people keep that in mind.

Answer 4 · 2015-11-03T10:25:50.000Z

For the record, I'm interested in improving the validation, but will wait for the parent discussion on contribute to get resolved.

Answer 5 · 2016-06-22T01:03:29.000Z

Initially, I was thinking that specifying a commit-message style with a context-free grammar is a great idea. However, after playing a tiny bit with an implementation of an ABNF parser (its homepage, I looked at JavaScript APG: Version 2.0 and JavaScript APG: Examples), here are some challenges that come up:

I do not know how to efficiently communicate what is wrong with the message to the end user. By default the parser will only give the line numbers of lines with mistakes. You could register callbacks for Rules (as described here) that would get some details. However, since these rules are created by the user (in the package.json, as is suggested at the top), there is no way to prepare the callback functions in advance. So, if the user wants better error messages, then it looks like they have to write callbacks themselves too.
The "semver is obligatory somewhere in the body" logic discussed above is not possible with a pure context-free grammar (I am pretty sure) and needs a non-greedy extension to it:

"grammar": {
    "commit": "{header}\n\n{body}\n\n{footer}",
    "header": ...
    "body": "{semver}/{text}" <-- allows a body without a semver
    "body": "{text}{semver}{text}" <-- is not a context-free rule (semver is surrounded by text) 
}

That parser supports such an extension. However, the user must first understand the limitation, then must configure the parser. That is in my opinion a lot more work than opening up commitplease and fixing it by hand.
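
For comparison, here is a rough sketch of what the "by hand" alternative could look like. It is not commitplease's actual code: `checkCommitMessage` and `SEMVER_LEVELS` are made-up names, and the header and blank-line rules are assumptions based on the `{header}\n\n{body}` shape above. The point is only that each check carries its own error message, which is what the grammar approach makes hard.

```javascript
// Hypothetical hand-written check; the names below are illustrative only.
const SEMVER_LEVELS = ["major", "minor", "patch"];

// Return a list of human-readable problems with a commit message.
function checkCommitMessage(message) {
  const errors = [];
  const lines = message.split("\n");

  if (lines[0].trim() === "") {
    errors.push("Line 1: the header must not be empty.");
  }
  if (lines.length > 1 && lines[1] !== "") {
    errors.push("Line 2: the header must be followed by a blank line.");
  }

  // "Semver somewhere in the body": a word-boundary search instead of a grammar rule.
  const body = lines.slice(2).join("\n");
  const hasSemver = SEMVER_LEVELS.some((level) =>
    new RegExp("\\b" + level + "\\b").test(body)
  );
  if (!hasSemver) {
    errors.push(
      "Body: expected one of " + SEMVER_LEVELS.join(", ") + " somewhere in the body."
    );
  }
  return errors;
}

console.log(checkCommitMessage("Core: Fix thing\n\nNo level mentioned here."));
```

An empty array means the message passed. Otherwise each entry names the part of the message that failed and why.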

Answer 6 · 2016-06-22T10:15:35.000Z

Thanks for investigating this. Since helpful error messages are central to commitplease, I don't think this is a direction we should explore further.

@gibson042 if validating/warning for semver lines is still interesting to you, please create a separate issue for that.