/clopts

Type-safe, compile-time header-only command line options library for C++20

Primary LanguageC++MIT LicenseMIT

Info

This is a type-safe, header-only compile-time command line options library that generates a parser and a formatted help message at compile time.

It also ensures that your program doesn't compile if you, for instance, misspell the name of an option when trying to access its value.

Amongst other things, both --option value and --option=value are supported.

Everything in this library is defined in the command_line_options namespace, which is rather long to avoid name collisions, but it’s intended to be used in conjunction with either using namespace or a namespace alias (e.g. namespace cmd = command_line_options;).

Note: This is a library for modern C++ and requires a compiler that supports at least C++20. However, std::format support is not required. Note that if you’re using GCC, this library will likely fail to compile on GCC versions older than 11.2 due to missing C++20 support or GCC segfaulting.

This library works on Linux and Windows, but is mainly tested on Linux.

Building

You only need to #include <clopts.hh> and then you're good to go. There is no build step as this is a header-only library. The test directory contains some tests that you can build if you want to, but they’re not part of the main library.

Usage

Example

The following is a simple, self-contained program that shows how to use this library:

#include <clopts.hh>

using namespace command_line_options;
using options = clopts<
    option<"--repeat", "How many times the output should be repeated (default 1)", int64_t>,
    positional<"file", "The file whose contents should be printed", file<>, /*required=*/true>,
    help<>
>;

int main(int argc, char** argv) {
    auto opts = options::parse(argc, argv);

    auto& file_contents = opts.get<"file">()->contents;
    auto repeat_count = opts.get_or<"--repeat">(1);

    for (std::int64_t i = 0; i < repeat_count; i++)
        std::cout << file_contents;
}

The program can then be called as follows:

# Print the contents of file.txt 15 times in total.
$ ./program file.txt
$ ./program --repeat 4 file.txt
$ ./program file.txt --repeat=10

# Print an auto-generated help message.
$ ./program --help
Usage: ./program <file> [options]
Options:
    --repeat <number>  How many times the output should be repeated (default 0)
    --help             Print this help information

Here’s another, more complicated example if you want to see more of the features of this library:

#include <clopts.hh>
using namespace command_line_options;

static void print_number_and_exit(void* arg, std::string_view) {
    int* i = reinterpret_cast<int*>(arg);
    std::cout << *i;
    std::exit(0);
}

using options = clopts<
    positional<"file", "The name of the file", file<std::vector<std::byte>>, true>,
    positional<"foobar", "[description goes here]", std::string, false>,
    option<"--size", "The size parameter (whatever that means)", int64_t>,
    multiple<option<"--int", "Integers", int64_t, true>>,
    flag<"--test", "Test flag">,
    option<"--prime", "A prime number that is less than 14", values<2, 3, 5, 7, 11, 13>>,
    func<"--func", "Print 42 and exit", print_number_and_exit>,
    help<>
>;

int main(int argc, char** argv) {
    int number = 42;
    auto opts = options::parse(argc, argv, nullptr, &number);

    auto ints = opts.get<"--int">();
    if (ints.empty()) std::cout << "No ints!\n";
    else for (auto i : ints) std::cout << i << "\n";
}

For how errors are handled, see the ‘Error Handling’ section below.

Defining Options

The clopts type is the top-level container for all of your options.

using options = clopts<
    option<"--size", "The size of the file", int64_t>
>;

Make sure you do not try and do this:

clopts<
    option<"--size", "The size of the file", int64_t>
> options;

The vast majority of types defined by this library are not meant to be instantiated.

At runtime, the options are parsed by calling the parse() function of your clopts type, passing it argc and argv. This function returns an object containing the parsed option values.

auto opts = options::parse(argc, argv);

For a detailed description of all the option types, see the Option Types section below.

Accessing Option Values

The get<>() function returns a pointer to an option value, or nullptr if the option is not found, unless the option is a multiple option (see below). If the option is a flag (see below), it instead returns a true if the option was found, and false if it wasn’t.

auto opts = options::parse(argc, argv);
if (auto* size = opts.get<"--size">()) {
    /// Do something.
}

Note that if no option with the name --size exists, you'll get a compile-time error.

Alternatively, the get_or<>(value) function can be used to get either the option value or a default value if the option wasn’t found. Note that this function creates a copy of the option value and may thus incur extra overhead if the option value happens to be a large string. If value is returned, it is first static_cast to the option type.

auto opts = options::parse(argc, argv);

/// Default size is 10.
std::cout << "Size: " << opts.get_or<"--size">(10) << "\n";

Error Handling

This section only concerns errors that occur when parsing the options; errors that would make the options unparseable or ill-formed, such as having two different options with the same name, or trying to call get<>() on an option that doesn't exist, are compile-time errors.

When an error is encountered (such as an unrecognised option name or if an option that is marked as required was not specified), the parser invokes your clopts type's error handler, which, by default, prints the error, the help message, and then exits the program with code 1.

You can change this behaviour by passing an error handler to the parse() function.

static bool error_handler(std::string&& error) {
    throw std::runtime_error(error);
}

auto opts = options::parse(argc, argv, error_handler);

The error handler must return a bool which tells the parser whether it should continue the parsing process: true means continue, false means abort (that is, the parsing process, not the entire program).

If you pass nullptr as the error handler, the default error handler is used.

Option types

This library comes with several builtin option types that are meant to address the most common use cases. You can also define your own custom option types by deriving from option, but more on that later.

Depending on the type, some options take arguments, others don't.

Different types of options can be added by passing them to the clopts type as template parameters.

using options = clopts<
    option<"--size", "The size of the file", int64_t>,
    positional<"foobar", "Description goes here">,
    flag<"--test", "Test flag">,
>;

The options themselves are also templates. What parameters they take depends on the type, but in general every option must take at least two parameters, both of which must be string literals (or, more generally, constant expressions of type char[N]): first, the name of the option, which is how you will be accessing its value; and secondly, a description of what the option is, which is used when generating the help message.

Note that the name can be anything you want. The -- are not required.

Option Type: option

The most basic option type is option. This option type takes up to five template parameters, the last two of which are optional.

option<"--name", "Description">
option<"--name", "Description", std::string>
option<"--name", "Description", std::string, /* required? */ false, /* overridable? */ false>

Parameters

  1. The option name, which must be a string literal (at most 256 bytes).
  2. A description of the option, also a string literal (at most 512 bytes).
  3. The type of the option (see below). The default is std::string.
  4. Whether the option is required, i.e. whether omitting it is an error. The default is false.
  5. Whether the option is overridable, i.e. whether it can be specified more than once, in which case only the last value is retained. This is different from multiple<> (see below). The default is false.

Types and Arguments

The option type always takes an argument. Both --option value and --option=value are recognised by the parser. (Note: strictly speaking, this is not exactly true as option is the base class of most other option types, but users of the library should only use option<> with the data types listed here).

Supported types for the 3rd template parameter are:

  • std::string: Any string.
  • file<>: A path to a file that must exist and must be accessible.
  • int64_t: A valid (signed) 64-bit integer (as per std::strtoll).
  • double: A valid floating point number (as per std::strtod).
  • values<>: See below.
  • ref<>: See below.
Type: file<>

The file<> type indicates that the argument should be treated as a path to a file, the contents of which will be loaded into memory at parse time (note: lazy loading is not supported). When accessed with get<>(), both the path and contents will be returned. If the parser can't load the file (for instance, because it doesn't exist), it will invoke the error handler with an appropriate message, and the option value is left in an indeterminate state. The template arguments are the type to use for the file contents and path, respectively; the default is std::string and std::filesystem::path, but you can also use a std::string or std::vector<char> for either. Other types that have a constructor that takes a begin()/end() pair of char iterators should also work.

Type: values<>

The values<> type is used to indicate a set of valid values. The values must either all be strings or all be integers (doubles are currently not allowed to avoid the usual problems associated with comparing floating-point numbers for equality). For example, possible values for a values<> option are:

values<1, 2, 3>
values<"foo", "bar", "baz">

If the values are strings, get<> will return a std::string; if the values are integers, get<> will return an int64_t.

Type: ref<>

The ref<> type is used to reference other options and will capture the state of that option whenever this option is encountered. Consider

option<"--type", "Thing type", std::int64_t>,
multiple<option<"--thing", "A thing", ref<std::string, "--type">> 

Here ref<std::string, "--type"> means ‘this option takes a string and also captures the state of the --type option whenever it is encountered. A possible invocation would be

./program --thing a \
          --type 1 --thing foo \
          --type 2 --thing bar

Here, the content of "--thing" (which is a span because of the multiple<>) is

std::span<
    std::tuple<
        std::string, 
        std::optional<std::int64_t>
    >
> args = *opts.get<"--thing">();

ASSERT(
    args ==
    {
        {"a",   std::nullopt},
        {"foo", std::optional<std::int64_t>(1)},
        {"bar", std::optional<std::int64_t>(2}}
    }
);

As you can see, the individual values are stored as tuples here. Because ref<> could reference the value of an option before it has been encountered, the referenced values are optionals, and std::nullopt is used if the option hasn’t been seen yet.

More notes about ref<>s:

  • Because it would theoretically be possible to construct cycles, ref<> options currently cannot reference each other (but conversely, referencing options that are passed after the ref<> option is perfectly fine).
  • multiple<> interacts with ref<> as expected; particularly, a ref<> referencing another multiple<> will store a vector of the referenced values.

Option Type: flag

Flags have no argument. For flags, get<>() returns a bool that is true when they're present, and false otherwise, i.e. they default to false. Flags are never required as that wouldn’t make much sense (just ... don’t pass the flag if you don’t want it to be set); they also can’t be overridable, because once they’re set, there is no unsetting them.

flag<"--name", "Description">

Option Type: positional

Positional arguments are arguments that do not have a name on the command line. If the parser encounters an option whose name it does not recognise, it will store it in the first positional argument that does not yet have a value.

Just like options, positional arguments take up to four template parameters and need not be strings. However, unlike options, positional arguments are required by default. Additionally, positional options can’t be overridable (see multiple<> for more info on this).

By way of illustration, consider the following options:

clopts<
    flag<"--foo", "Description">,
    positional<"bar", "Description">,
    positional<"baz", "Description">
>

A program that uses these options can be called as follows:

./program opt1 opt2 --foo
./program opt1 --foo opt2
./program --foo opt1 opt2

In the examples above, *get<"bar">() is always "opt1", and *get<"baz">() is always "opt2".

Option Type: help<>

This is a special option type that takes up to one argument and no template parameters and adds a --help option that simply prints a help message containing the names, types, and descriptions of all options and then exits the program. Note: You have to use help<>, rather than just help.

This message can also be retrieved by calling the static help() function of the clopts type.

You can specify a custom help message handler to override the default behaviour by adding it to the help type as a template parameter. The void* data pointer optionally passed to options::parse() is passed as the third argument to the help message handler (see also the description of the func<> type below). Both the program name and user data argument are optional, but they must always occur in the order indicated below.

static void custom_help(
    std::string_view program_name, 
    std::string_view msg,
    void* user_data
) {
    std::cerr << program_name << msg << "\n";
    std::cerr << "Additional help information goes here.\n";
    std::exit(1);
}

clopts<
    help<custom_help>
>;

Option Type: overridable<>

This is just an alias:

overridable<name, description, type>

is equivalent to

option<name, description, type, /* required */ false, /* overridable */ true>

in every respect.

Meta-Option Type: multiple<>

The multiple option type can’t be used on its own and instead wraps another option and modifies it such that multiple occurrences of that option are allowed:

multiple<option<"--int", "A number", int64_t>>

Calling get<>() on a multiple<option<>> or multiple<positional<>> returns a (possibly empty) std::span of the option result type instead (e.g in the case of the --int option above, it will return a std::span<int64_t>). However, get_or<>() will still the return default value if the option wasn’t found.

Properties

  • If the wrapped option is marked as required, then it is required to be present at least once.
  • multiple<> options cannot be overridable.
  • multiple<> should only ever be used with flags and options.
  • multiple<func> is currently invalid, as func options can already occur multiple times.
  • There can only be at most one multiple<positional<>> option.
  • multiple<multiple<>> is invalid.

**Design Note: multiple<> and overridable options **

multiple<> is similar to overridable, and both interact with positional<> options in an interesting manner. The difference between the former is that multiple<> captures all values that are passed, whereas overridable only holds on to the last one. This is also why combining is not allowed, as it doesn’t make sense (both retaining all and only the last one obviously doesn’t work).

Furthermore, positional<> options can be multiple<>, but not overridable. This is currently more of a design choice than a necessary restriction: An overridable positional option would have to act identically to a multiple<positional<>> option in that it would gobble up any positional arguments that are passed. This means we’d also have to check that there isn’t both a multiple<positional<>> and an overridable positional option, because it’s not clear which one would take precedence. Imo, there also isn’t much of a use case for an overridable positional<> option that you can’t just use a multiple<positional<>> option for, but if anyone has one, feel free to open an issue.

Option Type: stop_parsing<>

This option is used to indicate that the parser should stop processing options when it is encountered. It takes a single optional string argument whose default value is "--":

stop_parsing<> // Equivalent to 'stop_parsing<"--">'.

Properties

  • This option is never required and cannot be overridable.
  • More than one stop_parsing<> option is allowed, if you really have a need for that somehow.
  • multiple<stop_parsing<>> is invalid.
  • Any unprocessed options after the stop parsing option can be retrieved using the unprocessed() function of the type returned by parse().
  • The parser will still error if there are any required options that were not seen before parsing was stopped.

Option Type: func

A func defines a callback that is called by the parser when the option is encountered. You can specify additional data to be passed to the callback in the form of a void*; this pointer is passed as the optional fourth parameter to the parse() function (recall that the third parameter is the error handler).

Furthermore, if the specified function takes a std::string_view, the option now takes a value, and the parser will pass the option value to the callback, and if the function takes two std::string_views, the parser will pass the option name and value to the callback. The void*, if present, must be the first parameter.

The same void* passed to parse() is passed to all func options, as well as to the help option if you specify a custom help handler that takes a void* as its last parameter.

A func option can always occur multiple times, which is why multiple<func<>> is invalid.

The following is an example of how to use the func option type:

static void print_number_and_exit(void* arg) {
    int* i = reinterpret_cast<int*>(arg);
    std::cout << *i;
    std::exit(0);
}

using options = clopts<
    func<"--print42", "Print 42 and exit", print_number_and_exit>
>;

int main(int argc, char** argv) {
    int x = 42;
    options::parse(argc, argv, nullptr, &x);
}

It is a compile-time error to call get<>() on a func option.

Custom Option Types

You can define your own custom option types by inheriting from option. For instance the builtin flag type is defined as

template <
    detail::static_string _name,
    detail::static_string _description = "",
    bool required = false>
struct flag : option<_name, _description, bool, required> {};

Make sure to use static_string if you want to be able to pass a string literal as a non-type template parameter.

However, keep in mind that implementing certain features, like the func type, would also require modifying the main library code and that the library is not designed to allow for the addition of arbitrary new option types. It is recommended to use the builtin option types whenever possible, and there are no plans to add additional support for user-defined option types.

At the same time, if you have an idea for an option type that it would make sense supporting, feel free to open an issue or a pull request. Additional option types may be added in the future.