Polygott
Overview
Replit.com allows you to quickly get started with any programming language
online. In order to provide this capability, our evaluation server uses
Docker with the
buildx
CLI plugin to ensure
that all these languages are installed with the appropriate environment set up.
We previously used a separate Docker image for each language, but concluded that it was both simpler and more efficient to use a single image which contains all supported languages simultaneously. The code necessary to build this combined image, Polygott, resides in this repository. If you're lost and need some reference, we have a blog where we added elisp.
Because of the fact that building a monolithic image is unwieldy and takes too much time, the build itself is made of a directed graph of intermediate nodes, where each language is a node, and they all get stitched together at the end to build the final combined image.
Build and run
You can build either the entire image, a version that has a bundle of languages, and a version that is limited to a single language. The second one is used for CI, and the latter is recommended when you are adding or modifying support for a particular language, since building the entire image takes an extremely long time. Makefile targets are provided for each of these cases:
% make help
usage:
make image Build Docker image with all languages
make image-ci Build Docker image with all languages needed for CI
make image-LANG Build Docker image with single language LANG
make run Build and run image with all languages
make run-LANG Build and run image with single language LANG
make test Build and test all languages
make test-LANG Build and test single language LANG
make changed-test Build and test only changed/added languages
make help Show this message
As you can see, there is a facility for testing that languages have
been installed and configured correctly. This involves running
commands which are specified in the languages' configuration files,
and checking that the output is as expected. To debug, you can also
use the make run
and make run-LANG
targets to launch shells within
the images.
You may want to bypass Docker's cache temporarily, for example to
debug an intermittent network error while building one of the
languages. To do this, identify the docker build
command which is
run by the Makefile, and run it yourself with the --no-cache
flag.
The CI requires having up-to-date generated artifacts (the files in out/
)
committed to ensure a consistent Docker build environment. These will be
refreshed by running the tests, or by running make -B build/stamps/out
).
Language configuration
Each supported language has a
TOML file in the languages
subdirectory. The meaningful keys are as follows:
Mandatory
entrypoint
: The name of the file which will be executed on Replit.com when you press the Run button. This is used in thedetect-language
script built into the Polygott image: if a file exists with this name, then the project is detected to have this language. (Ties are resolved bypopularity
.) It is also used by therun-project
script in order to identify the main file of the project.extensions
: List of file extensions (use"py"
, not".py"
) which files of this language may have. This is used in thedetect-language
script built into the Polygott image: if a file exists with one of these extensions, then the project is detected to have this language. (Ties are resolved bypopularity
.)name
: The name of the language. The TOML file should then be named<name>.toml
. This is also what you pass to the Makefile'simage-LANG
andtest-LANG
targets.
Optional
aliases
: List of strings indicating alternate names for the language, in addition toname
. This is used to allow therun-language-server
script to accept-l c++
in addition to-l cpp
, among other things.aptKeys
: List of PGP key IDs that must be passed toapt-key
in order for the customaptRepos
configured to be trusted. For example,"09617FD37CC06B54"
.aptRepos
: List of repository strings that must be passed toadd-apt-repository
in order for the custompackages
configured to be available. For example,"deb https://dist.crystal-lang.org/apt crystal main"
.compile
command
: The full command to compile theentrypoint
file, as a list, including the filename. This is run before therun
command.
languageServer
command
: Command to start an LSP language server, as a list. This is used in therun-language-server
script built into the Polygott image.
packages
: List of additional Ubuntu packages to install for this language. (Packages which are required by all or many languages should be placed instead inpackages.txt
.) Check the Ubuntu Bionic package listing to see what your options are.popularity
: Floating-point number indicating how popular the language is. Defaults to2.0
. This is used in thedetect-language
script built into the Polygott image: if a project is detected as more than one language, the winner is chosen by comparing popularity scores.run
: Required, unless you provide notests
or you have asked toskip
all of them.command
: The full command to run theentrypoint
file, as a list, including the filename. It is run after thecompile
command, if one is provided. This is used to run tests.
runtimeSetup
: List of shell commands to be run by thepolygott-lang-setup
script built into the Polygott image.setup
: List of shell commands to be run in phase 2 of the build process, as post-install steps.tests
TESTNAME
code
: String to write to theentrypoint
file before invoking therun
command.output
: String expected to be written to stdout (not stderr) by running the code.skip
: Boolean, optional. If true, then the test is skipped. This allows you to easily "comment out" a test if there is something wrong with the infrastructure.
versionCommand
: A command to output the version of the language, as a list of strings. For example,["kotlin", "-version"]
. This is used in thepolygott-survey
command built into the Polygott image. (IfversionCommand
is omitted, some heuristics are used to guess it.)
Build process
- The language configuration files are validated against
a JSON Schema specification located in
schema.json
. - The
.ejs
files in thegen
directory are transformed into shell scripts using the language configuration via the EJS templating system. This is done by a Node.js script inside the Docker image. - The multi-stage
Dockerfile
is created from the language configuration files. - Languages are installed in several phases. In phase 0,
shared packages in
packages.txt
are installed, as well as APT keys and repositories are configured. In phase 1.0,apt-get update -y
is run, which can be reused in later phases. In phase 1.5, language-specific packages are installed. In phase 2, language-specificsetup
commands are run.
Usage
Aside from all the language executables (python3
, ruby
, rust
,
etc.), there are several additional scripts available within the
Docker image. They are documented below.
polygott-self-test
Run the tests defined in each language's configuration file, as in
make test
or make test-LANG
. Always run all the tests, but if one
of them fails, exit with a non-zero return code.
polygott-survey
Run the versionCommand
specified for every language, and output the
results in tabular format to stdout.
polygott-lang-setup [-l LANG]
Copy the contents of /opt/homes/LANG/
to /home/runner/
, and run
the runtimeSetup
commands for it, if any were provided. LANG
defaults to the output of detect-language
.
detect-language
Try to identify the language used by the project in the current
directory. This first checks if the entrypoint
file exists for any
language, and then checks if a file with any of the registered
extensions
exists for a language. If multiple languages match in
either of those two phases, then the popularity
of the two languages
is used to resolve ties.
Output the language name to stdout
if one is detected, otherwise do
nothing.
run-project [-s] [-b] [-l LANG]
Execute the compile
and run
commands on the entrypoint
file in
the current directory. LANG
defaults to the output of
detect-language
. If -s
is passed, then the entrypoint
file is
written with the contents of stdin. If -b
is passed, then some
special logic is used instead of the compile
and run
commands;
see the source for details.
run-language-server [-l LANG]
Run the languageServer
command configured in the language's
configuration file. LANG
defaults to the output of
detect-language
.
Deployment
When a commit is merged to master
, CircleCI
automatically builds Polygott and pushes the image to Docker
Hub. A Replit
engineer has to then push the new Polygott to production.