/intcodeutils

Intcode toolchain for Advent of Code 2019 intcode computer

Primary LanguagePython

intcodeutils

Intcode toolchain for Advent of Code 2019 intcode computer

The toolchain originates around the intelf format, a format understandable by the Elfs maintaining the intcode computer

Advent of Code

An advent calender with programming puzzles, made by Eric Wastl

Author of intcodeutils is not affiliated with Advent of Code

More information about Advent of Code is available their homepage

Installation

Recommended method of testing is using pip, in an virtualenv

Run in a shell:

$ virtualenv venv
$ . venv/bin/activate
$ pip install git+https://github.com/pengi/intcodeutils.git

Tools available

$ intcode-asm
$ intcode-ld
$ intcode-objcopy

Since intcodeutils is under development, it is not recommended to install outside a virtualenv.

More information about virtualenv is available on their homepage

intcode

A big part of the Advent of Code 2019 puzzles was the intcode computer. The intcode computer enabled the possibility to hide puzzle logic from the user, but at the same time can be executed on the users computer.

The intcode computer is described in detail in the puzzles building the computer:

  • Day 2 - opcode 1,2 and 99
  • Day 5 - opcode 3,4 from part 1, 5,6,7,8 from part 2 and parameter mode 1
  • Day 9 - opcode 9 and parameter mode 2

Note that the intcode computer contains a "base reference" from day 9. In intcodeutils, it is called "stack pointer" or "sp".

The intcode files is handled by intcodeutils as described in the puzzles, using the file extension .intcode

An example

109,72,21101,0,9,0,1105,1,40,1101,9,0,30,1006,30,29,1001,30,-1,30,1001,30,31,25,4,0,1105,1,13,99,0,10,101,100,111,99,116,110,105,32,109,1,104,72,104,101,104,108,104,108,104,111,109,-1,2106,0,0,14,72,101,108,108,111,32,105,110,116,99,111,100,101,10

intelf

To be able to link mulitple files, a file that can be read by the intcode-elfs is requred, containing linking information. For non-intcode computers, the elf files exists.

For intcode, an intelf file is available, containing intcode with relocation information, and symbol names

The format is divided in section. A section is atomic, and is not intended to be split. A section can contain symbols, wich is named offsets within the section. The sections should be unique within the intelf, and consecently int the intcode project.

Intelf format

Example:

.section.a: 1,2,3,4,5,6,7
sym_a.offset: 4

.section.b: 1,2,3,4,5,6,7
sym_b.offset: 4

The file contains two sections, named ".section.a" and ".section.b"

The symbol "sym_a" points to the number "5" in the ".section.a" block, since all addresses starts at 0

Also, symbol-relative values can be added:

.text: 1101,2,3,dest+2
main.offset: 0

.data: 0,0,0,0
var.offset: 0

For relocation within a section, it it possible to add values relative to section start:

.text: 1101,2,3,.+4

Where . is the address of section start, when relocated

Tool: intcode-asm

Assembler for assembly language to intelf

The syntax used for intcode-asm is as follows:

@section .section.a
    add [2],[3],[4]
sym_a:
    jt [6],[7]

@section .section.b
    add [2],[3],[4]
sym_b:
    jt [6],[7]

There are four kinds of lines:

  1. @section starts a section
  2. label: defines a label (which currently also is a symbol...)
  3. op arg1,arg2,arg3 is an instruction, which will be converted to intcode
  4. ; comment is a comment line. The line must start with ; (except for whitespaces)

Available instructions are:

Instruction Description
add a,b,c Add a and b, store at c
mul a,b,c Multiply a and b, store at c
in a Input a value, store at a
out a Output value a
jt a,b Jump to b if a is not zero (is true)
jf a,b Jump to b if a is zero (is false)
lt a,b,c Compare a and b, store 1 in c if a<b, store 0 otherwise
eq a,b,c Compare a and b, store 1 in c if a=b, store 0 otherwise
addsp a Increases stack pointer with a
halt Halt the intcode computer

Values can be written in 4 formats:

Format Example Description
Immediate 32, -12 Contant value
Symbol relative var, var+12 Contant address
Memory lookup, constant address [32] Value at contant memory location
Memory lookup, symbol relative [var+12] Value at address relative to symbol

The var can have some special values:

  • %sp - The value of the current stack pointer
  • %pc - The address of the current instruction start address. Useful for relative jumps. Converted to .+var automatically
  • . - The address of section start, filled in when linking.

Tool: intcode-ld

Linker for intelf files

The linker needs a linker script as input, to have information about how to combine the files. The linker script has the extension .intld

The output of a linker is a set of sections, which is a combination of the sections of the input intelf files, given a script.

A normal linker script would be:

.exec:
    origin 0
    load .loader
    load .text.*
    load .data.*
    sym _stack

Indentation is optional

There are two types of lines:

Format Example Description
Start with ., end with : .exec: Defines an output section
operation + value origin 0 An operation on the current section

All section names starts with a .

There are three operations:

  • origin N sets the origin of the section, which is the start address. Must be set before loading data.
  • load pattern Loads all content from sections matching defined pattern. See section pattern below
  • sym name defines a symbol at given point, so code can refer to parts of section

Section pattern

Section pattern is used to pattern match sections names. It can be one of two types:

  1. Absolute match
  2. Prefix match

Absolute match is an extact section name. for example, .text matches .text but not .text.something

Prefix match matches a section name prefix. The preifx must not split parts between .. Having a prefix match is marked by adding .* to the end. For example .text.* matches .text, .text.something, but not .textsomething or .something

Tool: intcode-objcopy

Export a non-relocatable intelf to intcode

To export an intelf to executable intcode, there are some requirements:

  1. All sections needs to be non-relocatable, which means specified origin
  2. All data needs to be absolute, which means not references to unresolved symbols

To export intelf to intcode, run:

$ intcode-objcopy -o program.intcode program.intelf