
Machine Programmer

Overview

The goal of this project is to create a program that can program, and to explore different AI strategies for doing so. Given a set of test cases, it should write a program that passes all of them.

MachineProgrammer is a proof of concept that uses genetic algorithms to write working programs in x86 NASM Assembly. The instruction set is simplified to speed up evolution, but is sufficient for writing simple math programs.

Technologies
Client-Side: jQuery, JavaScript, JSON
Server-Side: Python, x86 Assembly, Flask, JSON, DEAP, Pusher, Envoy, NASM, GCC

Example

Let's say we want our machine "programmer" to write a program that multiplies the first two integers, then adds the third. Our programmer needs test cases to figure out what kind of output we're looking for, so here are some simple test cases that cover positive integers, negative integers, and zero.

| # | Inputs | Function | Output |
|---|--------|----------|--------|
| 1 | 10, 10, 10 | 10 * 10 + 10 | 110 |
| 2 | -10, -10, -10 | -10 * -10 + -10 | 90 |
| 3 | 0, 10, 10 | 0 * 10 + 10 | 10 |
| 4 | -10, 10, 10 | -10 * 10 + 10 | -90 |
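
For reference, the function our programmer is asked to discover is simply the following, shown here as a plain Python function for clarity (it is not part of the project's code):

```python
def target(a, b, c):
    # Multiply the first two integers, then add the third.
    return a * b + c

assert target(10, 10, 10) == 110
assert target(-10, -10, -10) == 90
assert target(0, 10, 10) == 10
assert target(-10, 10, 10) == -90
```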

In this prettified interface, we would enter our test cases like this:

[Screenshot: entering the test cases in the web interface]

Our programmer then uses genetic algorithms to generate candidate programs and tests each one against our test cases to see if it meets our requirements. It keeps going until it finds and outputs a successful assembly program, or until it reaches the maximum number of generations it can run.

[Screenshot: the successful assembly program returned to the client]

Under the Hood

### Creating a population of programs

The server begins by initializing evolution settings such as the population size and the maximum number of generations. It then triggers the birth of many little programs, each a list of instructions. DEAP, a genetic algorithm library, provides the base class for our program class, which is filled with random assembly instructions and operands. Our population is then generated from that default blueprint.
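
Below is a minimal sketch of how such a population could be set up with DEAP. The instruction set, operand list, and helper names here are illustrative assumptions, not the project's actual code (the real instruction set lives in `/main/asm.py`):

```python
import random

from deap import base, creator, tools

# Hypothetical simplified instruction set and operands, for illustration only.
INSTRUCTIONS = ["mov", "add", "sub", "imul"]
OPERANDS = ["eax", "ebx", "ecx", "edx"]

def random_instruction():
    """Build one random two-operand assembly instruction, e.g. 'add eax, ebx'."""
    return "{} {}, {}".format(random.choice(INSTRUCTIONS),
                              random.choice(OPERANDS),
                              random.choice(OPERANDS))

# Lower fitness is better: we count how many test cases a program fails.
creator.create("FitnessMin", base.Fitness, weights=(-1.0,))
creator.create("Program", list, fitness=creator.FitnessMin)

toolbox = base.Toolbox()
toolbox.register("instruction", random_instruction)
# A program is a fixed-length list of random instructions.
toolbox.register("program", tools.initRepeat, creator.Program,
                 toolbox.instruction, n=10)
toolbox.register("population", tools.initRepeat, list, toolbox.program)

population = toolbox.population(n=100)  # matches the default "population" setting
```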

### Evolving code with genetic algorithms

We then loop through our population and randomly mutate or mate programs to create new ones for the next generation. Our mutation algorithm takes a single program and randomly shuffles the order of its instructions. The mating algorithm takes two programs and cross-breeds them, taking parts of one and parts of the other to create a new "offspring" program. Every generation of new programs is then sifted through and evaluated for fitness.
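
Continuing the sketch above, the variation step could look something like this using DEAP's built-in operators (the exact operators and probabilities used in the project may differ):

```python
import random

from deap import tools

MATE_PROB, MUTATE_PROB = 0.5, 0.2

# Shuffle mutation: randomly reorder instructions within a single program.
toolbox.register("mutate", tools.mutShuffleIndexes, indpb=0.1)
# One-point crossover: splice the head of one parent onto the tail of the other.
toolbox.register("mate", tools.cxOnePoint)

offspring = [toolbox.clone(ind) for ind in population]

for child1, child2 in zip(offspring[::2], offspring[1::2]):
    if random.random() < MATE_PROB:
        toolbox.mate(child1, child2)
        del child1.fitness.values, child2.fitness.values  # fitness must be recomputed

for mutant in offspring:
    if random.random() < MUTATE_PROB:
        toolbox.mutate(mutant)
        del mutant.fitness.values
```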

### Compiling and evaluating fitness

Our machine programmer then compiles and runs each candidate program to get its output, which it checks against the expected output. If the program succeeds, the programmer can happily stop the evolution process and send the fruit of its success to the client. If it fails, it moves on to the next experiment.
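
A sketch of that evaluation step, assembling with NASM and linking with GCC through `subprocess`, might look like this. The file layout, the `render_asm` helper, and the scoring scheme are assumptions, not the project's exact pipeline:

```python
import subprocess

def evaluate(program, test_cases):
    """Compile a candidate program and count how many test cases it fails."""
    # render_asm is a hypothetical helper that wraps the evolved instruction
    # list in the boilerplate of a complete 32-bit NASM program.
    with open("candidate.asm", "w") as f:
        f.write(render_asm(program))

    # Assemble and link; a toolchain error means worst-possible fitness.
    try:
        subprocess.check_call(["nasm", "-f", "elf32", "candidate.asm", "-o", "candidate.o"])
        subprocess.check_call(["gcc", "-m32", "candidate.o", "-o", "candidate"])
    except subprocess.CalledProcessError:
        return (len(test_cases),)

    failures = 0
    for inputs, expected in test_cases:
        result = subprocess.run(["./candidate"] + [str(i) for i in inputs],
                                capture_output=True, text=True)
        if result.stdout.strip() != str(expected):
            failures += 1
    return (failures,)  # DEAP expects a tuple; zero failures means success
```

With zero failures, the evolution loop can stop early and return the winning program to the client.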

Usage

You will need a Linux OS for it to work right out of the box. Both 32-bit and 64-bit systems are supported.

The following are installation instructions for 64-bit Linux. If you're on a 32-bit system, replace 'gcc-multilib' with 'gcc' if you don't already have it (GCC is usually installed on Linux by default).

```bash
apt-get install nasm
apt-get install gcc-multilib
git clone https://github.com/evac/MachineProgrammer.git
cd MachineProgrammer
pip install -r requirements.txt
python run.py
```

You can adjust the evolution settings in the run.py file. The default settings are:

```python
SETTINGS = {
  "population": 100,
  "max_generations": 10,
}
```

You can also increase the complexity of the output programs by adding to the assembly instruction set in /main/asm.py. Beware that the more complex the problem you're trying to solve, or the more instructions you add, the longer it takes to reach a successful solution.
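
For illustration only (the actual structure of /main/asm.py may differ), extending the instruction set might look something like this:

```python
# Hypothetical layout of the instruction set in /main/asm.py.
INSTRUCTION_SET = [
    "add",   # addition
    "sub",   # subtraction
    "imul",  # signed multiplication
    # Each instruction added here widens the search space, so evolution slows down:
    "neg",   # two's-complement negation
]
```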

Final Thoughts

As mentioned in the introduction, the goal was to create a program that can program. Using genetic algorithms was just one of several approaches I wanted to experiment with and see how far it could go. While it's certainly been a fascinating and educational approach to implement, there are several weaknesses that will prevent me from pursuing this approach in my next version.

  1. Does not scale well with complexity. The more complex the problem, or the more factors it has to deal with, the longer it takes to reach a solution. This approach may work better for a single big objective where it can take its time, but not so much for small, immediate tasks.

  2. Unable to take full advantage of evolution's benefits. The main advantage of genetic algorithms is that the quality of the solutions improves with every generation. However, a program is either successful or not, and there's no in-between like having 50% correct code. The closest measure of incremental improvement is the number of test cases a program passes, which is not an ideal fitness evaluation to rely on.

In future versions, a few approaches I plan to explore are reinforcement learning, simulated annealing, and neural networks.