Artificial Intelligence / ML

AEI-2021-I-3.4

Digit recognition (source)

This solution trains a TensorFlow model to correctly identify handwritten digits. This uses a JavaScript solution using Keras and Tensorflow, using the MNIST dataset of handwritten digits.

Input

Datasets are provided by the MNIST database. A sample of this dataset can be seen below.

Model

The network contains convolution functions, each shifting bits in the image based one its brightness. This function applies image kernels to the image in 5x5 blocks (pixels) with 8 filters. Each iteration (at most) 1 block is changed. The convolution function has activation relu, which returns the original value if it is positive, or zero. The kernel itself is varianceScaling, which indicates the expected deviation of a random variable from the mean value of the kernel size.

After the image kernel is applied, max pooling is applied (maxPooling2d). This acts as a type of downsampling, using the maximum value of an area instead of the average. This results in a chunk such as the one below returns 10 (max) instead of 4.25 (3+10+5-1 -> 17, 17/4 -> 4,25)

The convolution and max pooling is then applied again, but with a filter count of 16. After this the output is flattened into a 1D vector (from a 2D layered vector). Turning e.g. [[0,2], [1,3]] into [0,2,1,3]. Finally the output is separated into 10 classes, each representing a digit from 0 to 9. This is again done using variance scaling, with softmax activation. Softmax gets the maximum value by using the exponent of the value divided by the exponent of the sum of all values. This ensures no value is below zero, or above one (range [0,1]).

Finally the model is compiled using a standard training compiler (Adam), with categorical cross-entropy loss and accuracy metrics.

Sources

Parameters

There are a total of 65.000 dataset elements, of which 55.000 can be used as training elements (from MNIST).

Baseline values are configured to run a single epoch with a batch size of 128 using a training dataset size of 500. Default values are configured to run a total of 25 epochs with a batch size of 512 using a training dataset size of 5500. For additional accuracy, a maximum of 200 epochs can be chosen with the default values. More epochs can be used, though after ~200 the accuracy is typically already 1.0 ±.

Results

Baseline

Default

Accuracy

Alternative (source)

This same model has been implemented in Java, using Deep Learning for Java (DL4J). This implementation has a lower weight decay, and takes longer to train. However the accuracy of results is noticeably better, as can be seen in the results below.

Confusion Matrix

    0    1    2    3    4    5    6    7    8    9
---------------------------------------------------
  972    0    1    0    0    1    2    1    3    0 | 0 = 0
    0 1123    2    1    0    0    2    0    7    0 | 1 = 1
    3    2 1005    6    2    1    2    6    5    0 | 2 = 2
    1    1    3  985    0    7    0    5    6    2 | 3 = 3
    1    0    3    0  958    0    5    2    2   11 | 4 = 4
    4    2    1   13    0  862    7    1    2    0 | 5 = 5
    8    3    1    0    6    5  933    0    2    0 | 6 = 6
    1    6   22    3    0    0    0  987    3    6 | 7 = 7
    7    0    2    6    3    2    6    4  938    6 | 8 = 8
    4    8    0    7   12    6    1    5    2  964 | 9 = 9

Accuracy

Label               AUC         # Pos     # Neg
0                   0.9998      980       9020
1                   0.9998      1135      8865
2                   0.9995      1032      8968
3                   0.9989      1010      8990
4                   0.9996      982       9018
5                   0.9994      892       9108
6                   0.9995      958       9042
7                   0.9986      1028      8972
8                   0.9988      974       9026
9                   0.9987      1009      8991

Average AUC: 0.9993

Killer Sudoku Solver (source)

A killer sudoku is a variant of the classical sudoku. Its main difference lays in the fact that in a killer sudoku no squares are filled with numbers beforehand. Instead some areas are indicated with additional numbers which represent the sum of the squared involved. A more in-depth description can be found here.

An example of a killer sudoku is the following:

This can be solved as:

The baseline solution provided in this repository uses backtracking to validate all options until a match is found. If no match is found, the solver first tries other combinations. For the example above, this results in the solution below, which matches the expected solution above.

 ╔═══╤═══╤═══╦═══╤═══╤═══╦═══╤═══╤═══╗ 
 ║ 2 │ 1 │ 5 ║ 6 │ 4 │ 7 ║ 3 │ 9 │ 8 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 3 │ 6 │ 8 ║ 9 │ 5 │ 2 ║ 1 │ 7 │ 4 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 7 │ 9 │ 4 ║ 3 │ 8 │ 1 ║ 6 │ 5 │ 2 ║ 
 ╠═══╪═══╪═══╬═══╪═══╪═══╬═══╪═══╪═══╣ 
 ║ 5 │ 8 │ 6 ║ 2 │ 7 │ 4 ║ 9 │ 3 │ 1 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 1 │ 4 │ 2 ║ 5 │ 9 │ 3 ║ 8 │ 6 │ 7 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 9 │ 7 │ 3 ║ 8 │ 1 │ 6 ║ 4 │ 2 │ 5 ║ 
 ╠═══╪═══╪═══╬═══╪═══╪═══╬═══╪═══╪═══╣ 
 ║ 8 │ 2 │ 1 ║ 7 │ 3 │ 9 ║ 5 │ 4 │ 6 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 6 │ 5 │ 9 ║ 4 │ 2 │ 8 ║ 7 │ 1 │ 3 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 4 │ 3 │ 7 ║ 1 │ 6 │ 5 ║ 2 │ 8 │ 9 ║ 
 ╚═══╧═══╧═══╩═══╧═══╧═══╩═══╧═══╧═══╝

Defining the cells

The solution does not use any type of image recognition, and instead expects a file to contain the values of the input sudoku. Using the example above, a board could look like the following. Each group is indicated using a character, where any character but = is allowed.

a,a,b,b,b,c,d,e,f
g,g,h,h,c,c,d,e,f
g,g,i,i,c,j,k,k,f
l,m,m,i,n,j,k,o,f
l,q,q,r,n,j,o,o,p
s,q,v,r,n,t,u,u,p
s,v,v,r,w,t,t,y,y
s,z,@,w,w,x,x,y,y
s,z,@,w,_,_,_,#,#

Here each character indicates a specific group (cage). However each cage should also have the value indicated, which is done by using simple properties below the board definition, for example:

a=3
b=15
c=22
d=4
e=16
f=15
g=25
h=17
...

Improving results

The baseline solution takes approximately 700ms to solve the above killer sudoku, with a total of 3,042,963 attempts. Each attempt indicating a single cell change. The improved, or optimised, solution expands on the backtracking functionality of the baseline solution by using a constraint satisfaction problem (CSP) method.

Constraint Satisfaction Problem

The first step to doing this is to avoid any attempts from being made if we already know a certain value will not be allowed. To do so each cell is first evaluated before an attempt is made, generating a list of potentially valid values. This is done by resolving any numbers already present in the current row, column, block, and (in case of a killer sudoku) cage. This way if the number 4 is already present in one of these elements, a default constraint is that our value cannot also be 4, allowing us to skip it so it will not be evaluated.

Forward checking

A second action taken is removing any values which would cause the next cell to have no remaining options left. Imagine a scenario where the current cell has the options [1,2] while the next cell only has the option [2]. We can predict that if we change the current cell value to 2, the next cell will fail causing the need to backtrack. When this happens we can eliminate the 2 from the options for the current cell, making it so the value is 1.

Informed search

Before the sudoku is attempted to be solved, each cell is assigned a set of possible values. Early on this will eliminate any values which cannot be present in the cell. For example because of final values in a row, column, or block, or the value being higher than the expected sum of a cage.

Improved results

The above improvements have been performed against three different sudoku's:

A completely empty sudoku without further constraints
A regular sudoku, with the below pattern:

0,0,0,9,2,0,0,0,4
0,7,0,0,0,0,8,5,0
0,0,0,6,0,5,0,0,0
4,0,0,8,0,0,3,0,5
5,0,0,0,0,0,0,0,1
2,0,7,0,0,1,0,0,6
0,0,0,4,0,8,0,0,0
0,3,0,0,0,0,0,4,0
6,0,0,0,1,3,0,0,0

A killer sudoku, with the pattern matching the before mentioned sudoku definition

The results are detailed below, showing the solution of both methods (if they are the same only one is shown) and the results of the baseline and optimised methods. Each result includes the time and attempts for both methods, with the last line indicating the improvement of the amount of attempts. This improvement is indicated as a percentage and is calculated as follows:

-(((optimised - baseline) / baseline) * 100)

Empty sudoku

 ╔═══╤═══╤═══╦═══╤═══╤═══╦═══╤═══╤═══╗ 
 ║ 1 │ 2 │ 3 ║ 4 │ 5 │ 6 ║ 7 │ 8 │ 9 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 4 │ 5 │ 6 ║ 7 │ 8 │ 9 ║ 1 │ 2 │ 3 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 7 │ 8 │ 9 ║ 1 │ 2 │ 3 ║ 4 │ 5 │ 6 ║ 
 ╠═══╪═══╪═══╬═══╪═══╪═══╬═══╪═══╪═══╣ 
 ║ 2 │ 1 │ 4 ║ 3 │ 6 │ 5 ║ 8 │ 9 │ 7 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 3 │ 6 │ 5 ║ 8 │ 9 │ 7 ║ 2 │ 1 │ 4 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 8 │ 9 │ 7 ║ 2 │ 1 │ 4 ║ 3 │ 6 │ 5 ║ 
 ╠═══╪═══╪═══╬═══╪═══╪═══╬═══╪═══╪═══╣ 
 ║ 5 │ 3 │ 1 ║ 6 │ 4 │ 2 ║ 9 │ 7 │ 8 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 6 │ 4 │ 2 ║ 9 │ 7 │ 8 ║ 5 │ 3 │ 1 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 9 │ 7 │ 8 ║ 5 │ 3 │ 1 ║ 6 │ 4 │ 2 ║ 
 ╚═══╧═══╧═══╩═══╧═══╧═══╩═══╧═══╧═══╝ 
Results:
- Not optimised: 0s 12ms and 3,195 attempts
- Optimised: 0s 9ms and 611 attempts
- Improvement: 80.88%

Regular sudoku

 ╔═══╤═══╤═══╦═══╤═══╤═══╦═══╤═══╤═══╗ 
 ║ 1 │ 5 │ 8 ║ 9 │ 2 │ 7 ║ 6 │ 3 │ 4 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 9 │ 7 │ 6 ║ 1 │ 3 │ 4 ║ 8 │ 5 │ 2 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 3 │ 2 │ 4 ║ 6 │ 8 │ 5 ║ 7 │ 1 │ 9 ║ 
 ╠═══╪═══╪═══╬═══╪═══╪═══╬═══╪═══╪═══╣ 
 ║ 4 │ 6 │ 1 ║ 8 │ 7 │ 9 ║ 3 │ 2 │ 5 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 5 │ 8 │ 3 ║ 2 │ 4 │ 6 ║ 9 │ 7 │ 1 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 2 │ 9 │ 7 ║ 3 │ 5 │ 1 ║ 4 │ 8 │ 6 ║ 
 ╠═══╪═══╪═══╬═══╪═══╪═══╬═══╪═══╪═══╣ 
 ║ 7 │ 1 │ 2 ║ 4 │ 9 │ 8 ║ 5 │ 6 │ 3 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 8 │ 3 │ 9 ║ 5 │ 6 │ 2 ║ 1 │ 4 │ 7 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 6 │ 4 │ 5 ║ 7 │ 1 │ 3 ║ 2 │ 9 │ 8 ║ 
 ╚═══╧═══╧═══╩═══╧═══╧═══╩═══╧═══╧═══╝ 
Results:
- Not optimised: 0s 7ms and 6,579 attempts
- Optimised: 0s 5ms and 911 attempts
- Improvement: 86.15%

Killer sudoku

 ╔═══╤═══╤═══╦═══╤═══╤═══╦═══╤═══╤═══╗ 
 ║ 2 │ 1 │ 5 ║ 6 │ 4 │ 7 ║ 3 │ 9 │ 8 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 3 │ 6 │ 8 ║ 9 │ 5 │ 2 ║ 1 │ 7 │ 4 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 7 │ 9 │ 4 ║ 3 │ 8 │ 1 ║ 6 │ 5 │ 2 ║ 
 ╠═══╪═══╪═══╬═══╪═══╪═══╬═══╪═══╪═══╣ 
 ║ 5 │ 8 │ 6 ║ 2 │ 7 │ 4 ║ 9 │ 3 │ 1 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 1 │ 4 │ 2 ║ 5 │ 9 │ 3 ║ 8 │ 6 │ 7 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 9 │ 7 │ 3 ║ 8 │ 1 │ 6 ║ 4 │ 2 │ 5 ║ 
 ╠═══╪═══╪═══╬═══╪═══╪═══╬═══╪═══╪═══╣ 
 ║ 8 │ 2 │ 1 ║ 7 │ 3 │ 9 ║ 5 │ 4 │ 6 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 6 │ 5 │ 9 ║ 4 │ 2 │ 8 ║ 7 │ 1 │ 3 ║ 
 ╟───┼───┼───╫───┼───┼───╫───┼───┼───╢ 
 ║ 4 │ 3 │ 7 ║ 1 │ 6 │ 5 ║ 2 │ 8 │ 9 ║ 
 ╚═══╧═══╧═══╩═══╧═══╧═══╩═══╧═══╧═══╝ 
Results:
- Not optimised: 0s 742ms and 3,042,963 attempts
- Optimised: 0s 701ms and 1,272,935 attempts
- Improvement: 58.17%

GuusLieben/Artificial-Intelligence-AEI