/knapsack

Recursive Depth-First Search (DFS) algorithm solution to the knapsack problem hand-written in C.

Primary LanguageCMIT LicenseMIT

Knapsack

Recursive brute-force depth-first search (DFS) algorithm solution to the knapsack problem hand-written in C.

Usage

Download the binary files here.

Note: The numbers in the set can be negative and everything will still work as expected.

Linux Terminal

./[i686/x86_64]-redhat-linux [target] [set[0] set[1] ...]

Example

./x86_64-redhat-linux 42 1 -13 0 3 0 -1 4 2 8 9 10 24 36
./x86_64-redhat-linux 42 1 -13 0 3 0 -1 4 2 8 9 10 24 36

Subset of the above commands should be {8, 10, 24}.

Windows Command Line

.\[i686/x86_64]-windows.exe [target] [set[0] set[1] ...]

Example

.\x86_64-windows.exe 42 1 -13 0 3 0 -1 4 2 8 9 10 24 36
.\i686-windows.exe 42 1 -13 1 3 0 4 0 8 2 -1 9 10 24 36

Subset of the above commands should also be {8, 10, 24}.

Legend

  • [i686/x86_64] - CPU architecture (32-bit / 64-bit)

  • [target] - Value of subset's sum

  • [set[0] set[1] ...] - Set of numbers seperated by spaces; set[0] is the first number in the set, set[1] is the second, etc.

What

There is a set of numbers that you need to select a subset of to fill a knapsack with, where the sum of the numbers in the subset is equal to the maximum the knapsack can hold.

To solve the knapsack problem, find the optimal way (assuming there is only one) to fill the knapsack, where the subset has the least amount of elements.

Why

For fun; to practice writing, debugging, and compiling programs in a low-level language; and to try solving algorithmic problems using recursion.

How

Below is an explanation of how the program works.

  • The program first checks if the set is sorted, and sorts it (using an iterative implemention of selection sort) if it is not. This is to ensure the subset that is found is one with the least amount of elements.
  • Next, it calls the function that goes through and filters all the possible subsets recursively, choosing either to take or skip each element in the set. This creates 2 'branches' with each function call (therefore making the algorithm have a time complexity of O(2^n)).
  • A more precise description of the previous bullet-point would be that it technically branches all the way down to the subset with 0 elements first, goes back 1 branch to take the right-most (largest) element in the sorted set, and (if that is not the optimal subset) then goes back up another branch to repeat. This is repeated for every branch until it reaches the root, in which it starts the same process all over again but with the first element of the set being in the subset. The fact that it travels down to the leaves first before exploring other branches is why this algorithm would classified as DFS.
  • The sum of the elements in the subset are compared to the target sum (maximum knapsack capacity) until the branches reach a dead end (aptly named the 'leaves' of the algorithmic 'tree'), and returned back up the tree until it reaches the root of the function tree.
  • Due to how arrays having fixed size in C, a zero is used to denote the end of the subset (an optimal subset should have no zeros, as zeros contribute nothing to the sum) if the subset is not full (the same size as the original set) so that random numbers from the leftover allocated array memory are not displayed when we later loop through the array to print out the numbers in the subset.
  • The root function then returns the subset for the main program to print out on the command-line terminal / console using a for-loop in another function.

Limitation

Below is a limitation of this program.

  • This program only handles values of type int (integer), but it can be easily modified to allow more (such as type float (floating-point) and even type char (character)).