This project is an experimental playground for learning by doing. The primary focus is on the following areas:
- Machine Learning
- PyTorch
- Kotlin
- New Java APIs (Vector API, Virtual Threads, Foreign Function & Memory API); enabling the incubator module in Gradle is sketched after this list
- Effective Java
- https://foojay.io/today/how-to-run-the-java-incubator-module-from-the-command-line-and-intellij-idea/
- https://stackoverflow.com/questions/70390734/how-can-i-use-the-incubating-vector-api-from-kotlin-in-a-gradle-build
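The two links above cover enabling the incubating Vector API module. A minimal Gradle (Kotlin DSL) sketch of the usual setup, assuming a standard build.gradle.kts with the application plugin (task names and layout are the common defaults, not taken from this repo):

```kotlin
// build.gradle.kts (sketch): expose jdk.incubator.vector to compilation and execution.
// For Kotlin sources, the linked Stack Overflow post discusses the compiler-side setup.
tasks.withType<JavaCompile>().configureEach {
    options.compilerArgs.add("--add-modules=jdk.incubator.vector")
}
tasks.withType<JavaExec>().configureEach {           // covers the `run` task
    jvmArgs("--add-modules", "jdk.incubator.vector")
}
tasks.withType<Test>().configureEach {
    jvmArgs("--add-modules", "jdk.incubator.vector")
}
```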
Implement NumPy-style broadcasting and dot product in pure Java/Kotlin, and use SIMD to accelerate the computation (see the Vector API sketch after the list below).
- https://ajcr.net/stride-guide-part-1/
- https://numpy.org/doc/1.20/reference/internals.html
- https://numpy.org/doc/1.20/reference/arrays.ndarray.html
- https://www.reddit.com/r/java/comments/17pkcgx/how_fast_can_we_do_matrix_multiplications_in_pure/
- https://www.elastic.co/blog/accelerating-vector-search-simd-instructions
- Do we need Virtual Threads to parallelize matrix multiplication? No: Virtual Threads are designed for I/O-bound tasks, while matrix multiplication is CPU-bound, so a small pool of platform threads is the better fit. Refs:
- https://github.com/lessthanoptimal/VectorPerformance/blob/master/src/main/java/benchmark/MatrixMultiplication.java
- https://www.baeldung.com/jvm-tiered-compilation
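A sketch of a SIMD dot product with the Vector API (a hypothetical free-standing helper over contiguous FloatArrays, not the repo's actual NDArray code):

```kotlin
import jdk.incubator.vector.FloatVector
import jdk.incubator.vector.VectorOperators
import jdk.incubator.vector.VectorSpecies

private val SPECIES: VectorSpecies<Float> = FloatVector.SPECIES_PREFERRED

fun dot(a: FloatArray, b: FloatArray): Float {
    require(a.size == b.size)
    var acc = FloatVector.zero(SPECIES)
    var i = 0
    val upper = SPECIES.loopBound(a.size)
    while (i < upper) {
        val va = FloatVector.fromArray(SPECIES, a, i)
        val vb = FloatVector.fromArray(SPECIES, b, i)
        acc = va.fma(vb, acc)          // acc += va * vb, lane-wise fused multiply-add
        i += SPECIES.length()
    }
    var sum = acc.reduceLanes(VectorOperators.ADD)
    while (i < a.size) {               // scalar tail for the remaining elements
        sum += a[i] * b[i]
        i++
    }
    return sum
}
```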
Principle: make matrix multiplication as fast as possible, and keep the other operations elegant.
The original plan was to implement backward using closures, the same way micrograd does. However, after some investigation I suspect that closures may cause performance issues in Kotlin, so we switched to the tinygrad approach (sketched after the links below).
- https://proandroiddev.com/kotlin-vs-java-the-inside-scoop-on-closures-ae9a8d6ddba5
- https://stackoverflow.com/questions/48140788/kotlin-higher-order-functions-costs
- https://magdamiu.medium.com/high-performance-with-idiomatic-kotlin-d52e099d0df0
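A rough sketch of the two styles, using scalar values and hypothetical class names (the real implementation works on NDArrays):

```kotlin
// micrograd style: each op captures its backward pass in a closure,
// so every node in the graph allocates a lambda object.
class CTensor(val data: Float) {
    var grad = 0f
    var backwardFn: () -> Unit = {}

    operator fun plus(other: CTensor): CTensor {
        val out = CTensor(data + other.data)
        out.backwardFn = {          // captures this, other, and out
            this.grad += out.grad
            other.grad += out.grad
        }
        return out
    }
}

// tinygrad style: each op is a Function object holding its context explicitly,
// with backward as a plain virtual method instead of a captured lambda.
class FTensor(val data: Float) {
    var grad = 0f
    var creator: Fn? = null
}

abstract class Fn(val parents: List<FTensor>) {
    abstract fun backward(outGrad: Float)
}

class Add(a: FTensor, b: FTensor) : Fn(listOf(a, b)) {
    fun forward(): FTensor =
        FTensor(parents[0].data + parents[1].data).also { it.creator = this }

    override fun backward(outGrad: Float) {
        parents.forEach { it.grad += outGrad }   // d(a+b)/da = d(a+b)/db = 1
    }
}
```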
Train MNIST:
./gradlew run --args=mnist
https://github.com/tinygrad/tinygrad/blob/91a352a8e2697828a4b1eafa2bdc1a9a3b7deffa/test/mnist.py
- Predict: input.dot(l1).relu().dot(l2).logSoftmax()
- Loss: NLL loss or CrossEntropyLoss (see the sketch below)
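A sketch of the log-softmax and NLL-loss math on plain FloatArrays (hypothetical helpers, not the NDArray API that the chained predict call uses):

```kotlin
import kotlin.math.exp
import kotlin.math.ln

// logSoftmax over one row of logits, computed in a numerically stable way.
fun logSoftmax(row: FloatArray): FloatArray {
    val max = row.maxOrNull()!!
    val logSumExp = max + ln(row.sumOf { exp((it - max).toDouble()) }).toFloat()
    return FloatArray(row.size) { row[it] - logSumExp }
}

// NLL loss over log-softmax outputs: mean of -logProbs[i][label[i]].
fun nllLoss(logProbs: Array<FloatArray>, labels: IntArray): Float {
    var sum = 0f
    for (i in labels.indices) sum -= logProbs[i][labels[i]]
    return sum / labels.size
}
```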
Print the image in the terminal (rendering sketch below):
- https://stackoverflow.com/questions/5762491/how-to-print-color-in-console-using-system-out-println
- https://gist.github.com/fnky/458719343aabd01cfb17a3a4f7296797
- https://github.com/Nellousan/px2ansi/blob/main/px2ansi.py
Example output: a handwritten digit rendered in the terminal with Unicode block characters.
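A sketch of one common approach from the links above: 24-bit ANSI colors plus the upper-half-block character, so two pixel rows share one terminal row (assumes a grayscale image with values in [0, 1]):

```kotlin
fun printImage(pixels: Array<FloatArray>) {
    fun gray(v: Float): Int = (v.coerceIn(0f, 1f) * 255).toInt()
    for (y in pixels.indices step 2) {
        val sb = StringBuilder()
        for (x in pixels[y].indices) {
            val top = gray(pixels[y][x])
            val bottom = if (y + 1 < pixels.size) gray(pixels[y + 1][x]) else 0
            sb.append("\u001B[38;2;$top;$top;${top}m")            // foreground = upper pixel
            sb.append("\u001B[48;2;$bottom;$bottom;${bottom}m")   // background = lower pixel
            sb.append('▀')                                        // upper half block
        }
        sb.append("\u001B[0m")                                    // reset colors at end of row
        println(sb)
    }
}
```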
Progress bar:
- https://medium.com/javarevisited/how-to-display-progressbar-on-the-standard-console-using-java-18f01d52b30e
- https://github.com/ctongfei/progressbar/blob/main/src/main/java/me/tongfei/progressbar/ProgressBarStyle.java
Training 1950/2500 78%: │███████████████████▌ │ 4.316723s loss 0.041252743, accuracy 0.90625
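A sketch of rendering a bar like the line above, redrawn in place with a carriage return (widths and labels are illustrative, not the project's actual formatting code):

```kotlin
fun renderBar(step: Int, total: Int, width: Int = 24): String {
    val ratio = step.toFloat() / total
    val eighths = (ratio * width * 8).toInt()        // bar resolution of 1/8 block
    val full = eighths / 8
    val partialChars = " ▏▎▍▌▋▊▉"
    val partial = if (full < width) partialChars[eighths % 8].toString() else ""
    val bar = "█".repeat(full) + partial
    return "Training $step/$total ${(ratio * 100).toInt()}%: │" + bar.padEnd(width) + "│"
}

fun main() {
    for (step in 1..2500) {
        print("\r" + renderBar(step, 2500))          // overwrite the same console line
        Thread.sleep(1)
    }
    println()
}
```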
- Get sub NDArray by indices
- Make matmul work for 1-D arrays; reference implementation in mlx
- Unroll the matmul inner loop, as in llama2.java (see the sketch after this list)
- Benchmark with flame graph & memory usage
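A sketch of the unrolling idea in the spirit of llama2.java's matrix-vector multiply: row-major FloatArrays with the inner dot product split across four independent accumulators (k is assumed to be a multiple of 4 to keep the sketch short):

```kotlin
fun matVecUnrolled(w: FloatArray, x: FloatArray, out: FloatArray, m: Int, k: Int) {
    for (i in 0 until m) {
        var s0 = 0f; var s1 = 0f; var s2 = 0f; var s3 = 0f
        val base = i * k
        var j = 0
        while (j < k) {                  // 4x unrolled inner loop
            s0 += w[base + j] * x[j]
            s1 += w[base + j + 1] * x[j + 1]
            s2 += w[base + j + 2] * x[j + 2]
            s3 += w[base + j + 3] * x[j + 3]
            j += 4
        }
        out[i] = (s0 + s1) + (s2 + s3)   // combine the independent accumulators
    }
}
```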
References:
- https://github.com/pytorch/pytorch
- https://pytorch.org/docs/stable/tensors.html
- https://github.com/karpathy/nn-zero-to-hero
- http://blog.ezyang.com/2019/05/pytorch-internals/
- https://blog.paperspace.com/pytorch-101-advanced/
- https://blog.christianperone.com/2019/02/pydata-montreal-slides-for-the-talk-pytorch-under-the-hood/
- https://blog.christianperone.com/2018/03/pytorch-internal-architecture-tour/
- https://github.com/tinygrad/tinygrad/ (early stage commit)
- https://github.com/karpathy/micrograd
- https://github.com/mikex86/scicore
- https://github.com/deepjavalibrary/djl
- https://github.com/lessthanoptimal/ejml
- https://github.com/ml-explore/mlx
- https://github.com/mukel/llama2.java
- https://github.com/padreati/rapaio
- https://github.com/Kotlin/multik