Pinned Repositories
tvm
Open deep learning compiler stack for CPU, GPU and specialized accelerators
TornadoVM
TornadoVM: A practical and efficient heterogeneous programming framework for managed languages
academicpages.github.io
GitHub Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
aparapi
Official AMD Aparapi repository
aparapi-examples
A framework for executing native Java code on the GPU.
collage-non-tvm-fork
Collage non-forked version for POC
llama-shepherd-cli
A CLI to manage, install, and configure llama inference implementations in multiple languages
llama2.tornadovm.java
An extension to Llama2.java implementation accelerated with GPUs, using TornadoVM
mikepapadim's Repositories
mikepapadim/llama-shepherd-cli
A CLI to manage, install, and configure llama inference implementations in multiple languages
mikepapadim/llama2.tornadovm.java
An extension to Llama2.java implementation accelerated with GPUs, using TornadoVM
mikepapadim/collage-non-tvm-fork
Collage non-forked version for POC
mikepapadim/commitgpt
Automatically generate commit messages using ChatGPT
mikepapadim/devoxx2024
mikepapadim/fzf
:cherry_blossom: A command-line fuzzy finder
mikepapadim/gpt-engineer
Specify what you want it to build, the AI asks for clarification, and then builds it.
mikepapadim/java
Java bindings for TensorFlow
mikepapadim/Jlama
Jlama is a pure Java implementation of an LLM inference engine.
mikepapadim/jvm_allocation_ref
A toy application comparing primitive array allocation on heap with Panama off-heap memory segment allocation
mikepapadim/kernl
Kernl lets you run PyTorch transformer models several times faster on GPU with a single line of code, and is designed to be easily hackable.
mikepapadim/Llama-2-Onnx
mikepapadim/llama2.c
Inference Llama 2 in one file of pure C
mikepapadim/llama2.java
Inference Llama 2 in one file of pure Java
mikepapadim/llama3.java
Practical Llama 3 inference in Java
mikepapadim/llamafile
Distribute and run LLMs with a single file.
mikepapadim/llm-apps-java-spring-ai
Samples showing how to build Java applications powered by Generative AI and LLMs using Spring AI and Spring Boot.
mikepapadim/mikepapadim
Custom GitHub profile
mikepapadim/mlir-tutorial
mikepapadim/models
A collection of pre-trained, state-of-the-art models in the ONNX format
mikepapadim/rjvm
A tiny JVM written in Rust. Learning project
mikepapadim/sd4j
Stable diffusion pipeline in Java using ONNX Runtime
mikepapadim/Simple
A Simple showcase for the Sea-of-Nodes compiler IR
mikepapadim/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
mikepapadim/TinyLlama
The TinyLlama project is an open endeavor to pretrain a 1.1B Llama model on 3 trillion tokens.
mikepapadim/torchscript-to-tvm
mikepapadim/TornadoVM
Tornado: A practical and efficient heterogeneous programming framework for managed languages
mikepapadim/tutorials
Tutorials for creating and using ONNX models
mikepapadim/tvm
Open deep learning compiler stack for CPU, GPU and specialized accelerators
mikepapadim/wasmtime
A fast and secure runtime for WebAssembly