honggyukim's Stars
milvus-io/milvus
Milvus is a high-performance, cloud-native vector database built for scalable vector ANN search
Mozilla-Ocho/llamafile
Distribute and run LLMs with a single file.
microsoft/onnxruntime
ONNX Runtime: cross-platform, high performance ML inferencing and training accelerator
NVIDIA/open-gpu-kernel-modules
NVIDIA Linux open GPU kernel module source
giampaolo/psutil
Cross-platform lib for process and system monitoring in Python
grafana/pyroscope
Continuous Profiling Platform. Debug performance issues down to a single line of code
abetlen/llama-cpp-python
Python bindings for llama.cpp
intel-analytics/ipex-llm
Accelerate local LLM inference and finetuning (LLaMA, Mistral, ChatGLM, Qwen, Mixtral, Gemma, Phi, MiniCPM, Qwen-VL, MiniCPM-V, etc.) on Intel XPU (e.g., local PC with iGPU and NPU, discrete GPU such as Arc, Flex and Max); seamlessly integrate with llama.cpp, Ollama, HuggingFace, LangChain, LlamaIndex, vLLM, GraphRAG, DeepSpeed, Axolotl, etc
NVIDIA/FasterTransformer
Transformer related optimization, including BERT, GPT
XuehaiPan/nvitop
An interactive NVIDIA-GPU process viewer and beyond, the one-stop solution for GPU process management.
SchedMD/slurm
Slurm: A Highly Scalable Workload Manager
util-linux/util-linux
GStreamer/gstreamer
GStreamer open-source multimedia framework
linux-rdma/rdma-core
RDMA core userspace libraries and daemons
dstat-real/dstat
Versatile resource statistics tool (the real one, not the Red Hat clone)
likejazz/llama3.np
llama3.np is a pure NumPy implementation for Llama 3 model.
tenstorrent/tt-metal
:metal: TT-NN operator library, and TT-Metalium low level kernel programming model.
TPC-Council/HammerDB
HammerDB Database Load Testing and Benchmarking Tool
gregkh/kernel-development
Presentation on how the Linux kernel is developed
UpstageAI/dataverse
The Universe of Data. All about data, data science, and data engineering
scottchiefbaker/dool
Python3 compatible fork of dstat
likejazz/llama3.cuda
llama3.cuda is a pure C/CUDA implementation for Llama 3 model.
huggingface/optimum-benchmark
🏋️ A unified multi-backend utility for benchmarking Transformers, Timm, PEFT, Diffusers and Sentence-Transformers with full support of Optimum's hardware optimizations & quantization schemes.
UpstageAI/evalverse
The Universe of Evaluation. All about the evaluation for LLMs.
kaistAI/FLASK
[ICLR 2024 Spotlight] FLASK: Fine-grained Language Model Evaluation based on Alignment Skill Sets
csl-ajou/DeepPlan
Fast and Efficient Model Serving Using Multi-GPUs with Direct-Host-Access (ACM EuroSys '23)
hygoni/precise-leak-sanitizer
A dynamic memory leak detector that can pinpoint where memory is lost, using LLVM pass
NVIDIA/grace-kernel
Upstream Kernel with Grace upstream pending patches for partners. Patches include any bug fixes during Grace production while they await upstreaming.
sjp38/idle_page_tracking
schwabe/tglx-history