Daniel-NJ's Stars
langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
ggerganov/llama.cpp
LLM inference in C/C++
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
ZuodaoTech/everyone-can-use-english
人人都能用英语
facebook/zstd
Zstandard - Fast real-time compression algorithm
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
exaloop/codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
cocktailpeanut/dalai
The simplest way to run LLaMA on your local machine
huggingface/text-generation-inference
Large Language Model Text Generation Inference
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
whitead/paper-qa
LLM Chain for answering questions from documents with citations
dendibakh/perf-ninja
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
dendibakh/perf-book
The book "Performance Analysis and Tuning on Modern CPU"
zwang4/awesome-machine-learning-in-compilers
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
gpgpu-sim/gpgpu-sim_distribution
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVisoin, and an integrated energy model, GPUWattch.
lymslive/vimllearn
A book for VimL Script language
travisdowns/uarch-bench
A benchmark for low-level CPU micro-architectural features
jcxue/RDMA-Tutorial
A tutorial on RDMA based programming using code examples
jeffhammond/HPCInfo
Information about many aspects of high-performance computing. Wiki content moved to ~/docs.
It4innovations/hyperqueue
Scheduler for sub-node tasks for HPC systems with batch scheduling
simgrid/simgrid
MIRROR of the SimGrid framework, for the simulation of distributed applications (Clouds, HPC, Grids, IoT and others). Most of the dev occurs on FramaGit.
anlongfei/compilerbook
compilerbook
ROCm/rocHPL
High Performance Linpack for Next-Generation AMD HPC Accelerators
sstsimulator/sst-macro
SST Macro Element Library
sstsimulator/sst-dumpi
SST DUMPI Trace Library
LLNL/callpath
Library for representing callpaths consistently in distributed-memory performance tools.
Ezibenroc/simulating_mpi_applications_at_scale
Ezibenroc/Faithful-and-Efficient-Simulation-of-High-Performance-Linpack
Artifacts for the eponymous paper
It4innovations/Graph500
Repository contains scripts to run Graph500 benchmark on Salomon cluster