Daniel-NJ

Daniel-NJ's Stars

langchain-ai/langchain
🦜🔗 Build context-aware reasoning applications
Language: Jupyter Notebook 92.3k 679 7.5k 14.7k
nomic-ai/gpt4all
GPT4All: Run Local LLMs on Any Device. Open-source and available for commercial use.
Language: C++ 69.1k 638 1.8k 7.6k
ggerganov/llama.cpp
LLM inference in C/C++
Language: C++ 64.9k 542 3.7k 9.3k
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language: C 34.4k 314 1.3k 3.5k
ZuodaoTech/everyone-can-use-english
Everyone can use English
Language: TypeScript 24.4k 275 3423.7k
facebook/zstd
Zstandard - Fast real-time compression algorithm
Language: C 23.2k 410 1.4k 2.1k
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
Language: Python 18.6k 170 1.3k 1.5k
exaloop/codon
A high-performance, zero-overhead, extensible Python compiler using LLVM
Language: C++ 14.3k 140 408502
cocktailpeanut/dalai
The simplest way to run LLaMA on your local machine
Language: CSS 13.1k 149 3811.4k
huggingface/text-generation-inference
Large Language Model Text Generation Inference
Language: Python 8.8k 99 1.3k 1k
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
Language: Python 8k 139 3.7k 1.4k
whitead/paper-qa
LLM Chain for answering questions from documents with citations
Language: Python 3.8k 40 141365
dendibakh/perf-ninja
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
Language: C++ 2.4k 107 33206
dendibakh/perf-book
The book "Performance Analysis and Tuning on Modern CPU"
Language: TeX 2.1k 62 20154
zwang4/awesome-machine-learning-in-compilers
Must read research papers and links to tools and datasets that are related to using machine learning for compilers and systems optimisation
1.4k 69 0160
gpgpu-sim/gpgpu-sim_distribution
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVision, and an integrated energy model, GPUWattch.
Language: C++ 1.1k 46 168500
lymslive/vimllearn
A book for VimL Script language
Language: Vim Script 887 26 3117
travisdowns/uarch-bench
A benchmark for low-level CPU micro-architectural features
Language: C++ 678 34 8559
jcxue/RDMA-Tutorial
A tutorial on RDMA based programming using code examples
Language: C 489 18 9145
jeffhammond/HPCInfo
Information about many aspects of high-performance computing. Wiki content moved to ~/docs.
Language: C 273 27 157
It4innovations/hyperqueue
Scheduler for sub-node tasks for HPC systems with batch scheduling
Language: Rust 272 7 21221
simgrid/simgrid
MIRROR of the SimGrid framework, for the simulation of distributed applications (Clouds, HPC, Grids, IoT and others). Most of the dev occurs on FramaGit.
Language: C++ 165 18 23891
anlongfei/compilerbook
compilerbook
43 2 026
ROCm/rocHPL
High Performance Linpack for Next-Generation AMD HPC Accelerators
Language: C++ 41 16 420
sstsimulator/sst-macro
SST Macro Element Library
Language: C++ 34 30 30941
sstsimulator/sst-dumpi
SST DUMPI Trace Library
Language: C 14 27 1316
LLNL/callpath
Library for representing callpaths consistently in distributed-memory performance tools.
Language: Shell 7 6 25
Ezibenroc/simulating_mpi_applications_at_scale
Language: TeX 2 2 01
Ezibenroc/Faithful-and-Efficient-Simulation-of-High-Performance-Linpack
Artifacts for the eponymous paper
Language: Jupyter Notebook 1 2 0
It4innovations/Graph500
Repository contains scripts to run Graph500 benchmark on Salomon cluster
Language: Shell 1