jiguanglizipao's Stars
pytorch-labs/attention-gym
Helpful tools and examples for working with flex-attention
anthropics/anthropic-cookbook
A collection of notebooks/recipes showcasing some fun and effective ways of using Claude.
NousResearch/DisTrO
Distributed Training Over-The-Internet
huggingface/datatrove
Freeing data processing from scripting madness by providing a set of platform-agnostic customizable pipeline processing blocks.
opendatalab/MinerU
A high-quality tool for converting PDF to Markdown and JSON. A one-stop, open-source, high-quality data extraction tool that converts PDF to Markdown and JSON formats.
AnswerDotAI/gpu.cpp
A lightweight library for portable low-level GPU computation using WebGPU.
labradon/yuvio
Python package for reading and writing uncompressed yuv image and video data.
lipracer/cuda-rt-hook
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
nitnelave/lru_cache
A C++ implementation of an LRU cache
mlfoundations/dclm
DataComp for Language Models
Jellyfish042/uncheatable_eval
Evaluating LLMs with Dynamic Data
ventoy/Ventoy
A new bootable USB solution.
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
pigirons/cpufp
A CPU tool for benchmarking peak floating-point performance
predibase/lorax
Multi-LoRA inference server that scales to 1000s of fine-tuned LLMs
epfl-dlab/transformers-CFG
🤗 A specialized library for integrating context-free grammars (CFG) in EBNF with the Hugging Face Transformers
16bit-ykiko/magic-cpp
A C++20 header-only library that supports powerful reflection for C++
microsoft/mscclpp
MSCCL++: A GPU-driven communication stack for scalable AI applications
microsoft/superbenchmark
A validation and profiling tool for AI infrastructure
microsoft/taccl
TACCL: Guiding Collective Algorithm Synthesis using Communication Sketches
oatpp/oatpp
🌱 Light and powerful C++ web framework for highly scalable and resource-efficient web applications. It's zero-dependency and easily portable.
mlc-ai/tokenizers-cpp
Universal cross-platform tokenizers binding to HF and sentencepiece
NVIDIA/cutlass
CUDA Templates for Linear Algebra Subroutines
AutoGPTQ/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
nemtrif/utfcpp
UTF-8 with C++ in a Portable Way
unicode-org/icu
The home of the ICU project source code.
vnmakarov/mir
A lightweight JIT compiler based on MIR (Medium Internal Representation), and a C11 JIT compiler and interpreter based on MIR
reorx/awesome-chatgpt-api
Curated list of apps and tools that not only use the new ChatGPT API, but also allow users to configure their own API keys, enabling free and on-demand usage of their own quota.
nebuly-ai/optimate
A collection of libraries to optimise AI model performance