irasin's Stars
CompVis/stable-diffusion
A latent text-to-image diffusion model
labmlai/annotated_deep_learning_paper_implementations
🧑🏫 60+ Implementations/tutorials of deep learning papers with side-by-side notes 📝; including transformers (original, xl, switch, feedback, vit, ...), optimizers (adam, adabelief, sophia, ...), gans(cyclegan, stylegan2, ...), 🎮 reinforcement learning (ppo, dqn), capsnet, distillation, ... 🧠
Stability-AI/stablediffusion
High-Resolution Image Synthesis with Latent Diffusion Models
chenzomi12/AISystem
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
ggerganov/ggml
Tensor library for machine learning
tpn/pdfs
Technically-oriented PDF Collection (Papers, Specs, Decks, Manuals, etc)
AntixK/PyTorch-VAE
A Collection of Variational Autoencoders (VAE) in PyTorch.
wenda-LLM/wenda
闻达:一个LLM调用平台。目标为针对特定环境的高效内容生成,同时考虑个人和中小企业的计算资源局限性,以及知识安全和私密性问题
hsutter/cppfront
A personal experimental C++ Syntax 2 -> Syntax 1 compiler
THUDM/VisualGLM-6B
Chinese and English multimodal conversational language model | 多模态中英双语对话语言模型
microsoft/AI-System
System for AI Education Resource.
ztxz16/fastllm
纯c++的全平台llm加速库,支持python调用,chatglm-6B级模型单卡可达10000+token / s,支持glm, llama, moss基座,手机端流畅运行
banach-space/llvm-tutor
A collection of out-of-tree LLVM passes for teaching and learning
DSXiangLi/DecryptPrompt
总结Prompt&LLM论文,开源数据&模型,AIGC应用
Neargye/nameof
Nameof operator for modern C++, simply obtain the name of a variable, type, function, macro, and enum
Evian-Zhang/llvm-ir-tutorial
LLVM IR入门指南
NervanaSystems/maxas
Assembler for NVIDIA Maxwell architecture
Liu-xiandong/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
MengRao/fmtlog
fmtlog is a performant fmtlib-style logging library with latency in nanoseconds.
xiaoweiChen/Learn-LLVM-12
《Learn LLVM 12》的非专业个人翻译
cloudcores/CuAssembler
An unofficial cuda assembler, for all generations of SASS, hopefully :)
xiaoweihan/Books
yzhaiustc/Optimizing-SGEMM-on-NVIDIA-Turing-GPUs
Optimizing SGEMM kernel functions on NVIDIA GPUs to a close-to-cuBLAS performance.
Yinghan-Li/YHs_Sample
Yinghan's Code Sample
lijiansong/clang-llvm-tutorial
clang & llvm examples, e.g. AST Interpreter, Function Pointer Analysis, Value Range Analysis, Data-Flow Analysis, Andersen Pointer Analysis, LLVM Backend...
BrianPulfer/PapersReimplementations
Personal short implementations of Machine Learning papers
anyscale/llm-continuous-batching-benchmarks
Ahdhn/CUDATemplate
Template for starting CUDA/C++ project using CMake with Github Action for CI
randyli/luac
hova88/CUDA-MatMul-Practice