irasin

everything for fun

irasin's Stars

Textualize/rich
Rich is a Python library for rich text and beautiful formatting in the terminal.
Language:Python50.1k 538 1.4k1.8k
xiaolincoder/CS-Base
图解计算机网络、操作系统、计算机组成、数据库，共 1000 张图 + 50 万字，破除晦涩难懂的计算机基础知识，让天下没有难懂的八股文！🚀 在线阅读：https://xiaolincoding.com
14.8k 96 1331.9k
adam-maj/tiny-gpu
A minimal GPU design in Verilog to learn how GPUs work from the ground up
Language:SystemVerilog7.2k 69 24547
HazyResearch/ThunderKittens
Tile primitives for speedy kernels
Language:Cuda1.8k 30 3186
DefTruth/CUDA-Learn-Notes
📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
Language:Cuda1.8k 14 9187
ray-project/llm-applications
A comprehensive guide to building RAG-based LLM applications for production.
Language:Jupyter Notebook1.7k 18 12234
tip-of-the-week/cpp
C++ Tip Of The Week
Language:Python1.6k 140 573
boost-ext/ut
C++20 μ(micro)/Unit Testing Framework
Language:C++1.3k 29 166122
gpgpu-sim/gpgpu-sim_distribution
GPGPU-Sim provides a detailed simulation model of contemporary NVIDIA GPUs running CUDA and/or OpenCL workloads. It includes support for features such as TensorCores and CUDA Dynamic Parallelism as well as a performance visualization tool, AerialVisoin, and an integrated energy model, GPUWattch.
Language:C++1.2k 46 171517
lizhe2004/Awesome-LLM-RAG-Application
the resources about the application based on LLM with RAG pattern
965 15 156
ilqvya/random
Random for modern C++ with convenient API
Language:C++914 33 1681
banach-space/clang-tutor
A collection of out-of-tree Clang plugins for teaching and learning
Language:C++712 20 1764
owenliang/qwen-vllm
通义千问VLLM推理部署DEMO
Language:Python482 7 971
PacktPublishing/Learn-LLVM-12
Learn LLVM 12, published by Packt
Language:C++477 12 17103
Cjkkkk/CUDA_gemm
A simple high performance CUDA GEMM implementation.
Language:Cuda339 5 337
NVIDIA/nvbandwidth
A tool for bandwidth measurements on NVIDIA GPUs.
Language:C++337 12 1830
blackinkkkxi/RAG_langchain
一个基于langchain实现RAG的简单示例
Language:Jupyter Notebook335 2 156
Bruce-Lee-LY/cuda_hgemm
Several optimization methods of half-precision general matrix multiplication (HGEMM) using tensor core with WMMA API and MMA PTX instruction.
Language:Cuda324 4 1268
KnowingNothing/MatmulTutorial
A Easy-to-understand TensorOp Matmul Tutorial
Language:C++301 9 1132
te42kyfo/gpu-benches
collection of benchmarks to measure basic GPU capabilities
Language:Jupyter Notebook269 9 1141
codeplaysoftware/portBLAS
An implementation of BLAS using the SYCL open standard.
Language:C++261 25 4750
franneck94/CppProjectTemplate
C++ project template with unit-tests, documentation, ci-testing and workflows.
Language:CMake241 15 491
TiledTensor/TiledCUDA
TiledCUDA is a highly efficient kernel template library designed to elevate CUDA C’s level of abstraction for processing tiles.
Language:C++172 3 6410
hunterzju/llvm-tutorial
llvm-tutorial文档，翻译以及代码仓库
Language:C++157 6 125
wzsh/wmma_tensorcore_sample
Matrix Multiply-Accumulate with CUDA and WMMA( Tensor Core)
Language:Cuda122 4 219
nicolaswilde/cuda-tensorcore-hgemm
Language:Cuda118 5 021
AyakaGEMM/Hands-on-GEMM
Language:Cuda101 2 415
wmmae/wmma_extension
An extension library of WMMA API (Tensor Core API)
Language:Cuda87 8 414
MARD1NO/CUDA-PPT
79 2 012
NVIDIA/online-softmax
Benchmark code for the "Online normalizer calculation for softmax" paper
Language:Cuda61 6 07

irasin

irasin's Stars

Textualize/rich

xiaolincoder/CS-Base

adam-maj/tiny-gpu

HazyResearch/ThunderKittens

DefTruth/CUDA-Learn-Notes

ray-project/llm-applications

tip-of-the-week/cpp

boost-ext/ut

gpgpu-sim/gpgpu-sim_distribution

lizhe2004/Awesome-LLM-RAG-Application

ilqvya/random

banach-space/clang-tutor

owenliang/qwen-vllm

PacktPublishing/Learn-LLVM-12

Cjkkkk/CUDA_gemm

NVIDIA/nvbandwidth

blackinkkkxi/RAG_langchain

Bruce-Lee-LY/cuda_hgemm

KnowingNothing/MatmulTutorial

te42kyfo/gpu-benches

codeplaysoftware/portBLAS

franneck94/CppProjectTemplate

TiledTensor/TiledCUDA

hunterzju/llvm-tutorial

wzsh/wmma_tensorcore_sample

nicolaswilde/cuda-tensorcore-hgemm

AyakaGEMM/Hands-on-GEMM

wmmae/wmma_extension

MARD1NO/CUDA-PPT

NVIDIA/online-softmax