qinsiyuan-cool
I am an undergraduate majoring in software engineering. Welcome communication and guidance.
qinsiyuan-cool's Stars
karpathy/llm.c
LLM training in simple, raw C/CUDA
Liu-xiandong/How_to_optimize_in_GPU
This is a series of GPU optimization topics. Here we will introduce how to optimize the CUDA kernel in detail. I will introduce several basic kernel optimizations, including: elementwise, reduce, sgemv, sgemm, etc. The performance of these kernels is basically at or near the theoretical limit.
Cjkkkk/CUDA_gemm
A simple high performance CUDA GEMM implementation.
AIoT-MLSys-Lab/Efficient-LLMs-Survey
[TMLR 2024] Efficient Large Language Models: A Survey
wgwang/awesome-LLMs-In-China
**大模型
forthespada/InterviewGuide
🔥🔥「InterviewGuide」是阿秀从校园->职场多年计算机自学过程的记录以及学弟学妹们计算机校招&秋招经验总结文章的汇总,包括但不限于C/C++ 、Golang、JavaScript、Vue、操作系统、数据结构、计算机网络、MySQL、Redis等学习总结,坚持学习,持续成长!
guaguaupup/cpp_interview
c++后台服务器开发面经或八股总结!(有深度有广度,和仅有概念的总结文章不同!)
nndeploy/nndeploy
nndeploy is an end-to-end model inference and deployment framework. It aims to provide users with a powerful, easy-to-use, high-performance, and mainstream framework compatible model inference and deployment experience.一款端到端的模型推理和部署框架。它旨在为用户提供功能强大、简单易用、高性能且兼容主流框架的模型推理和部署体验。
Tony-Tan/CUDA_Freshman
sunface/rust-course
“连续八年成为全世界最受喜爱的语言,无 GC 也无需手动内存管理、极高的性能和安全性、过程/OO/函数式编程、优秀的包管理、JS 未来基石" — 工作之余的第二语言来试试 Rust 吧。本书拥有全面且深入的讲解、生动贴切的示例、德芙般丝滑的内容,这可能是目前最用心的 Rust 中文学习教程 / Book
sunface/rust-by-practice
Learning Rust By Practice, narrowing the gap between beginner and skilled-dev through challenging examples, exercises and projects.
jafioti/luminal
Deep learning at the speed of light.
BBuf/giantpandacv.com
www.giantpandacv.com
mlabonne/llm-course
Course to get into Large Language Models (LLMs) with roadmaps and Colab notebooks.
kamranahmedse/developer-roadmap
Interactive roadmaps, guides and other educational content to help developers grow in their careers.
AniZpZ/AutoSmoothQuant
An easy-to-use package for implementing SmoothQuant for LLMs
xlite-dev/CUDA-Learn-Notes
📚Modern CUDA Learn Notes: 200+ Tensor/CUDA Cores Kernels, FA2, HGEMM via MMA and CuTe (~99% TFLOPS of cuBLAS/FA2 🎉).
pku-minic/koopa
Library for generating/parsing/optimizing Koopa IR.
BBuf/tvm_mlir_learn
compiler learning resources collect.
royalneverwin/My-Compiler
实现从SysY语言到riscv指令的编译器
chenguokai/acwj-rv
munificent/craftinginterpreters
Repository for the book "Crafting Interpreters"
GuoYaxiang/craftinginterpreters_zh
手撸解释器教程《Crafting Interpreters》中文翻译
starfive-tech/Fedora_on_StarFive
BlankHeart0/PJDCc
A toy C compiler to x86
riscv-non-isa/riscv-elf-psabi-doc
A RISC-V ELF psABI Document
sunshaoce/rvcc-course
sunshaoce/rvcc
本课程基于Rui的chibicc,@sunshaoce和@ksco将其由原来的X86架构改写为RISC-V 64架构,同时加入了大量的中文注释,并且配有316节对应于每一个commit的课程,帮助读者可以层层推进、逐步深入的学习编译器的构造。
ChunelFeng/CGraph
【A common used C++ DAG framework】 一个通用的、无三方依赖的、跨平台的、收录于awesome-cpp的、基于流图的并行计算框架。欢迎star & fork & 交流
cnrv/riscv-soc-book
关于RISC-V你所需要知道的一切