Pinned Repositories
100-shell-script-examples
Collection of shell scripts found on the internet
abc
code snippets
Android-1
Android related examples
Android-App-Development
This repository contains all the source code examples and the FAQ for our Android App Development Specialization for Coursera
android-fundamentals
Python
学习Python过程中的练习。
ROS-Learning
学习ROS过程中的练习代码
slam14
高翔博士书籍《视觉SLAM十四讲》书上练习及部分习题
tingta
通过用户朋友圈分享的网易云音乐来获取网易云音乐用户名
vio
从零手写VIO课程
foreverlms's Repositories
foreverlms/abc
code snippets
foreverlms/armnn
Arm NN ML Software. The code here is a read-only mirror of https://review.mlplatform.org/admin/repos/ml/armnn
foreverlms/awesome-tensor-compilers
A list of awesome compiler projects and papers for tensor computation and deep learning.
foreverlms/cfx-article-src
foreverlms/composable_kernel
Composable Kernel: Performance Portable Programming Model for Machine Learning Tensor Operators
foreverlms/CUDA-Learn-Notes
📚150+ Tensor/CUDA Cores Kernels, ⚡️flash-attn-mma, ⚡️hgemm with WMMA, MMA and CuTe (98%~100% TFLOPS of cuBLAS/FA2 🎉🎉).
foreverlms/cuda-training-series
Training materials associated with NVIDIA's CUDA Training Series (www.olcf.ornl.gov/cuda-training-series/)
foreverlms/cuda_sgemm
foreverlms/cute-gemm-101
foreverlms/Cute-Learning
foreverlms/cutlass
CUDA Templates for Linear Algebra Subroutines
foreverlms/cutlass-kernels
foreverlms/dev-sidecar
开发者边车,github打不开,github加速,git clone加速,git release下载加速,stackoverflow加速
foreverlms/flash-attention
Fast and memory-efficient exact attention
foreverlms/flashinfer
FlashInfer: Kernel Library for LLM Serving
foreverlms/folly
An open-source C++ library developed and used at Facebook.
foreverlms/foreverlms.github.io
个人博客,参考的模板是izhengfan.github.io
foreverlms/gdb-dashboard
Modular visual interface for GDB in Python
foreverlms/INT8-Flash-Attention-FMHA-Quantization
foreverlms/lightllm
LightLLM is a Python-based LLM (Large Language Model) inference and serving framework, notable for its lightweight design, easy scalability, and high-speed performance.
foreverlms/llm-numbers
Numbers every LLM developer should know
foreverlms/llvm-project
The LLVM Project is a collection of modular and reusable compiler and toolchain technologies. Note: the repository does not accept github pull requests at this moment. Please submit your patches at http://reviews.llvm.org.
foreverlms/maxas
Assembler for NVIDIA Maxwell architecture
foreverlms/MegEngine
MegEngine 是一个快速、可拓展、易于使用且支持自动求导的深度学习框架
foreverlms/MegPeak
foreverlms/MNN
MNN is a blazing fast, lightweight deep learning framework, battle-tested by business-critical use cases in Alibaba
foreverlms/perf-ninja
This is an online course where you can learn and master the skill of low-level performance analysis and tuning.
foreverlms/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
foreverlms/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
foreverlms/ZhiLight
A highly optimized inference acceleration engine for Llama and its variants.