Pinned Repositories
awesome-cuda-and-hpc
🚀🚀🚀 This repository lists some awesome public CUDA, cuda-python, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, TVM, MLIR, PTX and High Performance Computing (HPC) projects.
awesome-dotnet-machine-learning
A collection of some awesome public machine learning framework, tutorial, blogs, library and applications for .NET.
awesome-llm-and-aigc
🚀🚀🚀A collection of some awesome public projects about Large Language Model(LLM), Vision Language Model(VLM), Vision Language Action(VLA), AI Generated Content(AIGC), the related Datasets and Applications.
awesome-mojo-max-mlir
A collection of some awesome public MAX platform, Mojo programming language and Multi-Level IR Compiler Framework(MLIR) projects.
awesome-object-detection-datasets
A collection of some awesome public object detection and recognition datasets.
awesome-rust-list
This repository lists some awesome public Rust projects, Videos, Blogs and Jobs.
awesome-snn
🔥🔥🔥A collection of some awesome public SNN(Spiking Neural Network) projects.
awesome-yolo-object-detection
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects and the related object detection datasets.
cuda-beginner-course-cpp-version
bilibili视频【CUDA 12.x 并行编程入门(C++版)】配套代码
hello-algo-zig
Zig codes for the famous public project 《Hello, Algorithm》|《 Hello,算法 》 about data structures and algorithms.
coderonion's Repositories
coderonion/awesome-snn
🔥🔥🔥A collection of some awesome public SNN(Spiking Neural Network) projects.
coderonion/awesome-mojo-max-mlir
A collection of some awesome public MAX platform, Mojo programming language and Multi-Level IR Compiler Framework(MLIR) projects.
coderonion/aidea
AIdea 是一款支持 GPT 以及国产大语言模型通义千问、文心一言等,支持 Stable Diffusion 文生图、图生图、 SDXL1.0、超分辨率、图片上色的全能型 APP。
coderonion/AISystem
AISystem 主要是指AI系统,包括AI芯片、AI编译器、AI推理和训练框架等AI全栈底层技术
coderonion/pymlir
Python interface for MLIR - the Multi-Level Intermediate Representation
coderonion/AIFoundation
AIFoundation 主要是指AI系统遇到大模型,从底层到上层如何系统级地支持大模型训练和推理,全栈的核心技术。
coderonion/anylabeling
Effortless AI-assisted data labeling with AI support from YOLO, Segment Anything (SAM+SAM2), MobileSAM!!
coderonion/autodistill
Images to inference with no labeling (use foundation models to train supervised models).
coderonion/cuda-python
CUDA Python Low-level Bindings
coderonion/cutlass
CUDA Templates for Linear Algebra Subroutines
coderonion/DeepSeek-V3
coderonion/expo
An open-source framework for making universal native apps with React. Expo runs on Android, iOS, and the web.
coderonion/FlagGems
FlagGems is an operator library for large language models implemented in Triton Language.
coderonion/GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
coderonion/llama.cpp
Port of Facebook's LLaMA model in C/C++
coderonion/llm.c
LLM training in simple, raw C/CUDA
coderonion/llm_note
LLM notes, including model inference, transformer model structure, and llm framework code analysis notes
coderonion/LLMFarm
llama and other large language models on iOS and MacOS offline using GGML library.
coderonion/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS/Plugins/Artifacts). One-click FREE deployment of your private ChatGPT/ Claude application.
coderonion/matmul.mojo
High Performance Matrix Multiplication in Pure Mojo 🔥
coderonion/OpenEMMA
OpenEMMA, a permissively licensed open source reproduction of Waymo’s EMMA model.
coderonion/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
coderonion/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
coderonion/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
coderonion/tensorrtx
Implementation of popular deep learning networks with TensorRT network definition API
coderonion/tiny-flash-attention
flash attention tutorial written in python, triton, cuda, cutlass
coderonion/triton
Development repository for the Triton language and compiler
coderonion/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
coderonion/web3.js
Collection of comprehensive TypeScript libraries for Interaction with the Ethereum JSON RPC API and utility functions.
coderonion/yolort
yolort is a runtime stack for yolov5 on specialized accelerators such as tensorrt, libtorch, onnxruntime, tvm and ncnn.