Pinned Repositories
awesome-cuda-triton-hpc
🔥🔥🔥 A collection of some awesome public CUDA, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, MLIR and High Performance Computing (HPC) projects.
awesome-dotnet-machine-learning
A collection of some awesome public machine learning framework, tutorial, blogs, library and applications for .NET.
awesome-llm-and-aigc
🚀🚀🚀A collection of some wesome public projects about Large Language Model(LLM), Visual Language Model(VLM), AI Generated Content(AIGC), the related Datasets and Applications.
awesome-object-detection-datasets
A collection of some awesome public object detection and recognition datasets.
awesome-rust-list
This repository lists some awesome public Rust projects, Videos, Blogs and Jobs.
awesome-snn
🔥🔥🔥A collection of some awesome public SNN(Spiking Neural Network) projects.
awesome-yolo-object-detection
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects.
cuda-beginner-course-cpp-version
bilibili视频【CUDA 12.x 并行编程入门(C++版)】配套代码
cuda-beginner-course-python-version
bilibili视频【CUDA 12.x 并行编程入门(Python版)】配套代码
hello-algo-zig
Zig codes for the famous public project 《Hello, Algorithm》|《 Hello,算法 》 about data structures and algorithms.
coderonion's Repositories
coderonion/awesome-yolo-object-detection
🚀🚀🚀 A collection of some awesome public YOLO object detection series projects.
coderonion/awesome-llm-and-aigc
🚀🚀🚀A collection of some wesome public projects about Large Language Model(LLM), Visual Language Model(VLM), AI Generated Content(AIGC), the related Datasets and Applications.
coderonion/awesome-cuda-triton-hpc
🔥🔥🔥 A collection of some awesome public CUDA, cuBLAS, cuDNN, CUTLASS, TensorRT, TensorRT-LLM, Triton, MLIR and High Performance Computing (HPC) projects.
coderonion/awesome-snn
🔥🔥🔥A collection of some awesome public SNN(Spiking Neural Network) projects.
coderonion/awesome-cpp20
This repository lists some awesome public projects about C++20, C++23, C++26 and beyond.
coderonion/aidea
AIdea 是一款支持 GPT 以及国产大语言模型通义千问、文心一言等,支持 Stable Diffusion 文生图、图生图、 SDXL1.0、超分辨率、图片上色的全能型 APP。
coderonion/autodistill
Images to inference with no labeling (use foundation models to train supervised models).
coderonion/coderonion
coderonion/cuda-python
CUDA Python Low-level Bindings
coderonion/cutlass
CUDA Templates for Linear Algebra Subroutines
coderonion/expo
An open-source framework for making universal native apps with React. Expo runs on Android, iOS, and the web.
coderonion/FlagGems
FlagGems is an operator library for large language models implemented in Triton Language.
coderonion/flutter
Flutter makes it easy and fast to build beautiful apps for mobile and beyond
coderonion/GroundingDINO
The official implementation of "Grounding DINO: Marrying DINO with Grounded Pre-Training for Open-Set Object Detection"
coderonion/KuiperLLama
校招、秋招、春招、实习好项目,带你从零动手实现支持LLama2/3和Qwen2.5的大模型推理框架。
coderonion/lite_llama
The llama model inference lite framework by triton.
coderonion/llm_note
LLM notes, including model inference, transformer model structure, and llm framework code analysis notes
coderonion/LLMFarm
llama and other large language models on iOS and MacOS offline using GGML library.
coderonion/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports Multi AI Providers( OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge management / RAG ), Multi-Modals (Vision/TTS/Plugins/Artifacts). One-click FREE deployment of your private ChatGPT/ Claude application.
coderonion/object-detection-inference
C++ object detection inference from video or image input source
coderonion/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
coderonion/swift
The Swift Programming Language
coderonion/TensorRT
NVIDIA® TensorRT™ is an SDK for high-performance deep learning inference on NVIDIA GPUs. This repository contains the open source components of TensorRT.
coderonion/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
coderonion/TensorRT-YOLO
🚀 你的YOLO部署神器。TensorRT Plugin、CUDA Kernel、CUDA Graphs三管齐下,享受闪电般的推理速度。| Your YOLO Deployment Powerhouse. With the synergy of TensorRT Plugins, CUDA Kernels, and CUDA Graphs, experience lightning-fast inference speeds.
coderonion/tensorrtx
Implementation of popular deep learning networks with TensorRT network definition API
coderonion/triton
Development repository for the Triton language and compiler
coderonion/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
coderonion/web3.js
Collection of comprehensive TypeScript libraries for Interaction with the Ethereum JSON RPC API and utility functions.
coderonion/YOLO-Patch-Based-Inference
Python library for YOLO small object detection and instance segmentation