ice-tong's Stars
ggerganov/llama.cpp
LLM inference in C/C++
2noise/ChatTTS
A generative speech model for daily dialogue.
OpenDevin/OpenDevin
🐚 OpenDevin: Code Less, Make More
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具,将PDF转换成Markdown和JSON格式。
benfred/py-spy
Sampling profiler for Python programs
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
ggerganov/ggml
Tensor library for machine learning
THU-MIG/yolov10
YOLOv10: Real-Time End-to-End Object Detection [NeurIPS 2024]
THUDM/CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
miss-mumu/developer2gwy
公务员从入门到上岸,最佳程序员公考实践教程
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
leejet/stable-diffusion.cpp
Stable Diffusion and Flux in pure C/C++
fundamentalvision/BEVFormer
[ECCV 2022] This is the official implementation of BEVFormer, a camera-only framework for autonomous driving perception, e.g., 3D object detection and semantic map segmentation.
ridgerchu/matmulfreellm
Implementation for MatMul-free LM.
pytorch/torchtitan
A PyTorch native library for large model training
DLLXW/baby-llama2-chinese
用于从头预训练+SFT一个小参数量的中文LLaMa2的仓库;24G单卡即可运行得到一个具备简单中文问答能力的chat-llama2.
LLMBook-zh/LLMBook-zh.github.io
《大语言模型》作者:赵鑫,李军毅,周昆,唐天一,文继荣
microsoft/Olive
Olive: Simplify ML Model Finetuning, Conversion, Quantization, and Optimization for CPUs, GPUs and NPUs.
NVIDIA/MatX
An efficient C++17 GPU numerical computing library with Python-like syntax
wangzhaode/llm-export
llm-export can export llm model to onnx.
spcl/pymlir
Python interface for MLIR - the Multi-Level Intermediate Representation
wejoncy/QLLM
A general 2-8 bits quantization toolbox with GPTQ/AWQ/HQQ, and export to onnx/onnx-runtime easily.
inisis/OnnxSlim
A Toolkit to Help Optimize Onnx Model
onnx/neural-compressor
Model compression for ONNX
jackdewinter/pymarkdown
Quantco/spox
Pythonic framework for building ONNX graphs
AXERA-TECH/ax-llm
Explore LLM model deployment based on AXera's AI chips
TiledTensor/ThrillerFlow
ThrillerFlow is a Dataflow Analysis and Codegen Framework written in Rust.