controlRun's Stars
huggingface/transformers
🤗 Transformers: State-of-the-art Machine Learning for PyTorch, TensorFlow, and JAX.
sgl-project/sglang
SGLang is a fast serving framework for large language models and vision language models.
ollama/ollama
Get up and running with Llama 3.2, Mistral, Gemma 2, and other large language models.
hiyouga/LLaMA-Factory
Efficiently Fine-Tune 100+ LLMs in WebUI (ACL 2024)
casper-hansen/AutoAWQ
AutoAWQ implements the AWQ algorithm for 4-bit quantization with a 2x speedup during inference.
AutoGPTQ/AutoGPTQ
An easy-to-use LLM quantization package with user-friendly APIs, based on the GPTQ algorithm.
dataelement/bisheng
BISHENG is an open LLM devops platform for next generation Enterprise AI applications. Powerful and comprehensive features include: GenAI workflow, RAG, Agent, Unified model management, Evaluation, SFT, Dataset Management, Enterprise-level System Management, Observability and more.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
jackfrued/Python-100-Days
Python - From Novice to Master in 100 Days
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
triton-inference-server/fastertransformer_backend
mlc-ai/mlc-llm
Universal LLM Deployment Engine with ML Compilation
NVIDIA/FasterTransformer
Transformer-related optimization, including BERT and GPT
Ailln/cn2an
📦 Quickly convert between Chinese numerals and Arabic numerals (latest features: conversion of fractions, dates, temperatures, and more)
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
THUDM/ChatGLM-6B
ChatGLM-6B: An Open Bilingual Dialogue Language Model
TheNetAdmin/zjuthesis
Zhejiang University Graduation Thesis LaTeX Template
tensorflow/models
Models and examples built with TensorFlow
iree-org/iree
A retargetable MLIR-based machine learning compiler and runtime toolkit.
ModelTC/MQBench
Model Quantization Benchmark
Jermmy/pytorch-quantization-demo
A simple network quantization demo using pytorch from scratch.
pytorch/pytorch
Tensors and Dynamic neural networks in Python with strong GPU acceleration
BBuf/tvm_mlir_learn
A collection of compiler learning resources.
mit-han-lab/torchsparse
[MICRO'23, MLSys'22] TorchSparse: Efficient Training and Inference Framework for Sparse Convolution on GPUs.
DeepVAC/deepvac
PyTorch Project Specification.
tensorlayer/TensorLayer
Deep Learning and Reinforcement Learning Library for Scientists and Engineers
xinge008/Cylinder3D
Ranked 1st on the SemanticKITTI semantic segmentation leaderboard (both single-scan and multi-scan) (Nov. 2020) (CVPR 2021 Oral)
mit-han-lab/e3d
Efficient 3D Deep Learning
SmallPond/MyNES
A step-by-step implementation of a simple NES (Famicom) emulator that runs Super Mario, using https://github.com/amhndu/SimpleNES as a template
ARM-software/CMSIS_5
CMSIS Version 5 Development Repository