GaryGao99's Stars
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
grpc/grpc
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Chanzhaoyu/chatgpt-web
用 Express 和 Vue3 搭建的 ChatGPT 演示网页
tatsu-lab/stanford_alpaca
Code and documentation to train Stanford's Alpaca models, and generate the data.
s0md3v/roop
one-click face swap
locustio/locust
Write scalable load tests in plain Python 🚗💨
pengzhile/pandora
潘多拉,一个让你呼吸顺畅的ChatGPT。Pandora, a ChatGPT that helps you breathe smoothly.
pytorch/vision
Datasets, Transforms and Models specific to Computer Vision
mlc-ai/web-llm
High-performance In-browser LLM Inference Engine
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
FMInference/FlexGen
Running large language models on a single GPU for throughput-oriented scenarios.
triton-inference-server/server
The Triton Inference Server provides an optimized cloud and edge inferencing solution.
DA-southampton/NLP_ability
总结梳理自然语言处理工程师(NLP)需要积累的各方面知识,包括面试题,各种基础知识,工程能力等等,提升核心竞争力
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
NVIDIA-AI-IOT/torch2trt
An easy to use PyTorch to TensorRT converter
locuslab/TCN
Sequence modeling benchmarks and temporal convolutional networks
daquexian/onnx-simplifier
Simplify your onnx model
pytorch/TensorRT
PyTorch/TorchScript/FX compiler for NVIDIA GPUs using TensorRT
OpenPPL/ppq
PPL Quantization Tool (PPQ) is a powerful offline neural network quantization tool.
Eric-mingjie/rethinking-network-pruning
Rethinking the Value of Network Pruning (Pytorch) (ICLR 2019)
houko/wechatgpt
wechatgpt golang版 chatgpt机器人(可docker部署),目前支持wechat,telegram
Tencent/PatrickStar
PatrickStar enables Larger, Faster, Greener Pretrained Models for NLP and democratizes AI for everyone.
wenet-e2e/wekws
Production First and Production Ready End-to-End Keyword Spotting Toolkit
triton-inference-server/fastertransformer_backend
NVIDIA/tensorrt-laboratory
Explore the Capabilities of the TensorRT Platform
hyperconnect/TC-ResNet
Code for Temporal Convolution for Real-time Keyword Spotting on Mobile Devices
csukuangfj/kaldifeat
Kaldi-compatible online & offline feature extraction with PyTorch, supporting CUDA, batch processing, chunk processing, and autograd - Provide C++ & Python API
aadhithya/onnx-typecast
Script to typecast ONNX model parameters from INT64 to INT32.
qbxlvnf11/convert-pytorch-onnx-tensorrt
Converting weights of Pytorch models to ONNX & TensorRT engines