GaryGao99's Stars
prometheus/prometheus
The Prometheus monitoring system and time series database.
modelscope/eval-scope
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
0xBYTESHIFT/fp16
A class that represents a 16-bit floating-point (half) value
xai-org/grok-1
Grok open release
leetal/ios-cmake
A CMake toolchain file for iOS/iPadOS, visionOS, macOS, watchOS & tvOS C/C++/Obj-C++ development
jarro2783/cxxopts
Lightweight C++ command line option parser
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
OpenNMT/CTranslate2
Fast inference engine for Transformer models
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
k2-fsa/sherpa-ncnn
Real-time speech recognition using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Raspberry Pi, VisionFive2, LicheePi4A etc.
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
wangzhaode/mnn-llm
LLM deployment project based on MNN.
kensho-technologies/pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
a16z-infra/ai-town
An MIT-licensed, deployable starter kit for building and customizing your own version of AI town - a virtual town where AI characters live, chat and socialize.
NVIDIA-AI-IOT/torch2trt
An easy to use PyTorch to TensorRT converter
s0md3v/roop
one-click face swap
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
pengzhile/pandora
Pandora, a ChatGPT that helps you breathe smoothly.
Chanzhaoyu/chatgpt-web
A ChatGPT demo web page built with Express and Vue3
locustio/locust
Write scalable load tests in plain Python 🚗💨
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources for LLMs (LLMs Tree, Examples, Papers)
grpc/grpc
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
mlc-ai/web-llm
High-performance In-browser LLM Inference Engine