GaryGao99

GaryGao99's Stars

prometheus/prometheus
The Prometheus monitoring system and time series database.
Language: Go · Stars: 53.9k · Forks: 8.9k
modelscope/eval-scope
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
Language: Python · Stars: 113 · Forks: 15
RUCAIBox/LLMSurvey
The official GitHub page for the survey paper "A Survey of Large Language Models".
Language: Python · Stars: 9.6k · Forks: 745
0xBYTESHIFT/fp16
class that represents 16-bit floating point (half)
Language:C++112
xai-org/grok-1
Grok open release
Language: Python · Stars: 49.2k · Forks: 8.3k
leetal/ios-cmake
A CMake toolchain file for iOS/iPadOS, visionOS, macOS, watchOS & tvOS C/C++/Obj-C++ development
Language: CMake · Stars: 1.8k · Forks: 437
jarro2783/cxxopts
Lightweight C++ command line option parser
Language: C++ · Stars: 4.1k · Forks: 574
facebookresearch/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
Language: Jupyter Notebook · Stars: 10.6k · Forks: 1k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python · Stars: 32.1k · Forks: 3.9k
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
Language: Python · Stars: 10.4k · Forks: 873
OpenNMT/CTranslate2
Fast inference engine for Transformer models
Language: C++ · Stars: 3.1k · Forks: 273
netease-youdao/EmotiVoice
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
Language: Python · Stars: 6.9k · Forks: 576
k2-fsa/sherpa-ncnn
Real-time speech recognition using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Raspberry Pi, VisionFive2, LicheePi4A etc.
Language: C++ · Stars: 908 · Forks: 140
NVIDIA/TensorRT-LLM
TensorRT-LLM provides users with an easy-to-use Python API to define Large Language Models (LLMs) and build TensorRT engines that contain state-of-the-art optimizations to perform inference efficiently on NVIDIA GPUs. TensorRT-LLM also contains components to create Python and C++ runtimes that execute those TensorRT engines.
Language: C++ · Stars: 7.6k · Forks: 827
wangzhaode/mnn-llm
An LLM deployment project based on MNN.
Language: C++ · Stars: 1.4k · Forks: 151
kensho-technologies/pyctcdecode
A fast and lightweight python-based CTC beam search decoder for speech recognition.
Language: Python · Stars: 419 · Forks: 89
vllm-project/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Language: Python · Stars: 23.2k · Forks: 3.3k
InternLM/lmdeploy
LMDeploy is a toolkit for compressing, deploying, and serving LLMs.
Language: Python · Stars: 3.4k · Forks: 303
a16z-infra/ai-town
An MIT-licensed, deployable starter kit for building and customizing your own version of AI town, a virtual town where AI characters live, chat and socialize.
Language: TypeScript · Stars: 7.2k · Forks: 659
NVIDIA-AI-IOT/torch2trt
An easy to use PyTorch to TensorRT converter
Language: Python · Stars: 4.5k · Forks: 670
s0md3v/roop
one-click face swap
Language: Python · Stars: 25.8k · Forks: 6.3k
baichuan-inc/Baichuan-7B
A large-scale 7B pretraining language model developed by BaiChuan-Inc.
Language: Python · Stars: 5.7k · Forks: 503
pengzhile/pandora
Pandora, a ChatGPT that helps you breathe smoothly.
Language: Python · Stars: 20.7k · Forks: 3.5k
Chanzhaoyu/chatgpt-web
A ChatGPT demo web page built with Express and Vue3
Language: Vue · Stars: 31k · Forks: 11.2k
locustio/locust
Write scalable load tests in plain Python 🚗💨
Language: Python · Stars: 24.3k · Forks: 2.9k
ggerganov/whisper.cpp
Port of OpenAI's Whisper model in C/C++
Language: C++ · Stars: 33.2k · Forks: 3.3k
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
Language: Python · Stars: 64.8k · Forks: 7.6k
Mooler0410/LLMsPracticalGuide
A curated list of practical guide resources of LLMs (LLMs Tree, Examples, Papers)
Stars: 9.1k · Forks: 690
grpc/grpc
The C based gRPC (C++, Python, Ruby, Objective-C, PHP, C#)
Language: C++ · Stars: 41.3k · Forks: 10.4k
mlc-ai/web-llm
High-performance In-browser LLM Inference Engine
Language: TypeScript · Stars: 11.8k · Forks: 742