Boxie5

neo-wave, Shanghai

Boxie5's Stars

oobabooga/text-generation-webui
A Gradio web UI for Large Language Models.
Language: Python 39.8k 327 3.6k 5.2k
TencentARC/GFPGAN
GFPGAN aims at developing Practical Algorithms for Real-world Face Restoration.
Language: Python 35.6k 504 4725.9k
suno-ai/bark
🔊 Text-Prompted Generative Audio Model
Language: Jupyter Notebook 35.5k 327 4374.2k
microsoft/AI-For-Beginners
12 Weeks, 24 Lessons, AI for All!
Language: Jupyter Notebook 34.3k 403 1115.7k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language: Python 34.2k 287 1.1k 4.1k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python 33.5k 204 1.2k 3.8k
shadowsocks/ShadowsocksX-NG
Next Generation of ShadowsocksX
Language: Swift 32.4k 943 1.4k 7.9k
microsoft/autogen
A programming framework for agentic AI 🤖
Language: Jupyter Notebook 31.5k 381 1.7k 4.6k
jamiebuilds/the-super-tiny-compiler
:snowman: Possibly the smallest compiler ever
Language: JavaScript 27.9k 473 02.9k
deezer/spleeter
Deezer source separation library including pretrained models.
Language: Python 25.7k 386 7722.8k
huggingface/datasets
🤗 The largest hub of ready-to-use datasets for ML models with fast, easy-to-use and efficient data manipulation tools
Language: Python 19.1k 279 2.9k 2.6k
state-spaces/mamba
Mamba SSM architecture
Language: Python 12.7k 101 5121.1k
OpenTalker/SadTalker
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Language: Python 11.7k 148 8162.2k
vanna-ai/vanna
🤖 Chat with your SQL database 📊. Accurate Text-to-SQL Generation via LLMs using RAG 🔄.
Language: Python 11k 79 294858
chenzomi12/AISystem
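AISystem mainly refers to AI systems, covering full-stack low-level AI technologies such as AI chips, AI compilers, and AI inference and training frameworks
Language: Jupyter Notebook 10.6k 146 351.5k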
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
Language: Python 10k 133 50858
numba/numba
NumPy aware dynamic Python compiler using LLVM
Language: Python 9.8k 199 5.2k 1.1k
xszyou/Fay
Fay is an open-source digital human framework integrating language models and digital characters. It offers retail, assistant, and agent versions for diverse applications like virtual shopping guides, broadcasters, assistants, waiters, teachers, and voice or text-based mobile assistants.
9k 110 1101.8k
OthersideAI/self-operating-computer
A framework to enable multimodal models to operate a computer.
Language: Python 8.7k 124 1401.1k
fishaudio/Bert-VITS2
vits2 backbone with multilingual-bert
Language: Python 7.9k 47 01.1k
PKU-YuanGroup/ChatLaw
ChatLaw: A Powerful LLM Tailored for Chinese Legal. A large language model for Chinese law
6.8k 38 74540
XPixelGroup/BasicSR
Open Source Image and Video Restoration Toolbox for Super-resolution, Denoise, Deblurring, etc. Currently, it includes EDSR, RCAN, SRResNet, SRGAN, ESRGAN, EDVR, BasicVSR, SwinIR, ECBSR, etc. Also support StyleGAN2, DFDNet.
Language: Python 6.7k 91 5541.2k
OpenTalker/video-retalking
[SIGGRAPH Asia 2022] VideoReTalking: Audio-based Lip Synchronization for Talking Head Video Editing In the Wild
Language: Python 6.4k 72 242953
MineDojo/Voyager
An Open-Ended Embodied Agent with Large Language Models
Language: JavaScript 5.5k 64 146512
microsoft/LLMLingua
To speed up LLMs' inference and enhance LLM's perceive of key information, compress the prompt and KV-Cache, which achieves up to 20x compression with minimal performance loss.
Language: Python 4.5k 33 120251
tensorflow/datasets
TFDS is a collection of datasets ready to use with TensorFlow, Jax, ...
Language: Python 4.3k 109 1.2k 1.5k
huggingface/safetensors
Simple, safe way to store and distribute tensors
Language: Python 2.8k 40 179189
alitto/pond
🔘 Minimalistic and High-performance goroutine worker pool written in Go
Language: Go 1.5k 23 3664
jianchang512/vocal-separate
An extremely simple tool for separating vocals and background music, run entirely locally through a web interface with no internet connection required, using 2stems/4stems/5stems models
Language: Python 1.3k 8 13143
sustcsonglin/flash-linear-attention
Efficient implementations of state-of-the-art linear attention models in Pytorch and Triton
Language: Python 1.2k 24 4466
