stwrd

stwrd's Stars

gohugoio/hugo
The world’s fastest framework for building websites.
Language: Go 75k 1.1k 7.4k 7.5k
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, API, and backend with a graph/nodes interface.
Language: Python 52k 383 3.3k 5.5k
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language: Python 33.2k 202 1.2k 3.8k
2noise/ChatTTS
A generative speech model for daily dialogue.
Language: Python 31.1k 179 5173.4k
facefusion/facefusion
Industry-leading face manipulation platform
Language: Python 18.4k 178 4282.8k
bluenviron/mediamtx
Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS media server and media proxy that lets you read, publish, proxy, record, and play back video and audio streams.
Language: Go 11.9k 143 1.5k 1.5k
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
Language: Jupyter Notebook 9.1k 136 4431.4k
datawhalechina/self-llm
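"A Practical Guide to Open-Source Large Models": quickly deploy open-source large models in a Linux environment; a deployment tutorial better suited to ** beginners
Language: Jupyter Notebook 8.2k 63 150980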
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
Language: Jupyter Notebook 7.6k 106 291471
exadel-inc/CompreFace
Leading free and open-source face recognition system
Language: Java 5.3k 83 283729
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
Language: Python 5.1k 51 387524
rom1504/img2dataset
Easily turn large sets of image URLs into an image dataset. Can download, resize, and package 100M URLs in 20h on one machine.
Language: Python 3.6k 31 255336
JDAI-CV/fast-reid
SOTA Re-identification Methods and Toolbox
Language: Python 3.4k 58 631835
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
Language: Jupyter Notebook 2.2k 41 54148
6drf21e/ChatTTS_colab
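🚀 One-click deployment (offline bundle included)! Based on ChatTTS, with support for streaming output, random voice-timbre sampling, long-audio generation, and multi-role reading. Simple to use, with no complicated installation required.
Language: Python 1.9k 19 77242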
tinyvision/SOLIDER
A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximum extent
Language: Python 1.9k 130 29344
mit-han-lab/efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
Language: Python 1.8k 36 125164
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Language: Python 1.7k 28 87137
sherlockchou86/VideoPipe
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star :)
Language: C++ 1.4k 21 30197
Lightning-AI/deep-learning-project-template
PyTorch Lightning code guideline for conferences
Language: Python 1.2k 17 16271
chflame163/ComfyUI_LayerStyle
A set of nodes for ComfyUI that can composite layers and masks to achieve Photoshop-like functionality.
Language: Python 1.2k 11 30168
yolain/ComfyUI-Yolain-Workflows
Some awesome ComfyUI workflows, built using the comfyui-easy-use node package.
730 6 971
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
Language: Jupyter Notebook 636 16 6380
CCmahua/ChatTTS-Enhanced
Language: Python 473 5 2066
kijai/ComfyUI-MimicMotionWrapper
Language: Python 275 6 7122
jmisilo/clip-gpt-captioning
CLIPxGPT Captioner is an image captioning model based on OpenAI's CLIP and GPT-2.
Language: Python 109 3 4532
aleksandrm8/ONVIF-Device-Manager
Mirror of http://sourceforge.net/projects/onvifdm/
Language: C 100 13 063
alexw914/RK_VideoPipe
Language: C++ 66 1 810
dchatel/comfyui_facetools
These custom nodes provide rotation-aware face extraction, paste-back, and various face-related masking options.
Language: Python 66 2 228
Panda-NEUer/Streamlit-Documentation-Chinese
Language: Python 2 0 01