stwrd's Stars
gohugoio/hugo
The world’s fastest framework for building websites.
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
2noise/ChatTTS
A generative speech model for daily dialogue.
facefusion/facefusion
Industry leading face manipulation platform
bluenviron/mediamtx
Ready-to-use SRT / WebRTC / RTSP / RTMP / LL-HLS media server and media proxy that allows to read, publish, proxy, record and playback video and audio streams.
NielsRogge/Transformers-Tutorials
This repository contains demos I made with the Transformers library by HuggingFace.
datawhalechina/self-llm
《开源大模型食用指南》基于Linux环境快速部署开源大模型,更适合**宝宝的部署教程
01-ai/Yi
A series of large language models trained from scratch by developers @01-ai
exadel-inc/CompreFace
Leading free and open-source face recognition system
FunAudioLLM/CosyVoice
Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.
rom1504/img2dataset
Easily turn large sets of image urls to an image dataset. Can download, resize and package 100M urls in 20h on one machine.
JDAI-CV/fast-reid
SOTA Re-identification Methods and Toolbox
google-research/big_vision
Official codebase used to develop Vision Transformer, SigLIP, MLP-Mixer, LiT and more.
6drf21e/ChatTTS_colab
🚀 一键部署(含离线整合包)!基于 ChatTTS ,支持流式输出、音色抽卡、长音频生成和分角色朗读。简单易用,无需复杂安装。
tinyvision/SOLIDER
A Semantic Controllable Self-Supervised Learning Framework to learn general human representations from massive unlabeled human images, which can benefit downstream human-centric tasks to the maximum extent
mit-han-lab/efficientvit
EfficientViT is a new family of vision models for efficient high-resolution vision.
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
sherlockchou86/VideoPipe
A cross-platform video structuring (video analysis) framework. If you find it helpful, please give it a star: ) 跨平台的视频结构化(视频分析)框架,觉得有帮助的请给个星星 : )
Lightning-AI/deep-learning-project-template
Pytorch Lightning code guideline for conferences
chflame163/ComfyUI_LayerStyle
A set of nodes for ComfyUI that can composite layer and mask to achieve Photoshop like functionality.
yolain/ComfyUI-Yolain-Workflows
Some awesome comfyui workflows in here, and they are built using the comfyui-easy-use node package.
shivammehta25/Matcha-TTS
[ICASSP 2024] 🍵 Matcha-TTS: A fast TTS architecture with conditional flow matching
CCmahua/ChatTTS-Enhanced
kijai/ComfyUI-MimicMotionWrapper
jmisilo/clip-gpt-captioning
CLIPxGPT Captioner is Image Captioning Model based on OpenAI's CLIP and GPT-2.
aleksandrm8/ONVIF-Device-Manager
mirror of http://sourceforge.net/projects/onvifdm/
alexw914/RK_VideoPipe
dchatel/comfyui_facetools
These custom nodes provide a rotation aware face extraction, paste back, and various face related masking options.
Panda-NEUer/Streamlit-Documentation-Chinese