winterdrive's Stars
facebookresearch/audiocraft
Audiocraft is a library for audio processing and generation with deep learning. It features the state-of-the-art EnCodec audio compressor / tokenizer, along with MusicGen, a simple and controllable music generation LM with textual and melodic conditioning.
TWValues/TW-Values
台灣價值
espnet/espnet
End-to-End Speech Processing Toolkit
speechbrain/speechbrain
A PyTorch-based Speech Toolkit
pyannote/pyannote-audio
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
m-bain/whisperX
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
2noise/ChatTTS
A generative speech model for daily dialogue.
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
junegunn/redis-stat
(UNMAINTAINED) A real-time Redis monitoring tool
sail-sg/EditAnything
Edit anything in images powered by segment-anything, ControlNet, StableDiffusion, etc. (ACM MM)
AIGCDesignGroup/ReplaceAnything
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
yisol/IDM-VTON
[ECCV2024] IDM-VTON : Improving Diffusion Models for Authentic Virtual Try-on in the Wild
Hanqer/deep-hough-transform
Jittor and Pytorch code for paper "Deep Hough Transform for Semantic Line Detection" (ECCV 2020, PAMI 2021)
wandb/openui
OpenUI let's you describe UI using your imagination, then see it rendered live.
mistralai/mistral-finetune
wal99d/recruitment_agents
phidatahq/phidata
Build AI Assistants with memory, knowledge and tools.
HITsz-TMG/UMOE-Scaling-Unified-Multimodal-LLMs
The codes about "Uni-MoE: Scaling Unified Multimodal Models with Mixture of Experts"
uezo/gpt3-contextual
Contextual chat with GPT-3 model of OpenAI API
line/line-bot-sdk-python
LINE Messaging API SDK for Python
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by Qwen team, Alibaba Cloud.
facebookresearch/faiss
A library for efficient similarity search and clustering of dense vectors.
ivantaiwan/LINEBOT
mendableai/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
crewAIInc/crewAI-examples
Breakthrough/PySceneDetect
:movie_camera: Python and OpenCV-based scene cut/transition detection program & library.
likejazz/llama3.np
llama3.np is a pure NumPy implementation for Llama 3 model.
mlc-ai/web-llm
High-performance In-browser LLM Inference Engine
WebAssembly/wabt
The WebAssembly Binary Toolkit