hejiaming007's Stars
CMU-Perceptual-Computing-Lab/openpose
OpenPose: Real-time multi-person keypoint detection library for body, face, hands, and foot estimation
yaoxieyoulei/mytv-android
A live TV streaming app developed natively for Android
nomi-san/parsec-vdd
✨ Perfect 4K@240Hz Virtual Display
MetaCubeX/ClashMetaForAndroid
A rule-based tunnel for Android.
KwaiVGI/LivePortrait
Bring portraits to life!
modelscope/FunASR
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing, etc.
run-llama/llama_parse
Parse files for optimal RAG
run-llama/llama_index
LlamaIndex is a data framework for your LLM applications
ltdrdata/ComfyUI-Manager
ComfyUI-Manager is an extension designed to enhance the usability of ComfyUI. It offers management functions to install, remove, disable, and enable various custom nodes of ComfyUI. Furthermore, this extension provides a hub feature and convenience functions to access a wide range of information within ComfyUI.
AIGODLIKE/AIGODLIKE-ComfyUI-Translation
A plugin for multilingual translation of ComfyUI. It translates the resident menu bar, search bar, right-click context menu, nodes, etc.
comfyanonymous/ComfyUI
The most powerful and modular diffusion model GUI, api and backend with a graph/nodes interface.
niedev/RTranslator
Open source real-time translation app for Android that runs locally
GuijiAI/duix.ai
iptv-org/awesome-iptv
A curated list of resources related to IPTV
opencv/opencv
Open Source Computer Vision Library
roboflow/supervision
We write your reusable computer vision tools. 💜
VikParuchuri/marker
Convert PDF to markdown quickly with high accuracy
huggingface/peft
🤗 PEFT: State-of-the-art Parameter-Efficient Fine-Tuning.
SYSTRAN/faster-whisper
Faster Whisper transcription with CTranslate2
openai/whisper
Robust Speech Recognition via Large-Scale Weak Supervision
2noise/ChatTTS
A generative speech model for daily dialogue.
jina-ai/reader
Convert any URL to an LLM-friendly input with a simple prefix https://r.jina.ai/
open-webui/open-webui
User-friendly WebUI for AI (Formerly Ollama WebUI)
microsoft/autogen
A programming framework for agentic AI 🤖
facebookresearch/segment-anything
The repository provides code for running inference with the Segment Anything Model (SAM), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
davabase/whisper_real_time
Real time transcription with OpenAI Whisper.
juanmc2005/diart
A python package to build AI-powered real-time audio applications
s0md3v/roop
one-click face swap
openai/openai-cookbook
Examples and guides for using the OpenAI API
justinjohn0306/so-vits-svc-4.0
SoftVC VITS Singing Voice Conversion