Lijinqi's Stars
snakers4/silero-vad
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
QwenLM/Qwen-Agent
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
endless-sky/endless-sky
Space exploration, trading, and combat game.
unclecode/crawl4ai
🚀🤖 Crawl4AI: Crawl Smarter, Faster, Freely. For AI.
open-mmlab/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
ManimCommunity/manim
A community-maintained Python framework for creating mathematical animations.
danialgoodwin/android-app-contactless-vital-signs
Using the Android camera, the app detects faces and starts to calculate heart rate, blood pressure, and body temperature.
mediar-ai/screenpipe
one API to get all user desktop data (local, cross platform, 24/7, screen, voice, keyboard, mouse, camera recording). sandboxed js plugin system. keyboard and mouse control
rhasspy/piper
A fast, local neural text to speech system
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
hacksider/Deep-Live-Cam
Real-time face swap and one-click video deepfake with only a single image
lobehub/lobe-chat
🤯 Lobe Chat - an open-source, modern-design AI chat framework. Supports multiple AI providers (OpenAI / Claude 3 / Gemini / Ollama / Qwen / DeepSeek), Knowledge Base (file upload / knowledge management / RAG), and Multi-Modals (Vision/TTS/Plugins/Artifacts). One-click FREE deployment of your private ChatGPT/Claude application.
jaywalnut310/vits
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
Avik-Jain/100-Days-Of-ML-Code
100 Days of ML Coding
deepfakes/faceswap
Deepfakes Software For All
ageitgey/face_recognition
The world's simplest facial recognition api for Python and the command line
s0md3v/roop
one-click face swap
rustdesk/rustdesk
An open-source remote desktop application designed for self-hosting, as an alternative to TeamViewer.
Lightning-AI/LitServe
Lightning-fast serving engine for any AI model of any size. Flexible. Easy. Enterprise-scale.
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
FunAudioLLM/CosyVoice
Multilingual large voice generation model, providing full-stack inference, training, and deployment capabilities.
continuedev/continue
⏩ Continue is the leading open-source AI code assistant. You can connect any models and any context to build custom autocomplete and chat experiences inside VS Code and JetBrains
antgroup/echomimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
opendatalab/MinerU
A high-quality, one-stop open-source data extraction tool for converting PDF to Markdown and JSON.
miss-mumu/developer2gwy
The civil service exam from getting started to passing: a best-practice guide to the civil service exam for programmers
ollama/ollama
Get up and running with Llama 3.3, Mistral, Gemma 2, and other large language models.
Aider-AI/aider
aider is AI pair programming in your terminal
Jack-Cherish/PythonPark
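A Python open-source project, "The Road to Self-Taught Programming", with step-by-step tutorials: AI lab, curated videos, data structures, study guides, machine learning in practice, deep learning in practice, web crawlers, big-tech interview experiences, programmer life, and resource sharing.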
Kedreamix/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
TMElyralab/MuseTalk
MuseTalk: Real-Time High Quality Lip Synchronization with Latent Space Inpainting