xulin790's Stars
Cinnamon/kotaemon
An open-source RAG-based tool for chatting with your documents.
t41372/Open-LLM-VTuber
Talk to any LLM with hands-free voice interaction, voice interruption, Live2D taking face, and long-term memory running locally across platforms
AkariAsai/self-rag
This includes the original implementation of SELF-RAG: Learning to Retrieve, Generate and Critique through self-reflection by Akari Asai, Zeqiu Wu, Yizhong Wang, Avirup Sil, and Hannaneh Hajishirzi.
wan-h/awesome-digital-human-live2d
Awesome Digital Human
cuda-mode/awesomeMLSys
An ML Systems Onboarding list
RQLuo/MixTeX-Latex-OCR
MixTeX multimodal LaTeX, ZhEn, and, Table OCR. It performs efficient CPU-based inference in a local offline on Windows.
tomasonjo/blogs
Jupyter notebooks that support my graph data science blog posts at https://bratanic-tomaz.medium.com/
run-llama/multi-agent-concierge
AUTOMATIC1111/stable-diffusion-webui
Stable Diffusion web UI
OpenTalker/SadTalker
[CVPR 2023] SadTalker:Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
NirDiamant/RAG_Techniques
This repository showcases various advanced techniques for Retrieval-Augmented Generation (RAG) systems. RAG systems combine information retrieval with generative models to provide accurate and contextually rich responses.
ZiqiaoPeng/SyncTalk
[CVPR 2024] This is the official source for our paper "SyncTalk: The Devil is in the Synchronization for Talking Head Synthesis"
FunAudioLLM/SenseVoice
Multilingual Voice Understanding Model
jianchang512/clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
zhulu111/ComfyUI_Bxb
SD变现宝:一键把comfyui工作流转换成小程序。
MahmoudAshraf97/whisper-diarization
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
AIGC-Audio/AudioGPT
AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head
seadfeng/vercel-proxy-sites
seadfeng/cloudflare-proxy-sites
mikekelly/AgentK
An autoagentic AGI that is self-evolving and modular.
QwenLM/Qwen2-Audio
The official repo of Qwen2-Audio chat & pretrained large audio language model proposed by Alibaba Cloud.
Stirling-Tools/Stirling-PDF
#1 Locally hosted web application that allows you to perform various operations on PDF files
teableio/teable
✨ The Next Gen Airtable Alternative: No-Code Postgres
jeecgboot/JeecgBoot
🔥「企业级低代码平台」前后端分离架构SpringBoot 2.x/3.x,SpringCloud,Ant Design&Vue3,Mybatis,Shiro,JWT。强大的代码生成器让前后端代码一键生成,无需写任何代码! 引领新的开发模式OnlineCoding->代码生成->手工MERGE,帮助Java项目解决70%重复工作,让开发更关注业务,既能快速提高效率,帮助公司节省成本,同时又不失灵活性。
jam3scampbell/ProctorAI
The AI to keep you focused 😈
KwaiVGI/LivePortrait
Bring portraits to life!
win4r/GraphRAG4OpenWebUI
GraphRAG4OpenWebUI integrates Microsoft's GraphRAG technology into Open WebUI, providing a versatile information retrieval API. It combines local, global, and web searches for advanced Q&A systems and search engines. This tool simplifies graph-based retrieval integration in open web environments.
lhl/voicechat2
Local SRT/LLM/TTS Voicechat
Chadwuo/li-ji-weapp
「礼记」致力于记录和管理人情往来中的随礼、礼金、份子钱、送礼、收礼,专业又懂你的人情记账簿,全家人共享账本,多维度查询统计亲友间往来记录。每一份人情都值得礼记。
harry0703/AudioNotes
快速提取音视频内容,整理成一份结构化的markdown笔记