Pinned Repositories
.NET-Deobfuscator
List of open-source .NET deobfuscators and unpackers
3D_ChineseInkPaintingStyleShader
An application of 3D Chinese Ink Painting Style shader using Unity
3dio-js
JavaScript toolkit for interior apps
Agently-Daily-News-Collector
An open-source, LLM-based workflow showcase for automatic daily news collection, powered by the Agently AI application development framework.
agents
Build real-time multimodal AI applications 🤖🎙️📹
aiavatarkit
🥰 Building AI-based conversational avatars lightning fast ⚡️💬
Algorithm-Of-Thoughts
My implementation of "Algorithm of Thoughts: Enhancing Exploration of Ideas in Large Language Models"
alignment-handbook
Robust recipes to align language models with human and AI preferences
ALVR
Stream VR games from your PC to your headset via Wi-Fi
yuan505's Repositories
yuan505/Awesome-Speech-Language-Model
Papers, code, and resources for speech language models and end-to-end speech dialogue systems.
yuan505/clickclickclick
A framework to enable autonomous Android and computer use with any LLM (local or remote)
yuan505/ComfyUI-FunAudioLLM
ComfyUI custom node for FunAudioLLM, including CosyVoice and SenseVoice
yuan505/ComfyUI-IF_MemoAvatar
Memory-Guided Diffusion for Expressive Talking Video Generation
yuan505/DiariZen
A toolkit for speaker diarization.
yuan505/docling
Get your documents ready for gen AI
yuan505/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
yuan505/firecrawl
🔥 Turn entire websites into LLM-ready markdown or structured data. Scrape, crawl and extract with a single API.
yuan505/hertz-dev
First base model for full-duplex conversational audio
yuan505/HeyGem.ai
yuan505/kestra
⚡ Workflow Automation Platform. Orchestrate & Schedule code in any language, run anywhere, 500+ plugins. Alternative to Zapier, Rundeck, Camunda, Airflow...
yuan505/llmware
Unified framework for building enterprise RAG pipelines with small, specialized models
yuan505/lobe-vidol
🧸 Lobe Vidol - Making Virtual Idols Accessible for EveryOne
yuan505/MimicTalk
MimicTalk: Mimicking a personalized and expressive 3D talking face in minutes; NeurIPS 2024; Official code
yuan505/MinerU-Lang
Specify Lang | A one-stop, open-source, high-quality data extraction tool that supports PDF, webpage, and multi-format e-book extraction.
yuan505/ml-depth-pro
Depth Pro: Sharp Monocular Metric Depth in Less Than a Second.
yuan505/OuteTTS
Interface for OuteTTS models.
yuan505/prompts
A prompting library
yuan505/refly
🎨 Refly is an open-source AI-native creation engine. Its intuitive free-form canvas interface combines multi-threaded dialogues, AI knowledge base integration, chrome extension clip & save, contextual memory, intelligent search, WYSIWYG AI editor and more, empowering you to effortlessly transform ideas into production-ready content.
yuan505/reverb
Open source inference code for Rev's model
yuan505/snap-camera-server
An alternative, self-hosted solution that allows you to continue using Snap Camera with all Snapchat filters after its shutdown on January 25, 2023.
yuan505/TRELLIS
Official repo for paper "Structured 3D Latents for Scalable and Versatile 3D Generation".
yuan505/V0-system-prompt
yuan505/ViewComfy
ViewComfy is an open-source tool to help you create beautiful web apps from ComfyUI
yuan505/voice-pro
The best Gradio web UI for AI transcription, translation, and TTS. Automatic subtitle creation using faster-whisper. Easy one-click installation. Fully portable.
yuan505/whisper-web
ML-powered speech recognition directly in your browser
yuan505/whodb
A lightweight next-gen data explorer for Postgres, MySQL, SQLite, MongoDB, Redis, MariaDB, and Elasticsearch, with a chat interface
yuan505/wordpecker-app
A personalized language-learning tool that combines Duolingo-style lessons with your own curated vocabulary lists. Seamlessly add words from books, articles, or videos, and revisit them through interactive quizzes and LLM-generated lessons.
yuan505/yt-dlp
A feature-rich command-line audio/video downloader
yuan505/zeroth-bot
3D-printed open-source humanoid robot platform for sim-to-real and RL