Pinned Repositories
a16z-xHuman-DIY
A Javascript AI getting started stack for weekend projects, including image/text models, vector stores, auth, and deployment configs
AGI-Samantha
AGI has been achieved externally
ai-journalist
AI-RealChat-DIY
🎙️🤖Create, Customize and Talk to your AI Character/Companion in Realtime(All in One Codebase!). Have a natural seamless conversation with AI everywhere(mobile, web and terminal) using LLM OpenAI GPT3.5/4, Anthropic Claude2, Chroma Vector DB, Whisper Speech2Text, ElevenLabs Text2Speech🎙️🤖
ai-video-search-engine
Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
audio-preprocess
Preprocess Audio for training
NotionWebsite
使用 NextJS + Notion API 实现的静态博客
Pic-2-3D
Official PyTorch Implementation of Magic123: One Image to High-Quality 3D Object Generation Using Both 2D and 3D Diffusion Priors
QAGITECH's Repositories
QAGITECH/AGI-Samantha
AGI has been achieved externally
QAGITECH/ai-video-search-engine
QAGITECH/AnyDoor
Official implementations for paper: Anydoor: zero-shot object-level image customization
QAGITECH/ChatLaw
中文法律大模型
QAGITECH/clone-voice
A sound cloning tool with a web interface, using your voice or any sound to record audio / 一个带web界面的声音克隆工具,使用你的音色或任意声音来录制音频
QAGITECH/EMO
QAGITECH/ER-NeRF
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
QAGITECH/GeneFacePlusPlus
GeneFace++: Generalized and Stable Real-Time 3D Talking Face Generation; Official Code
QAGITECH/i-Code
QAGITECH/history_rag
QAGITECH/LapisCV
📃 开箱即用的 Obsidian / Typora 简历
QAGITECH/LLM-finetune-Factory
Easy-to-use LLM fine-tuning framework (LLaMA, BLOOM, Mistral, Baichuan, Qwen, ChatGLM)
QAGITECH/llm-viz
3D Visualization of an GPT-style LLM
QAGITECH/MaxKB
💬 基于 LLM 大语言模型的知识库问答系统。开箱即用,支持快速嵌入到第三方业务系统,1Panel 官方出品。
QAGITECH/ml-ferret
QAGITECH/MNBVC
MNBVC(Massive Never-ending BT Vast Chinese corpus)超大规模中文语料集。对标chatGPT训练的40T数据。MNBVC数据集不但包括主流文化,也包括各个小众文化甚至火星文的数据。MNBVC数据集包括新闻、作文、小说、书籍、杂志、论文、台词、帖子、wiki、古诗、歌词、商品介绍、笑话、糗事、聊天记录等一切形式的纯文本中文数据。
QAGITECH/MoneyPrinter
Automate Creation of YouTube Shorts using MoviePy.
QAGITECH/moondream
tiny vision language model
QAGITECH/musicRecVDB
🛰️ Voyager is an approximate nearest-neighbor search library for Python and Java with a focus on ease of use, simplicity, and deployability.
QAGITECH/OOTDiffusion
Official implementation of OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
QAGITECH/Open-Sora
Building your own video generation model like OpenAI's Sora
QAGITECH/ott
Api tool for local offline text translation supporting multiple languages/支持多语言的本地离线文字翻译api
QAGITECH/phidata
Build AI Assistants with memory, knowledge and tools.
QAGITECH/ProPainter
[ICCV 2023] ProPainter: Improving Propagation and Transformer for Video Inpainting
QAGITECH/reor
AI note-taking app that runs models locally.
QAGITECH/sam.cpp
QAGITECH/seamless_communication
Foundational Models for State-of-the-Art Speech and Text Translation
QAGITECH/search2ai
大模型联网服务
QAGITECH/versatile_audio_super_resolution
Versatile audio super resolution (any -> 48kHz) with AudioSR.
QAGITECH/vocal-separate
an extremely simple tool for separating vocals and background music, completely localized for web operation, using 2stems/4stems/5stems models 这是一个极简的人声和背景音乐分离工具,本地化网页操作,无需连接外网