arnode

arnode's Stars

all-in-aigc/melodisco
AI Music Player
Language:TypeScript495106
microsoft/muzic
Muzic: Music Understanding and Generation with Artificial Intelligence
Language:Python4.7k467
stakira/OpenUtau
Open singing synthesis platform / Open source UTAU successor
Language:C#2.4k323
Anjok07/ultimatevocalremovergui
GUI for a Vocal Remover that uses Deep Neural Networks.
Language:Python19.9k1.5k
feizc/FluxMusic
Text-to-Music Generation with Rectified Flow Transformers
Language:Python1.7k134
run-llama/llama_index
LlamaIndex is the leading framework for building LLM-powered agents over your data.
Language:Python40.3k5.7k
VikParuchuri/marker
Convert PDF to markdown + JSON quickly with high accuracy
Language:Python23.2k1.4k
QwenLM/Qwen-Agent
Agent framework and applications built upon Qwen>=2.0, featuring Function Calling, Code Interpreter, RAG, and Chrome extension.
Language:Python6.3k567
2noise/ChatTTS
A generative speech model for daily dialogue.
Language:Python35.3k3.8k
MetaGLM/glm-cookbook
Examples and guides for using the GLM APIs
Language:Jupyter Notebook855108
huggingface/lerobot
🤗 LeRobot: Making AI for Robotics more accessible with end-to-end learning
Language:Python10.8k1.2k
explosion/sense2vec
🦆 Contextually-keyed word vectors
Language:Python1.6k239
explosion/spaCy
💫 Industrial-strength Natural Language Processing (NLP) in Python
Language:Python31.2k4.5k
1Panel-dev/MaxKB
💬 Ready-to-use & flexible RAG Chatbot, supporting mainstream large language models (LLMs) such as DeepSeek-R1, Llama 3.3, Qwen2, OpenAI and more.
Language:Python14.9k2k
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
Language:Python4.2k483
instantX-research/InstantID
InstantID: Zero-shot Identity-Preserving Generation in Seconds 🔥
Language:Python11.5k832
MoonInTheRiver/DiffSinger
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
Language:Python4.4k729
Fictionarry/ER-NeRF
[ICCV'23] Efficient Region-Aware Neural Radiance Fields for High-Fidelity Talking Portrait Synthesis
Language:Python1.2k140
RVC-Boss/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
Language:Python43k4.8k
coqui-ai/TTS
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
Language:Python38.8k4.9k
Kedreamix/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
Language:Python2.5k406
lipku/LiveTalking
Real time interactive streaming digital human
Language:Python5k743
OpenTalker/SadTalker
[CVPR 2023] SadTalker：Learning Realistic 3D Motion Coefficients for Stylized Audio-Driven Single Image Talking Face Animation
Language:Python12.5k2.3k
Human3DAIGC/Make-A-Character
Official repo for Make-A-Character: High Quality Text-to-3D Character Generation within Minutes
55317
xai-org/grok-1
Grok open release
Language:Python50.2k8.4k
yuqinie98/PatchTST
An offical implementation of PatchTST: "A Time Series is Worth 64 Words: Long-term Forecasting with Transformers." (ICLR 2023) https://arxiv.org/abs/2211.14730
Language:Python1.8k308
opendilab/LightZero
[NeurIPS 2023 Spotlight] LightZero: A Unified Benchmark for Monte Carlo Tree Search in General Sequential Decision Scenarios (awesome MCTS)
Language:Python1.3k146
zhayujie/chatgpt-on-wechat
基于大模型搭建的聊天机器人，同时支持微信公众号、企业微信应用、飞书、钉钉等接入，可选择GPT3.5/GPT-4o/GPT-o1/ DeepSeek/Claude/文心一言/讯飞星火/通义千问/ Gemini/GLM-4/Claude/Kimi/LinkAI，能处理文本、语音和图片，访问操作系统和互联网，支持基于自有知识库进行定制企业智能客服。
Language:Python35.9k9k
DaiShiResearch/TransNeXt
[CVPR 2024] Code release for TransNeXt model
Language:Python49520
OrionStarAI/Orion
Orion-14B is a family of models includes a 14B foundation LLM, and a series of models: a chat model, a long context model, a quantized model, a RAG fine-tuned model, and an Agent fine-tuned model. Orion-14B 系列模型包括一个具有140亿参数的多语言基座大模型以及一系列相关的衍生模型，包括对话模型，长文本模型，量化模型，RAG微调模型，Agent微调模型等。
Language:Python79157