heawon-yoon's Stars
lalanikarim/webrtc-ai-voice-chat
A WebRTC server that allows you to interact with an LLM using your speech and responds back with generated audio.
fastapi/full-stack-fastapi-template
Full stack, modern web application template. Using FastAPI, React, SQLModel, PostgreSQL, Docker, GitHub Actions, automatic HTTPS and more.
PaddlePaddle/PaddleSpeech
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
uncle-novel/uncle-novel
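📖 Uncle Novel, PC edition: a web-wide novel downloader and reader that combines table-of-contents parsing with book sources, supports both audiobooks and text novels, and can download text novels in mobi, epub, and txt formats.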
mkkellogg/GaussianSplats3D
Three.js-based implementation of 3D Gaussian splatting
zhayujie/chatgpt-on-wechat
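A chatbot built on large language models that supports integration with WeChat Official Accounts, WeCom (Enterprise WeChat) apps, Feishu, DingTalk, and more. It can use GPT3.5/GPT-4o/GPT-o1/Claude/ERNIE Bot/iFlytek Spark/Tongyi Qianwen/Gemini/GLM-4/Kimi/LinkAI, handles text, voice, and images, can access the operating system and the internet, and supports building customized enterprise customer service on top of your own knowledge base.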
eatmoreapple/openwechat
WeChat SDK for Golang
Kedreamix/Linly-Talker
Digital Avatar Conversational System - Linly-Talker. 😄✨ Linly-Talker is an intelligent AI system that combines large language models (LLMs) with visual models to create a novel human-AI interaction method. 🤝🤖 It integrates various technologies like Whisper, Linly, Microsoft Speech Services, and SadTalker talking head generation system. 🌟🔬
zjp-shadow/CharacterGen
[SIGGRAPH'24] CharacterGen: Efficient 3D Character Generation from Single Images with Multi-View Pose Canonicalization
fudan-generative-vision/hallo
Hallo: Hierarchical Audio-Driven Visual Synthesis for Portrait Image Animation
kwea123/gaussian_splatting_notes
A detailed explanation of the formulae behind Gaussian splatting
donahowe/AutoStudio
AutoStudio: Crafting Consistent Subjects in Multi-turn Interactive Image Generation
bmild/nerf
Code release for NeRF (Neural Radiance Fields)
kwea123/nerf_pl
NeRF (Neural Radiance Fields) and NeRF in the Wild using pytorch-lightning
apple/ml-hugs
Official repository of HUGS: Human Gaussian Splats (CVPR 2024)
letta-ai/letta
Letta (formerly MemGPT) is a framework for creating LLM services with memory.
lansinuote/Simple_Generative_in_PyTorch
IDEA-Research/DWPose
"Effective Whole-body Pose Estimation with Two-stages Distillation" (ICCV 2023, CV4Metaverse Workshop)
fudan-generative-vision/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
mks0601/Hand4Whole_RELEASE
Official PyTorch implementation of "Accurate 3D Hand Pose Estimation for Whole-Body 3D Human Mesh Estimation", CVPRW 2022 (Oral.)
ShenhanQian/GaussianAvatars
[CVPR 2024 Highlight] The official repo for "GaussianAvatars: Photorealistic Head Avatars with Rigged 3D Gaussians"
Moguri/blend2bam
A CLI tool to convert Blender blend files to Panda3D BAM files
KosukeFukazawa/CharacterAnimationTools
Character Animation Tools for Python.
graphdeco-inria/gaussian-splatting
Original reference implementation of "3D Gaussian Splatting for Real-Time Radiance Field Rendering"
Visualize-ML/Book7_Visualizations-for-Machine-Learning
Book_7_"Machine Learning" | Iris Book series: From Basic Arithmetic to Machine Learning; feedback and corrections welcome
Visualize-ML/Book5_Essentials-of-Probability-and-Statistics
Book_5_"Essentials of Probability and Statistics" | Iris Book series: From Basic Arithmetic to Machine Learning; now on sale!
Visualize-ML/Book4_Power-of-Matrix
Book_4_"Power of Matrix" | Iris Book series: From Basic Arithmetic to Machine Learning; now on sale!
xiexh20/VisTracker
Official implementation for the CVPR'23 paper: Visibility Aware Human-Object Interaction Tracking from Single RGB Camera
mkocabas/VIBE
Official implementation of CVPR2020 paper "VIBE: Video Inference for Human Body Pose and Shape Estimation"
zju3dv/EasyMocap
Make human motion capture easier.