KangweiiLiu's Stars
memoavatar/memo
Memory-Guided Diffusion for Expressive Talking Video Generation
UCSC-VLAA/VL-Thinking
ShenhanQian/VHAP
A complete head tracking pipeline from videos to NeRF/3DGS-ready datasets.
RUC-AIMind/TikTalk
atong01/conditional-flow-matching
TorchCFM: a Conditional Flow Matching library
antgroup/ditto-talkinghead
Ditto: Motion-Space Diffusion for Controllable Realtime Talking Head Synthesis
SimonGiebenhain/MonoNPHM
[CVPR 2024 Highlight]
RuoyuChen10/VPS
[CVPR 2025] Interpreting Object-level Foundation Models via Visual Precision Search
jixiaozhong/Sonic
Official implementation of "Sonic: Shifting Focus to Global Audio Perception in Portrait Animation"
JeremyCJM/DiffSHEG
[CVPR'24] DiffSHEG: A Diffusion-Based Approach for Real-Time Speech-driven Holistic 3D Expression and Gesture Generation
deepseek-ai/DeepSeek-V3
youngyangyang04/leetcode-master
《代码随想录》LeetCode 刷题攻略:200道经典题目刷题顺序,共60w字的详细图解,视频难点剖析,50余张思维导图,支持C++,Java,Python,Go,JavaScript等多语言版本,从此算法学习不再迷茫!🔥🔥 来看看,你会发现相见恨晚!🚀
0nutation/SpeechGPT
SpeechGPT Series: Speech Large Language Models
gpt-omni/mini-omni
open-source multimodal large language model that can hear, talk while thinking. Featuring real-time end-to-end speech input and streaming audio output conversational capabilities.
whwjdqls/DEEPTalk
Official code release of "DEEPTalk: Dynamic Emotion Embedding for Probabilistic Speech-Driven 3D Face Animation" [AAAI2025]
FoundationVision/VAR
[NeurIPS 2024 Best Paper][GPT beats diffusion🔥] [scaling laws in visual generation📈] Official impl. of "Visual Autoregressive Modeling: Scalable Image Generation via Next-Scale Prediction". An *ultra-simple, user-friendly yet state-of-the-art* codebase for autoregressive image generation!
hustvl/ControlAR
[ICLR 2025] ControlAR: Controllable Image Generation with Autoregressive Models
MiracleDance/CAR
CAR: Controllable AutoRegressive Modeling for Visual Generation
neeek2303/EMOPortraits
Official implementation of EMOPortraits: Emotion-enhanced Multimodal One-shot Head Avatars
mli/autocut
用文本编辑器剪视频
WEIFENG2333/VideoCaptioner
🎬 卡卡字幕助手 | VideoCaptioner - 基于 LLM 的智能字幕助手 - 视频字幕生成、断句、校正、字幕翻译全流程处理!- A powered tool for easy and efficient video subtitling.
jdh-algo/JoyVASA
Diffusion-based Portrait and Animal Animation
Juzhan/TAP-Net
TAP-Net: Transport-and-Pack using Reinforcement Learning
dasvision0212/3D-Bin-Packing-Problem-with-BRKGA
An implementation ofr Biased Random Key Genetic Algorithmn for 3D Bin Packing Problem.
Xiong5Heng/GOPT
[IEEE RA-L 2024] GOPT: Generalizable Online 3D Bin Packing via Transformer-based Deep Reinforcement Learning
DiffPoseTalk/DiffPoseTalk
DiffPoseTalk: Speech-Driven Stylistic 3D Facial Animation and Head Pose Generation via Diffusion Models
hzwer/WritingAIPaper
Writing AI Conference Papers: A Handbook for Beginners
alexfrom0815/Online-3D-BPP-PCT
Code implementation of "Learning Efficient Online 3D Bin Packing on Packing Configuration Trees". We propose to enhance the practical applicability of online 3D Bin Packing Problem (BPP) via learning on a hierarchical packing configuration tree which makes the deep reinforcement learning (DRL) model easy to deal with practical constraints and well-performing even with continuous solution space.
KangweiiLiu/Talking_Face_Dataset_Preprocess_script
Optimizing Video Face Detection for Talking Face Detection: Enhancing Performance and Output Quality
LetheSec/HuggingFace-Download-Accelerator
利用HuggingFace的官方下载工具从镜像网站进行高速下载。