anchoret2009's Stars
microsoft/autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
NJU-PCALab/STAR
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
innnky/so-vits-svc
A singing voice timbre conversion model based on vits and softvc
78/xiaozhi-esp32
Build your own AI friend
BasedHardware/omi
AI wearables
BasedHardware/OpenGlass
Turn any glasses into AI-powered smart glasses
richards199999/Thinking-Claude
Let your Claude be able to think
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
jianzongwu/DiffSensei
Implementation of "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"
freqtrade/freqtrade
Free, open source crypto trading bot
bytedance/LatentSync
Taming Stable Diffusion for Lip Sync!
UCSC-VLAA/story-adapter
A Training-free Iterative Framework for Long Story Visualization
VIPL-Audio-Visual-Speech-Understanding/LipNet-PyTorch
The state-of-the-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)
afourast/deep_lip_reading
Code and models for evaluating a state-of-the-art lip reading network
astorfi/lip-reading-deeplearning
🔓 Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
AaronComo/LipFD
[NeurIPS 2024] This is the official repo of the paper "Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-syncing DeepFakes".
myhhub/stock
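stock: fetches stock data, computes stock indicators and chip distribution, recognizes stock chart patterns, and provides comprehensive stock screening, stock-picking strategies, backtesting for strategy validation, and automated stock trading. Supports PC and mobile devices.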
Francis-Rings/StableAnimator
We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference image and a sequence of poses.
mayuelala/FollowYourPose
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"
pixeli99/SVD_Xtend
Stable Video Diffusion Training Code and Extensions.
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
youngwoo-yoon/youtube-gesture-dataset
This repository contains scripts to build the YouTube Gesture Dataset.
alvinliu0/HA2G
[CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"
ai4r/Gesture-Generation-from-Trimodal-Context
Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity (SIGGRAPH Asia 2020)
Advocate99/DiffGesture
[CVPR'2023] Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
apoorvkh/cvpr-latex-template
Extended LaTeX template for CVPR/ICCV papers
opendatalab/MinerU
A high-quality, one-stop, open-source data extraction tool for converting PDF to Markdown and JSON.
Miffyli/im2latex-dataset
Python tools for creating suitable dataset for OpenAI's im2latex task: https://openai.com/requests-for-research/#im2latex
TH-MLab/DanceFusion