anchoret2009

anchoret2009's Stars

microsoft/autogen
A programming framework for agentic AI 🤖 PyPi: autogen-agentchat Discord: https://aka.ms/autogen-discord Office Hour: https://aka.ms/autogen-officehour
Language:Python37.1k5.4k
NJU-PCALab/STAR
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution
Language:Python33512
innnky/so-vits-svc
基于vits与softvc的歌声音色转换模型
Language:Python3.6k7
78/xiaozhi-esp32
Build your own AI friend
Language:C2.4k381
BasedHardware/omi
AI wearables
Language:C3.9k514
BasedHardware/OpenGlass
Turn any glasses into AI-powered smart glasses
Language:C3.4k430
richards199999/Thinking-Claude
Let your Claude able to think
Language:TypeScript13.2k1.6k
stanford-oval/storm
An LLM-powered knowledge curation system that researches a topic and generates a full-length report with citations.
Language:Python19.5k1.7k
jianzongwu/DiffSensei
Implementation of "DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation"
Language:Python53048
freqtrade/freqtrade
Free, open source crypto trading bot
Language:Python34.3k6.8k
bytedance/LatentSync
Taming Stable Diffusion for Lip Sync!
Language:Python1.5k153
UCSC-VLAA/story-adapter
A Training-free Iterative Framework for Long Story Visualization
Language:Python57775
VIPL-Audio-Visual-Speech-Understanding/LipNet-PyTorch
The state-of-art PyTorch implementation of the method described in the paper "LipNet: End-to-End Sentence-level Lipreading" (https://arxiv.org/abs/1611.01599)
Language:Python21252
afourast/deep_lip_reading
Code and models for evaluating a state-of-the-art lip reading network
Language:Python19353
astorfi/lip-reading-deeplearning
:unlock: Lip Reading - Cross Audio-Visual Recognition using 3D Architectures
Language:Python1.8k325
AaronComo/LipFD
[NeurIPS 2024] This is the official repo of the paper "Lips Are Lying: Spotting the Temporal Inconsistency between Audio and Visual in Lip-syncing DeepFakes".
Language:Python855
myhhub/stock
stock股票.获取股票数据,计算股票指标,筹码分布,识别股票形态,综合选股,选股策略,股票验证回测,股票自动交易,支持PC及移动设备。
Language:Python7.1k1.3k
Francis-Rings/StableAnimator
We present StableAnimator, the first end-to-end ID-preserving video diffusion framework, which synthesizes high-quality videos without any post-processing, conditioned on a reference image and a sequence of poses.
Language:Python1.1k52
mayuelala/FollowYourPose
[AAAI 2024] Follow-Your-Pose: This repo is the official implementation of "Follow-Your-Pose : Pose-Guided Text-to-Video Generation using Pose-Free Videos"
Language:Python1.3k90
pixeli99/SVD_Xtend
Stable Video Diffusion Training Code and Extensions.
Language:Python64965
Tencent/MimicMotion
High-Quality Human Motion Video Generation with Confidence-aware Pose Guidance
Language:Python2.1k176
youngwoo-yoon/youtube-gesture-dataset
This repository contains scripts to build Youtube Gesture Dataset.
Language:Python12018
alvinliu0/HA2G
[CVPR 2022] Code for "Learning Hierarchical Cross-Modal Association for Co-Speech Gesture Generation"
Language:Python1349
ai4r/Gesture-Generation-from-Trimodal-Context
Speech Gesture Generation from the Trimodal Context of Text, Audio, and Speaker Identity (SIGGRAPH Asia 2020)
Language:Python25035
Advocate99/DiffGesture
[CVPR'2023] Taming Diffusion Models for Audio-Driven Co-Speech Gesture Generation
Language:Python24217
antgroup/echomimic_v2
EchoMimicV2: Towards Striking, Simplified, and Semi-Body Human Animation
Language:Python2.2k260
apoorvkh/cvpr-latex-template
Extended LaTeX template for CVPR/ICCV papers
Language:TeX522177
opendatalab/MinerU
A high-quality tool for convert PDF to Markdown and JSON.一站式开源高质量数据提取工具，将PDF转换成Markdown和JSON格式。
Language:Python24.1k1.8k
Miffyli/im2latex-dataset
Python tools for creating suitable dataset for OpenAI's im2latex task: https://openai.com/requests-for-research/#im2latex
Language:Python13541
TH-MLab/DanceFusion
Language:JavaScript6