cantodelobo's Stars
Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI tool and algorithm for making ID photos.
outlines-dev/outlines
Structured Text Generation
InternLM/MindSearch
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
1rgs/jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
facebookresearch/sapiens
High-resolution models for human tasks.
lm-sys/RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
BadToBest/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio clip, and a sequence of V-Kps images.
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
muzishen/IMAGDressing
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing
ButzYung/SystemAnimatorOnline
XR Animator, AI-based Full Body Motion Capture and Extended Reality (XR) solution, powered by System Animator Online
TencentARC/SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
tijiang13/InstantAvatar
InstantAvatar: Learning Avatars from Monocular Video in 60 Seconds (CVPR 2023)
zhenzhiwang/HumanVid
Official implementation of HumanVid, NeurIPS D&B Track 2024
remyxai/FFMPerative
Chat to Compose Video
SMPLOlympics/SMPLOlympics
tobias-kirschstein/diffusion-avatars
jimmyYliu/Animatable-3D-Gaussian
deepinstinct/ShimMe
ZebangCheng/Emotion-LLaMA
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
ashwath98/deepcharacters
Code base for Holoported Characters
SocAIty/py_audio2face
Use the NVIDIA Audio2Face headless server and interact with it through a requests API. Generate animation sequences for Unreal Engine 5, Maya and MetaHumans
detalhe/vehicle-ai
Identify any vehicle using AI. Built with Node.js, Express.js, EJS, and the Google Gemini API.
ninglab/eCeLLM
Jyxarthur/AutoAD-Zero
Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman
SuXinqi/DAAD
Official code repository of "DAAD: Dynamic Analysis and Adaptive Discriminator for Fake News Detection"
Ismaelbrendo/whats-spoofing
mit-ccc/AudienceView-demo
AudienceView demo