cantodelobo

cantodelobo's Stars

Zeyi-Lin/HivisionIDPhotos
⚡️HivisionIDPhotos: a lightweight and efficient AI ID photos tools. 一个轻量级的AI证件照制作算法。
Language:Python10.5k 43 871k
outlines-dev/outlines
Structured Text Generation
Language:Python8.2k 47 553414
InternLM/MindSearch
🔍 An LLM-based Multi-agent Framework of Web Search Engine (like Perplexity.ai Pro and SearchGPT)
Language:Python4.8k 35 133469
1rgs/jsonformer
A Bulletproof Way to Generate Structured JSON from Language Models
Language:Jupyter Notebook4.4k 24 42153
facebookresearch/sapiens
High-resolution models for human tasks.
Language:Python4.1k 43 107216
lm-sys/RouteLLM
A framework for serving and evaluating LLM routers - save LLM costs without compromising quality!
Language:Python3k 25 46229
X-PLUG/MobileAgent
Mobile-Agent: The Powerful Mobile Device Operation Assistant Family
Language:Python2.8k 47 53259
BadToBest/EchoMimic
Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language:Python2.5k 38 152301
tencent-ailab/V-Express
V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
Language:Python2.2k 39 49277
AlibabaResearch/AdvancedLiterateMachinery
A collection of original, innovative ideas and algorithms towards Advanced Literate Machinery. This project is maintained by the OCR Team in the Language Technology Lab, Tongyi Lab, Alibaba Group.
Language:C++1.4k 33 166165
THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Language:Python1.1k 15 2896
muzishen/IMAGDressing
👔IMAGDressing👔: Interactive Modular Apparel Generation for Virtual Dressing
Language:Python977 13 3584
ButzYung/SystemAnimatorOnline
XR Animator, AI-based Full Body Motion Capture and Extended Reality (XR) solution, powered by System Animator Online
Language:JavaScript826 30 9069
TencentARC/SEED-Story
SEED-Story: Multimodal Long Story Generation with Large Language Model
Language:Python712 16 2356
tijiang13/InstantAvatar
InstantAvatar: Learning Avatars from Monocular Video in 60 Seconds (CVPR 2023)
Language:Python365 13 7631
zhenzhiwang/HumanVid
Official implementation of HumanVid, NeurIPS D&B Track 2024
Language:Python213 30 153
remyxai/FFMPerative
Chat to Compose Video
Language:Python169 7 29
SMPLOlympics/SMPLOlympics
Language:Python141 10 106
tobias-kirschstein/diffusion-avatars
Language:Jupyter Notebook129 7 217
jimmyYliu/Animatable-3D-Gaussian
Language:Python125 7 116
deepinstinct/ShimMe
Language:C++123 2 216
ZebangCheng/Emotion-LLaMA
Emotion-LLaMA: Multimodal Emotion Recognition and Reasoning with Instruction Tuning
Language:Python82 5 117
ashwath98/deepcharacters
Code base for Holoported Characters
Language:C++65 5 110
SocAIty/py_audio2face
Use the NVIDIA Audio2Face headless server and interact with it through a requests API. Generate animation sequences for Unreal Engine 5, Maya and MetaHumans
Language:Python43 3 810
detalhe/vehicle-ai
Identify any vehicle using AI. Built with Node.js, Express.js, EJS, and the Google Gemini API.
Language:JavaScript42 1 07
ninglab/eCeLLM
Language:Jupyter Notebook26 4 93
Jyxarthur/AutoAD-Zero
Official Implementation of "AutoAD-Zero: A Training-Free Framework for Zero-Shot Audio Description". Junyu Xie, Tengda Han, Max Bain, Arsha Nagrani, Gül Varol, Weidi Xie, Andrew Zisserman
Language:Python16 2 01
SuXinqi/DAAD
Offical code repository of ”DAAD: Dynamic Analysis and Adaptive Discriminator for Fake News Detection“
8
Ismaelbrendo/whats-spoofing
Language:Go30
mit-ccc/AudienceView-demo
AudienceView demo
Language:Python1