libhot's Stars
hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
peng-zhihui/Dummy-Robot
我的超迷你机械臂机器人项目。
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
iqiyi/dpvs
DPVS is a high performance Layer-4 load balancer based on DPDK.
BadToBest/EchoMimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
NUS-HPC-AI-Lab/VideoSys
VideoSys: An easy and efficient system for video generation
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
menyifang/MIMO
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
AuvaLab/itext2kg
Incremental Knowledge Graphs Constructor Using Large Language Models
Alpha-VLLM/Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
Yuliang-Liu/MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
lujiazho/SegDrawer
Simple static web-based mask drawer, supporting semantic segmentation and video segmentation with interactive Segment Anything Model 2 (SAM2).
opendatalab/magic-html
aim-uofa/MovieDreamer
stylellm/stylellm_models
StyleLLM文风大模型:基于大语言模型的文本风格迁移项目。Text style transfer base on Large Language Model. #文字修饰 # 润色 #风格模仿
GuijiAI/ReHiFace-S
Real Time High-Fidelity Faceswap
LingyvKong/OneChart
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
TaoHuUMD/StructLDM
vaew/SkyScript-100M
SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2
light-and-ray/sd-webui-lama-cleaner-masked-content
Use lama cleaner before inpainting inside stable-diffusion-webui
LayTextLLM/LayTextLLM
WuTao-CS/CustomCrafter
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
chxy95/GenLV
ACM MM2024 - Learning A Low-Level Vision Generalist via Visual Task Prompt
loiccoyle/phomo
📷 Python package and CLI utility to create photo mosaics - now with GPU support
encryptorion-lab/cuDilithium
CUDA-accelerated Dilithium Implementation
yaojunWang/smx
无任何第三方依赖sm2,sm3,sm4,完全参照国家密码局要求实现的国密算法,java,javascript,swift实现.
Faizan0100/tsn-svgenius
A Streamlit-based web application that converts images to SVG format. Features include AI-powered background removal, edge enhancement, image generation from text prompts, and customizable SVG output. Perfect for designers, developers, and digital artists looking to create scalable vector graphics from raster images.