libhot

libhot's Stars

hacksider/Deep-Live-Cam
real time face swap and one-click video deepfake with only a single image
Language:Python41.1k 243 5456k
OpenBMB/MiniCPM-V
MiniCPM-V 2.6: A GPT-4V Level MLLM for Single Image, Multi Image and Video on Your Phone
Language:Python12.7k 106 589893
peng-zhihui/Dummy-Robot
我的超迷你机械臂机器人项目。
Language:C12.4k 335 1742.7k
Ucas-HaoranWei/GOT-OCR2.0
Official code implementation of General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Language:Python6.1k 53 157528
iqiyi/dpvs
DPVS is a high performance Layer-4 load balancer based on DPDK.
Language:C3k 192 397727
BadToBest/EchoMimic
EchoMimic: Lifelike Audio-Driven Portrait Animations through Editable Landmark Conditioning
Language:Python2.9k 43 180339
NVlabs/VILA
VILA - a multi-image visual language model with training, inference and evaluation recipe, deployable from cloud to edge (Jetson Orin and laptops)
Language:Python2k 28 131163
NUS-HPC-AI-Lab/VideoSys
VideoSys: An easy and efficient system for video generation
Language:Python1.8k 27 88123
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
Language:Python1.5k 19 110108
THUDM/LongWriter
LongWriter: Unleashing 10,000+ Word Generation from Long Context LLMs
Language:Python1.5k 17 34136
menyifang/MIMO
Official implementation of "MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling"
1.3k 110 2553
AuvaLab/itext2kg
Incremental Knowledge Graphs Constructor Using Large Language Models
Language:Python585 13 2256
Alpha-VLLM/Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
Language:Python506 6 3022
Yuliang-Liu/MultimodalOCR
On the Hidden Mystery of OCR in Large Multimodal Models (OCRBench)
Language:Python475 14 3032
lujiazho/SegDrawer
Simple static web-based mask drawer, supporting semantic segmentation and video segmentation with interactive Segment Anything Model 2 (SAM2).
Language:Python359 11 1633
opendatalab/magic-html
Language:Python268 4 1424
aim-uofa/MovieDreamer
254 21 37
stylellm/stylellm_models
StyleLLM文风大模型：基于大语言模型的文本风格迁移项目。Text style transfer base on Large Language Model. #文字修饰 # 润色 #风格模仿
249 1 914
GuijiAI/ReHiFace-S
Real Time High-Fidelity Faceswap
Language:Python24055
LingyvKong/OneChart
[ACM'MM 2024 Oral] Official code for "OneChart: Purify the Chart Structural Extraction via One Auxiliary Token"
Language:Python199 1 2115
TaoHuUMD/StructLDM
Language:Python103 26 74
vaew/SkyScript-100M
SkyScript-100M: 1,000,000,000 Pairs of Scripts and Shooting Scripts for Short Drama: https://arxiv.org/abs/2408.09333v2
102 5 25
light-and-ray/sd-webui-lama-cleaner-masked-content
Use lama cleaner before inpainting inside stable-diffusion-webui
Language:Python85 1 72
LayTextLLM/LayTextLLM
Language:Jupyter Notebook67 3 139
WuTao-CS/CustomCrafter
CustomCrafter: Customized Video Generation with Preserving Motion and Concept Composition Abilities
341
chxy95/GenLV
ACM MM2024 - Learning A Low-Level Vision Generalist via Visual Task Prompt
26 3 11
loiccoyle/phomo
📷 Python package and CLI utility to create photo mosaics - now with GPU support
Language:Python16 2 13
encryptorion-lab/cuDilithium
CUDA-accelerated Dilithium Implementation
Language:Cuda81
yaojunWang/smx
无任何第三方依赖sm2,sm3,sm4,完全参照国家密码局要求实现的国密算法,java,javascript,swift实现.
Language:Java4 2 00
Faizan0100/tsn-svgenius
A Streamlit-based web application that converts images to SVG format. Features include AI-powered background removal, edge enhancement, image generation from text prompts, and customizable SVG output. Perfect for designers, developers, and digital artists looking to create scalable vector graphics from raster images.
Language:Python2