Pinned Repositories
dbn
my dbn
linux-wallpaperengine
An attempt to make wallpaper engine wallpapers compatible with Linux
pytorch-b3d
based on pytorch-i3d
TAP
TAP: An automated jailbreaking method for black-box LLMs
lizhongguo's Repositories
lizhongguo/4K4D
[CVPR 2024] 4K4D: Real-Time 4D View Synthesis at 4K Resolution
lizhongguo/Amphion
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
lizhongguo/AnimatableGaussians
Code for [CVPR 2024] "Animatable Gaussians: Learning Pose-dependent Gaussian Maps for High-fidelity Human Avatar Modeling"
lizhongguo/CatVTON
CatVTON is a simple and efficient virtual try-on diffusion model with 1) Lightweight Network (899.06M parameters in total), 2) Parameter-Efficient Training (49.57M trainable parameters) and 3) Simplified Inference (< 8G VRAM for 1024x768 resolution).
lizhongguo/champ
Champ: Controllable and Consistent Human Image Animation with 3D Parametric Guidance
lizhongguo/ChatTTS
ChatTTS is a generative speech model for daily dialogue.
lizhongguo/ck
Concurrency primitives, safe memory reclamation mechanisms and non-blocking (including lock-free) data structures designed to aid in the research, design and implementation of high performance concurrent systems developed in C99+.
lizhongguo/dspatch
The Refreshingly Simple Cross-Platform C++ Dataflow / Patching / Pipelining / Graph Processing / Stream Processing / Reactive Programming Framework
lizhongguo/facefusion
Next generation face swapper and enhancer
lizhongguo/Flash-VStream
This is the official implementation of "Flash-VStream: Memory-Based Real-Time Understanding for Long Video Streams"
lizhongguo/HoT
[CVPR 2024 🔥] Official implementation of the paper "⏳ Hourglass Tokenizer for Efficient Transformer-Based 3D Human Pose Estimation"
lizhongguo/InternVideo
Video Foundation Models & Data for Multimodal Understanding
lizhongguo/LivePortrait
Bring portraits to life!
lizhongguo/llama3-jailbreak
A trivial programmatic Llama 3 jailbreak. Sorry Zuck!
lizhongguo/llm-attacks
Universal and Transferable Attacks on Aligned Language Models
lizhongguo/LLM-Pruner
[NeurIPS 2023] LLM-Pruner: On the Structural Pruning of Large Language Models. Supports LLaMA, Llama-2, BLOOM, Vicuna, Baichuan, etc.
lizhongguo/llm-sp
Papers and resources related to the security and privacy of LLMs 🤖
lizhongguo/MiniCPM
MiniCPM-2B: An end-side LLM that outperforms Llama2-13B.
lizhongguo/MiniGPT4-video
lizhongguo/MusePose
MusePose: a Pose-Driven Image-to-Video Framework for Virtual Human Generation
lizhongguo/Open-AnimateAnyone
Unofficial Implementation of Animate Anyone
lizhongguo/OpenTAD
OpenTAD is an open-source temporal action detection (TAD) toolbox based on PyTorch.
lizhongguo/scenic
Scenic: A JAX Library for Computer Vision Research and Beyond
lizhongguo/shap
A game theoretic approach to explain the output of any machine learning model.
lizhongguo/surya
OCR, layout analysis, reading order, line detection in 90+ languages
lizhongguo/TensorRT-YOLO
🚀 TensorRT-YOLO: Supports YOLOv5, YOLOv8, YOLOv9, PP-YOLOE using TensorRT acceleration with EfficientNMS!
lizhongguo/Torch-Pruning
[CVPR 2023] Towards Any Structural Pruning; LLMs / SAM / Diffusion / Transformers / YOLOv8 / CNNs
lizhongguo/WindFlow
A C++17 Data Stream Processing Parallel Library for Multicores and GPUs
lizhongguo/YOWOv2
The second generation of YOWO action detector.
lizhongguo/ZORG-Jailbreak-Prompt-Text
Bypass restricted and censored content on AI chat prompts 😈