zideliu

Lucky!

Zhejiang University, Hangzhou, Zhejiang

zideliu's Stars

microsoft/generative-ai-for-beginners
21 Lessons, Get Started Building with Generative AI 🔗 https://microsoft.github.io/generative-ai-for-beginners/
Language: Jupyter Notebook 65.3k 561 12933.4k
mem0ai/mem0
The Memory layer for your AI apps
Language: Python 23k 131 6792.1k
KwaiVGI/LivePortrait
Bring portraits to life!
Language: Python 13.1k 116 3761.4k
facebookresearch/segment-anything-2
The repository provides code for running inference with the Meta Segment Anything Model 2 (SAM 2), links for downloading the trained model checkpoints, and example notebooks that show how to use the model.
Language: Jupyter Notebook 11.1k 64 259944
camel-ai/camel
🐫 CAMEL: Finding the Scaling Law of Agents. The first and the best multi-agent framework. https://www.camel-ai.org
Language: Python 5.7k 68 527692
VueTorrent/VueTorrent
The sleekest looking WEBUI for qBittorrent made with Vuejs!
Language: Vue 5.1k 24 825256
Kwai-Kolors/Kolors
Kolors Team
Language: Python 3.9k 38 138276
poloclub/transformer-explainer
Transformer Explained Visually: Learn How LLM Transformer Models Work with Interactive Visualization
Language: JavaScript 3.4k 34 17300
XLabs-AI/x-flux
Language: Python 1.7k 29 118118
TencentQQGYLab/ELLA
ELLA: Equip Diffusion Models with LLM for Enhanced Semantic Alignment
Language: Python 1.1k 42 4757
test-time-training/ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
Language: Python 1.1k 7 2562
RQLuo/MixTeX-Latex-OCR
MixTeX: multimodal LaTeX, ZhEn, and table OCR. It performs efficient CPU-based inference locally and offline on Windows.
Language: Python 888 2 2044
OSU-NLP-Group/Mind2Web
[NeurIPS'23 Spotlight] "Mind2Web: Towards a Generalist Agent for the Web"
Language: Jupyter Notebook 724 21 43101
buoyancy99/diffusion-forcing
Code for "Diffusion Forcing: Next-token Prediction Meets Full-Sequence Diffusion"
Language: Python 622 6 2329
Alpha-VLLM/Lumina-mGPT
Official Implementation of "Lumina-mGPT: Illuminate Flexible Photorealistic Text-to-Image Generation with Multimodal Generative Pretraining"
Language: Python 506 6 3022
Bujiazi/MotionClone
Official implementation of MotionClone: Training-Free Motion Cloning for Controllable Video Generation
Language: Python 370 18 1528
shikiw/OPERA
[CVPR 2024 Highlight] OPERA: Alleviating Hallucination in Multi-Modal Large Language Models via Over-Trust Penalty and Retrospection-Allocation
Language: Python 292 2 4526
showlab/Awesome-GUI-Agent
💻 A curated list of papers and resources for multi-modal Graphical User Interface (GUI) agents.
269 5 311
aim-uofa/MovieDreamer
254 21 37
linzhiqiu/t2v_metrics
Evaluating text-to-image/video/3D models with VQAScore
Language: Python 233 15 1221
yk7333/d3po
[CVPR 2024] Code for the paper "Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model"
Language: Python 173 7 1618
aim-uofa/AutoStory
145 22 44
tobias-kirschstein/diffusion-avatars
Language: Jupyter Notebook 137 8 418
Yushi-Hu/tifa
TIFA: Accurate and Interpretable Text-to-Image Faithfulness Evaluation with Question Answering
Language: Python 137 3 119
HansenHuang0823/PlacidDreamer
The official implementation of ACM Multimedia 2024 paper "PlacidDreamer: Advancing Harmony in Text-to-3D Generation".
Language: Python 125 6 615
aim-uofa/FreeCustom
[CVPR 2024] Official PyTorch implementation of FreeCustom: Tuning-Free Customized Image Generation for Multi-Concept Composition
Language: Python 112 14 113
aim-uofa/FreeCompose
Language: Jupyter Notebook 26 6 21
WebVLN/WebVLN
Official implementation of WebVLN: Vision-and-Language Navigation on Websites
Language: Python 23 1 10
zrealli/TIGIC
[ECCV 2024] Tuning-Free Image Customization with Image and Text Guidance
Language: Python 11 2 30
cuixing100876/InstaStyle
Language: Python 10 1 00
