JethroJames's Stars
KwaiVGI/LivePortrait
Bring portraits to life!
SakanaAI/AI-Scientist
The AI Scientist: Towards Fully Automated Open-Ended Scientific Discovery 🧑🔬
OpenGVLab/InternVL
[CVPR 2024 Oral] InternVL Family: A Pioneering Open-Source Alternative to GPT-4o. 接近GPT-4o表现的开源多模态对话模型
Tencent/HunyuanDiT
Hunyuan-DiT : A Powerful Multi-Resolution Diffusion Transformer with Fine-Grained Chinese Understanding
hustvl/Vim
[ICML 2024] Vision Mamba: Efficient Visual Representation Learning with Bidirectional State Space Model
Alpha-VLLM/Lumina-T2X
Lumina-T2X is a unified framework for Text to Any Modality Generation
aigc-apps/EasyAnimate
📺 An End-to-End Solution for High-Resolution and Long Video Generation Based on Transformer Diffusion
wangkai930418/awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
test-time-training/ttt-lm-pytorch
Official PyTorch implementation of Learning to (Learn at Test Time): RNNs with Expressive Hidden States
rohitgandikota/erasing
Erasing Concepts from Diffusion Models
face-analysis/emonet
Official implementation of the paper "Estimation of continuous valence and arousal levels from faces in naturalistic conditions", Antoine Toisoul, Jean Kossaifi, Adrian Bulat, Georgios Tzimiropoulos and Maja Pantic, Nature Machine Intelligence, 2021
tianyi-lab/HallusionBench
[CVPR'24] HallusionBench: You See What You Think? Or You Think What You See? An Image-Context Reasoning Benchmark Challenging for GPT-4V(ision), LLaVA-1.5, and Other Multi-modality Models
YangLing0818/EditWorld
EditWorld: Simulating World Dynamics for Instruction-Following Image Editing
OPTML-Group/Unlearn-Saliency
[ICLR24 (Spotlight)] "SalUn: Empowering Machine Unlearning via Gradient-based Weight Saliency in Both Image Classification and Generation" by Chongyu Fan*, Jiancheng Liu*, Yihua Zhang, Eric Wong, Dennis Wei, Sijia Liu
LeMei/Multimodal-Affective-Computing-Survey
HaroldChen19/GaussianVTON
GaussianVTON: 3D Human Virtual Try-ON via Multi-Stage Gaussian Splatting Editing with Image Prompting
AmourWaltz/Reliable-LLM
MengyuanChen21/Awesome-Evidential-Deep-Learning
A curated publication list on evidential deep learning.
chi0tzp/FFHQFaceAlignment
Face alignment tool for transforming face images into FFHQ-style.
benedettaliberatori/T3AL
Official implementation of "Test-Time Zero-Shot Temporal Action Localization", CVPR 2024
Jiaxuan-Li/EVCap
[CVPR 2024] Retrieval-Augmented Image Captioning with External Visual-Name Memory for Open-World Comprehension
mlvlab/RALF
Official implementation of CVPR 2024 paper "Retrieval-Augmented Open-Vocabulary Object Detection".
Eaaguilart/cedl
Continual Evidential Deep Learning ICCVW 2023
MengyuanChen21/CVPR2023-OWTAL
[CVPR 2023] Cascade Evidential Learning for Open-world Weakly-supervised Temporal Action Localization
Jazzcharles/Egoinstructor
Pytorch implementation for Egoinstructor at CVPR 2024
KaijingOfficial/sram_vtg
source code of sram
JethroJames/TUNED
arXiv` 24: Trusted Unified Feature-Neighborhood Dynamics for Multi-View Classification
qtli/COMP7607-2024
dunknsabsw/BoViLA
HaroldChen19/gsvton