jainie-max's Stars
lucidrains/mixture-of-experts
A PyTorch implementation of Sparsely-Gated Mixture of Experts, for massively increasing the parameter count of language models
TUDB-Labs/MoE-PEFT
An Efficient LLM Fine-Tuning Factory Optimized for MoE PEFT
THUDM/CogVideo
Text- and image-to-video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
VITA-Group/Diffusion4D
"Diffusion4D: Fast Spatial-temporal Consistent 4D Generation via Video Diffusion Models", Hanwen Liang*, Yuyang Yin*, Dejia Xu, Hanxue Liang, Zhangyang Wang, Konstantinos N. Plataniotis, Yao Zhao, Yunchao Wei
QwenLM/Qwen2.5
Qwen2.5 is the large language model series developed by the Qwen team at Alibaba Cloud.
lloongx/DIKI
[ECCV 2024] Mind the Interference: Retaining Pre-trained Knowledge in Parameter Efficient Continual Learning of Vision-Language Models
google-research/l2p
Learning to Prompt (L2P) for Continual Learning @ CVPR22 and DualPrompt: Complementary Prompting for Rehearsal-free Continual Learning @ ECCV22
JingyangQiao/prompt-gradient-projection
liangyanshuo/InfLoRA
The official implementation of the CVPR 2024 work "Interference-Free Low-Rank Adaptation for Continual Learning"
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by the Qwen team at Alibaba Cloud.
LLaVA-VL/LLaVA-NeXT
e2b-dev/awesome-ai-agents
A list of AI autonomous agents
jun0wanan/awesome-large-multimodal-agents
WooooDyy/LLM-Agent-Paper-List
The paper list of the 86-page paper "The Rise and Potential of Large Language Model Based Agents: A Survey" by Zhiheng Xi et al.
langgenius/dify
Dify is an open-source LLM app development platform. Dify's intuitive interface combines AI workflow, RAG pipeline, agent capabilities, model management, observability features and more, letting you quickly go from prototype to production.
donydchen/mvsplat
🌊 [ECCV'24 Oral] MVSplat: Efficient 3D Gaussian Splatting from Sparse Multi-View Images
Ekko-zn/AIGCDetectBenchmark
Daisy-Zhang/Awesome-AIGC-Detection
A collection list of AIGC detection related papers.
NeeluMadan/ViFM_Survey
Foundation Models for Video Understanding: A Survey
yunlong10/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥 Latest papers, code, and datasets on Vid-LLMs.
THUDM/GLM-4
GLM-4 series: Open Multilingual Multimodal Chat LMs
ShuvenduRoy/CoPrompt
[ICLR'24] Consistency-guided Prompt Learning for Vision-Language Models
SCLBD/DeepfakeBench
A comprehensive benchmark for deepfake detection
flyingby/Awesome-Deepfake-Generation-and-Detection
A Survey on Deepfake Generation and Detection
Daisy-Zhang/Awesome-Deepfakes-Detection
A list of tools, papers and code related to Deepfake Detection.
DepthAnything/Depth-Anything-V2
Depth Anything V2. A More Capable Foundation Model for Monocular Depth Estimation
VITA-Group/4DGen
"4DGen: Grounded 4D Content Generation with Spatial-temporal Consistency", Yuyang Yin*, Dejia Xu*, Zhangyang Wang, Yao Zhao, Yunchao Wei
ZjjConan/Multi-Modal-Adapter
The official PyTorch implementation of our CVPR 2024 paper "MMA: Multi-Modal Adapter for Vision-Language Models".
miccunifi/KDPL
[ECCV 2024] - Improving Zero-shot Generalization of Learned Prompts via Unsupervised Knowledge Distillation
black-forest-labs/flux
Official inference repo for FLUX.1 models