Pinned Repositories
Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
ChatLaw
ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型
ConsisID
Identity-Preserving Text-to-Video Generation by Frequency Decomposition
LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
LLaVA-CoT
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
Machine-Mindset
An MBTI Exploration of Large Language Models
MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
PKU-YUAN-Lab (袁粒课题组-北大信工)'s Repositories
PKU-YuanGroup/Open-Sora-Plan
This project aim to reproduce Sora (Open AI T2V model), we wish the open source community contribute to this project.
PKU-YuanGroup/ChatLaw
ChatLaw:A Powerful LLM Tailored for Chinese Legal. 中文法律大模型
PKU-YuanGroup/Video-LLaVA
【EMNLP 2024🔥】Video-LLaVA: Learning United Visual Representation by Alignment Before Projection
PKU-YuanGroup/MoE-LLaVA
Mixture-of-Experts for Large Vision-Language Models
PKU-YuanGroup/LLaVA-CoT
LLaVA-CoT, a visual language model capable of spontaneous, systematic reasoning
PKU-YuanGroup/MagicTime
MagicTime: Time-lapse Video Generation Models as Metamorphic Simulators
PKU-YuanGroup/Chat-UniVi
[CVPR 2024 Highlight🔥] Chat-UniVi: Unified Visual Representation Empowers Large Language Models with Image and Video Understanding
PKU-YuanGroup/LanguageBind
【ICLR 2024🔥】 Extending Video-Language Pretraining to N-modality by Language-based Semantic Alignment
PKU-YuanGroup/ConsisID
Identity-Preserving Text-to-Video Generation by Frequency Decomposition
PKU-YuanGroup/Machine-Mindset
An MBTI Exploration of Large Language Models
PKU-YuanGroup/repaint123
Official implementation of Repaint123: Fast and High-quality One Image to 3D Generation with Progressive Controllable 2D Repainting (ECCV 2024)
PKU-YuanGroup/Cycle3D
[AAAI 2025🔥] Official implementation of Cycle3D: High-quality and Consistent Image-to-3D Generation via Generation-Reconstruction Cycle
PKU-YuanGroup/ChronoMagic-Bench
[NeurIPS 2024 D&B Spotlight🔥] ChronoMagic-Bench: A Benchmark for Metamorphic Evaluation of Text-to-Time-lapse Video Generation
PKU-YuanGroup/ProLLaMA
A Protein Large Language Model for Multi-Task Protein Language Processing
PKU-YuanGroup/Hallucination-Attack
Attack to induce LLMs within hallucinations
PKU-YuanGroup/Video-Bench
A Comprehensive Benchmark and Toolkit for Evaluating Video-based Large Language Models!
PKU-YuanGroup/Envision3D
Envision3D: One Image to 3D with Anchor Views Interpolation
PKU-YuanGroup/WF-VAE
Enhancing Video VAE by Wavelet-Driven Energy Flow for Latent Video Diffusion Model
PKU-YuanGroup/Open-Sora-Dataset
PKU-YuanGroup/ChatExcel
PKU-YuanGroup/Next-Patch-Prediction
PKU-YuanGroup/TaxDiff
The official code for "TaxDiff: Taxonomic-Guided Diffusion Model for Protein Sequence Generation"
PKU-YuanGroup/EvaGaussians
PKU-YuanGroup/LLaVA-o1
PKU-YuanGroup/Peer-review-in-LLMs
Peer-review-in-LLMs: Automatic Evaluation Method for LLMs in Open-environment,https://arxiv.org/pdf/2402.01830.pdf
PKU-YuanGroup/N-LoRA
【COLING 2025🔥】Code for the paper "Is Parameter Collision Hindering Continual Learning in LLMs?".
PKU-YuanGroup/LLMBind
LLMBind: A Unified Modality-Task Integration Framework