yuezih

Ph.D. Student, RUC

Renmin University of ChinaBeijing

yuezih's Stars

zhaoyue-zephyrus/AVION
[arXiv:2309.16669] Code release for "Training a Large Video Model on a Single Machine in a Day"
Language:Python1116
bdaiinstitute/theia
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Language:Python1516
zhangyikaii/Proto-CAT
The code repository for "Audio-Visual Generalized Few-Shot Learning with Prototype-Based Co-Adaptation"
Language:Python10
zhangyikaii/LAMDA-ZhiJian
ZhiJian: A Unifying and Rapidly Deployable Toolbox for Pre-trained Model Reuse
Language:Python502
FlagOpen/FlagScale
FlagScale is a large model toolkit based on open-sourced projects.
Language:Python14342
yuezih/SMILE
Learning Descriptive Image Captioning via Semipermeable Maximum Likelihood Estimation (NeurIPS 2023)
Language:Jupyter Notebook211
ylwhxht/SRKD-DRET
AAAI2024 - Sunshine to Rainstorm: Cross-Weather Knowledge Distillation for Robust 3D Object Detection
Language:Python312
yuezih/less-is-more
Less is More: Mitigating Multimodal Hallucination from an EOS Decision Perspective (ACL 2024)
Language:Python27
yuezih/Movie101
Narrative movie understanding benchmark
Language:Python56
showlab/Awesome-MLLM-Hallucination
📖 A curated list of resources dedicated to hallucination of multimodal large language models (MLLM).
40711
X-PLUG/mPLUG-DocOwl
mPLUG-DocOwl: Modularized Multimodal Large Language Model for Document Understanding
Language:Python1.4k87
OpenGVLab/Ask-Anything
[CVPR2024 Highlight][VideoChatGPT] ChatGPT with video understanding! And many more supported LMs such as miniGPT4, StableLM, and MOSS.
Language:Python3k248
EricLee8/MPD_EMVI
Official implementation of our paper at ACL 2023: Pre-training Multi-party Dialogue Models with Latent Discourse Inference
Language:Python10
ccfddl/ccf-deadlines
⏰ Collaboratively track deadlines of conferences recommended by CCF (Website, Python Cli, Wechat Applet) / If you find it useful, please star this project, thanks~
Language:Vue6.1k429
YixunLiang/ReTR
Official code of ReTR (NeurIPS 2023)
Language:Python41
yaolinli/CapEnrich
Language:Python5
haotian-liu/LLaVA
[NeurIPS'23 Oral] Visual Instruction Tuning (LLaVA) built towards GPT-4V level capabilities and beyond.
Language:Python19.8k2.2k
voxel51/fiftyone
Refine high-quality datasets and visual AI models
Language:Python8.8k553
ML-GSAI/DPT
Official PyTorch implementation for "Diffusion Models and Semi-Supervised Learners Benefit Mutually with Few Labels"
Language:Python762
xieyuquanxx/awesome-Large-MultiModal-Hallucination
😎 up-to-date & curated list of awesome LMM hallucinations papers, methods & resources.
14111
eric-ai-lab/awesome-vision-language-navigation
A curated list for vision-and-language navigation. ACL 2022 paper "Vision-and-Language Navigation: A Survey of Tasks, Methods, and Future Directions"
36919
EnVision-Research/LucidDreamer
Official implementation of "LucidDreamer: Towards High-Fidelity Text-to-3D Generation via Interval Score Matching"
Language:Python74532
DarkHighness/opendigger-cli
Language:Rust21
bcbi-edu/p_eickhoff_isoscore
Language:Jupyter Notebook292
TideDancer/iclr21_isotropy_contxt
Language:Python273
ainagari/monopoly
Language:Python121
wtimkey/rogue-dimensions
replication code for EMNLP 2021 paper
Language:Jupyter Notebook113
diff-usion/Awesome-Diffusion-Models
A collection of resources and papers on Diffusion Models
Language:HTML10.9k942
wangkai930418/awesome-diffusion-categorized
collection of diffusion model papers categorized by their subareas
1.2k60
THUDM/CogVLM
a state-of-the-art-level open visual language model | 多模态预训练模型
Language:Python5.9k408

yuezih

yuezih's Stars

zhaoyue-zephyrus/AVION

bdaiinstitute/theia

zhangyikaii/Proto-CAT

zhangyikaii/LAMDA-ZhiJian

FlagOpen/FlagScale

yuezih/SMILE

ylwhxht/SRKD-DRET

yuezih/less-is-more

yuezih/Movie101

showlab/Awesome-MLLM-Hallucination

X-PLUG/mPLUG-DocOwl

OpenGVLab/Ask-Anything

EricLee8/MPD_EMVI

ccfddl/ccf-deadlines

YixunLiang/ReTR

yaolinli/CapEnrich

haotian-liu/LLaVA

voxel51/fiftyone

ML-GSAI/DPT

xieyuquanxx/awesome-Large-MultiModal-Hallucination

eric-ai-lab/awesome-vision-language-navigation

EnVision-Research/LucidDreamer

DarkHighness/opendigger-cli

bcbi-edu/p_eickhoff_isoscore

TideDancer/iclr21_isotropy_contxt

ainagari/monopoly

wtimkey/rogue-dimensions

diff-usion/Awesome-Diffusion-Models

wangkai930418/awesome-diffusion-categorized

THUDM/CogVLM