yongliang-wu's Stars
hiyouga/LLaMA-Factory
Unified Efficient Fine-Tuning of 100+ LLMs (ACL 2024)
Caldis/Mos
一个用于在 macOS 上平滑你的鼠标滚动效果或单独设置滚动方向的小工具, 让你的滚轮爽如触控板 | A lightweight tool used to smooth scrolling and set scroll direction independently for your mouse on macOS
QwenLM/Qwen2-VL
Qwen2-VL is the multimodal large language model series developed by Qwen team, Alibaba Cloud.
AIGCDesignGroup/ReplaceAnything
LKI/chinese-calendar
判断一天是不是法定节假日/法定工作日(查看节假日安排)
lucidrains/transfusion-pytorch
Pytorch implementation of Transfusion, "Predict the Next Token and Diffuse Images with One Multi-Modal Model", from MetaAI
yaoyao-liu/minimal-light
A simple and elegant Jekyll theme for an academic personal homepage
ChenLiu-1996/CitationMap
A simple pip-installable Python tool to generate your own HTML citation world map from your Google Scholar ID.
EvolvingLMMs-Lab/LongVA
Long Context Transfer from Language to Vision
xiaoachen98/Open-LLaVA-NeXT
An open-source implementation for training LLaVA-NeXT.
ZHKKKe/Harmonizer
High-Resolution Image/Video Harmonization [ECCV 2022]
DAMO-NLP-SG/CoI-Agent
Official code for paper: Chain of Ideas: Revolutionizing Research via Novel Idea Development with LLM Agents
Con6924/SPM
Official implementation of paper "One-dimensional Adapter to Rule Them All: Concepts, Diffusion Models and Erasing Applications".
JoyHuYY1412/DDE_CIL
HKUNLP/DiffuLLaMA
DiffuGPT and DiffuLLaMA: Scaling Diffusion Language Models via Adaptation from Autoregressive Models
JoyHuYY1412/LST_LVIS
lhanchao777/LVLM-Hallucinations-Survey
This is the first released survey paper on hallucinations of large vision-language models (LVLMs). To keep track of this field and continuously update our survey, we maintain this repository of relevant references.
404874351/seu-lecture-reserve
东南大学讲座预约半自动脚本
yongliang-wu/ExploreCfg
[NeurIPS2023] Exploring Diverse In-Context Configurations for Image Captioning
FeipengMa6/VLoRA
[NeurIPS 2024] Visual Perception by Large Language Model’s Weights
yongliang-wu/MM-VID
Open source implementation of the paper "MM-Vid: Advancing Video Understanding with GPT-4V(ision)".
JoyHuYY1412/Class_Imbalanced_Semi_Supervised_Learning
jmhessel/pycocoevalcap
Python 3 support for the MS COCO caption evaluation tools
LaBaZh/Awesome-Opus-Long-Video-Understanding
Awesome research works specifically focused on long form videos understanding.
JoyHuYY1412/S4Former
Training Vision Transformers for Semi-Supervised Semantic Segmentation
yongliang-wu/seu-lecture-reserve
东南大学研究生人文讲座自动预约脚本
LaBaZh/OpenLongVA
An open-source implementation of LongVA for facilitating the large multi-modal model community.
yongliang-wu/unlearn-DoCoPreG
This repository contains the PyTorch implementation for [Preprint] Unlearning Concepts in Diffusion Model via Concept Domain Correction and Concept Preserving Gradient
saumyamalik/ClipScoreProject
yongliang-wu/NumPro