Pinned Repositories
HA-DPO
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
Latex-Report-Template-for-SEU
This project maintains a LaTeX template for SEU lab reports.
lmm-r1
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
lmms-eval
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
mert-mel
Multimodality-Linking
seu-thesis-typst
Typst thesis template library for Southeast University
TideDra.github.io
My blog
VL-RLHF
An RLHF Infrastructure for Vision-Language Models
zotero-arxiv-daily
Recommend new arXiv papers of your interest daily according to your Zotero library.
TideDra's Repositories
TideDra/zotero-arxiv-daily
Recommend new arXiv papers of your interest daily according to your Zotero library.
TideDra/lmm-r1
Extend OpenRLHF to support LMM RL training for reproduction of DeepSeek-R1 on multimodal tasks.
TideDra/VL-RLHF
An RLHF Infrastructure for Vision-Language Models
TideDra/seu-thesis-typst
Typst thesis template library for Southeast University
TideDra/TideDra.github.io
My blog
TideDra/HA-DPO
Beyond Hallucinations: Enhancing LVLMs through Hallucination-Aware Direct Preference Optimization
TideDra/Latex-Report-Template-for-SEU
This project maintains a LaTeX template for SEU lab reports.
TideDra/lmms-eval
Accelerating the development of large multimodal models (LMMs) with one-click evaluation module - lmms-eval.
TideDra/mert-mel
TideDra/Multimodality-Linking
TideDra/DeepSpeed
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
TideDra/GPTFactory
An all-in-one pipeline that collects data from ChatGPT models.
TideDra/gptpdf
Using GPT to parse PDF
TideDra/MILe
TideDra/opencompass
OpenCompass is an LLM evaluation platform, supporting a wide range of models (InternLM2, GPT-4, LLaMa2, Qwen, GLM, Claude, etc.) over 100+ datasets.
TideDra/sglang
SGLang is a structured generation language designed for large language models (LLMs). It makes your interaction with models faster and more controllable.
TideDra/TideDra
Config files for my GitHub profile.
TideDra/VideoxDemo
TideDra/VLMEvalKit
Open-source evaluation toolkit for large vision-language models (LVLMs), supporting 160+ VLMs and 50+ benchmarks
TideDra/GPUSnatcher
Grab available GPUs
TideDra/OpenRLHF
An Easy-to-use, Scalable and High-performance RLHF Framework (70B+ PPO Full Tuning & Iterative DPO & LoRA & RingAttention & RFT)
TideDra/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
TideDra/Woodpecker
✨✨Woodpecker: Hallucination Correction for Multimodal Large Language Models. The first work to correct hallucinations in MLLMs.