Pinned Repositories
d3rlpy
An offline deep reinforcement learning library
junming-yang.github.io
Personal Website: Junming Yang (杨骏铭)
mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
MBPO-pytorch
A PyTorch re-implementation of Model-Based Policy Optimization (MBPO)
MOAN
Model-based Offline Policy Optimization with Adversarial Network
mopo
A complete PyTorch re-implementation of Model-based Offline Policy Optimization (MOPO)
offline_rl
Offline RL codebase for Unstable Baselines
VLMEvalKit
An open-source evaluation toolkit for large vision-language models (LVLMs); supports GPT-4V, Gemini, QwenVLPlus, 30+ HF models, and 15+ benchmarks
VLMEvalKit
An open-source evaluation toolkit for large vision-language models (LVLMs); supports ~100 VLMs and 40+ benchmarks
WorldModelPapers
A collection of papers from the continuing line of work that began with World Models.
junming-yang's Repositories
junming-yang/mopo
A complete PyTorch re-implementation of Model-based Offline Policy Optimization (MOPO)
junming-yang/MBPO-pytorch
A PyTorch re-implementation of Model-Based Policy Optimization (MBPO)
junming-yang/MOAN
Model-based Offline Policy Optimization with Adversarial Network
junming-yang/junming-yang.github.io
Personal Website: Junming Yang (杨骏铭)
junming-yang/offline_rl
Offline RL codebase for Unstable Baselines
junming-yang/VLMEvalKit
An open-source evaluation toolkit for large vision-language models (LVLMs); supports GPT-4V, Gemini, QwenVLPlus, 30+ HF models, and 15+ benchmarks
junming-yang/d3rlpy
An offline deep reinforcement learning library
junming-yang/mbpo
Code for the paper "When to Trust Your Model: Model-Based Policy Optimization"
junming-yang/mopo-pytorch
A PyTorch re-implementation of the offline model-based RL algorithm MOPO
junming-yang/opencompass
OpenCompass is an LLM evaluation platform supporting a wide range of models (InternLM2, GPT-4, LLaMA2, Qwen, GLM, Claude, etc.) across 100+ datasets.
junming-yang/WorldModelPapers
A collection of papers from the continuing line of work that began with World Models.