huangshiyu13
Shiyu Huang(黄世宇), Deep RL, Multi-agent RL, CV, NLP, AGI, https://github.com/OpenRL-Lab/openrl
Zhipu AIBeijing, China
Pinned Repositories
couplet_generation
generate couplet(对联生成) Tensorflow
deepfake_detection
detect deepfake images(AI换脸检测), Pytorch
recite_English_words
按照词频背英语单词
RPNplus
RPN+(Tensorflow) for people detection
webtemplate
收集各种网站前端模板
openrl
Unified Reinforcement Learning Framework
Wandb_Tutorial
How to use wandb?
CogVideo
text and image to video generation: CogVideoX (2024) and CogVideo (ICLR 2023)
CogVLM2
GPT4V-level open-source multi-modal model based on Llama3-8B
LVBench
LVBench: An Extreme Long Video Understanding Benchmark
huangshiyu13's Repositories
huangshiyu13/webtemplate
收集各种网站前端模板
huangshiyu13/Fleet
Fleet is a generic distributed task distribution framework based on a distributed file system.
huangshiyu13/glm-4v-plus_API_usage
How to use GLM-4V-Plus API
huangshiyu13/openrl
通用强化学习研究框架
huangshiyu13/Awesome-LLMs-for-Video-Understanding
🔥🔥🔥Latest Papers, Codes and Datasets on Vid-LLMs.
huangshiyu13/Awesome-Papers-Autonomous-Agent
A collection of recent papers focusing on autonomous agent.
huangshiyu13/Awesome_Quadrupedal_Robots
Awesome Quadrupedal Robots
huangshiyu13/BEPb
Config files for my GitHub profile.
huangshiyu13/ChufanSuki
huangshiyu13/codecov-demo
huangshiyu13/DeepSpeed_hsy
DeepSpeed is a deep learning optimization library that makes distributed training and inference easy, efficient, and effective.
huangshiyu13/DI-engine-1
OpenDILab Decision AI Engine
huangshiyu13/DI-engine-docs
DI-engine docs (Chinese and English)
huangshiyu13/easytrader
提供同花顺客户端/国金/华泰客户端/雪球的基金、股票自动程序化交易以及自动打新,支持跟踪 joinquant /ricequant 模拟交易 和 实盘雪球组合, 量化交易组件
huangshiyu13/free_dog_sdk_cpp
huangshiyu13/go2-webrtc
WebRTC API for Unitree GO2 Robots
huangshiyu13/huangshiyu13
huangshiyu13/huangshiyu13.github.io
Shiyu Huang's Personal Website
huangshiyu13/Megatron-LM
Ongoing research training transformer models at scale
huangshiyu13/newinml.com
huangshiyu13/newinml.org
huangshiyu13/openrl-docs
OpenRL文档
huangshiyu13/parkour
[CoRL 2023] Robot Parkour Learning
huangshiyu13/ray
Ray is a unified framework for scaling AI and Python applications. Ray consists of a core distributed runtime and a toolkit of libraries (Ray AIR) for accelerating ML workloads.
huangshiyu13/rl-baselines3-zoo
A training framework for Stable Baselines3 reinforcement learning agents, with hyperparameter optimization and pre-trained agents included.
huangshiyu13/stable-baselines3-1
PyTorch version of Stable Baselines, reliable implementations of reinforcement learning algorithms.
huangshiyu13/test-docs
huangshiyu13/TiZero_hsy
Code accompanying the paper "TiZero: Mastering Multi-Agent Football with Curriculum Learning and Self-Play" (AAMAS 2023) 足球游戏智能体
huangshiyu13/videollm-online
VideoLLM-online: Online Video Large Language Model for Streaming Video (CVPR 2024)
huangshiyu13/vnpy
基于Python的开源量化交易平台开发框架