Pinned Repositories
EVCS-rollling-opt
general-tls
general traffic light agent
LLaMA-Efficient-Tuning
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)
LLM-EasyDeploy
We decide to construct a lightweight, scalable platform from scratch for deploying LLM and MLLM large models.
tzm-vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
UC-using-pulp
Modelling Unit Commitment Problem as a MIP problem, using pulp package and cplex(or gurobi) to solve it.
Xiangxiangzhu.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
charginghub-env
Code for the charginghub environment used in our paper "__"
all-in-one-llm
Deployment a light and full OpenAI API for production
ZhongjiaoGPT
AI power road design (under dev)
Xiangxiangzhu's Repositories
Xiangxiangzhu/UC-using-pulp
Modelling Unit Commitment Problem as a MIP problem, using pulp package and cplex(or gurobi) to solve it.
Xiangxiangzhu/tzm-vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
Xiangxiangzhu/EVCS-rollling-opt
Xiangxiangzhu/general-tls
general traffic light agent
Xiangxiangzhu/LLaMA-Efficient-Tuning
Easy-to-use LLM fine-tuning framework (LLaMA-2, BLOOM, Falcon, Baichuan, Qwen, ChatGLM2)
Xiangxiangzhu/LLM-EasyDeploy
We decide to construct a lightweight, scalable platform from scratch for deploying LLM and MLLM large models.
Xiangxiangzhu/LLM-inference
Xiangxiangzhu/mldm_temp
Xiangxiangzhu/PARL
A high-performance distributed training framework for Reinforcement Learning
Xiangxiangzhu/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
Xiangxiangzhu/Xiangxiangzhu.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
Xiangxiangzhu/proc-tensorflow-tls
distributed learning and simulation using tensorflow for general traffic light agent
Xiangxiangzhu/rqwrwq
Xiangxiangzhu/safe-explorer
Pytorch implementation of "Safe Exploration in Continuous Action Spaces" [Dalal et al.]
Xiangxiangzhu/safety-starter-agents
Basic constrained RL agents used in experiments for the "Benchmarking Safe Exploration in Deep Reinforcement Learning" paper.
Xiangxiangzhu/sumo_net
Xiangxiangzhu/sumolights
SUMO adaptive traffic signal control - DQN, DDPG, Webster's, Max-pressure, Self-Organizing Traffic Lights
Xiangxiangzhu/test_lag_theory
Xiangxiangzhu/test_tt_theory
Xiangxiangzhu/TrafficGPT