Pinned Repositories
TimeChamber
A Massively Parallel Large Scale Self-Play Framework
acm-challenge-workbook
《挑战程序设计竞赛》习题册攻略
AdversarialNetsPapers
The classical paper list with code about generative adversarial nets
awesome
😎 Awesome lists about all kinds of interesting topics
awesome-public-datasets
A topic-centric list of HQ open datasets. PR ☛☛☛
Awesome-PyTorch-Chinese
【干货】史上最全的PyTorch学习资源汇总
CRL
DeepMARL-PyTorch
Reinforcement Learning Codes
GRIP_Plus_Plus
rl_games
RL implementations
ZiyiLiubird's Repositories
ZiyiLiubird/GRIP_Plus_Plus
ZiyiLiubird/rl_games
RL implementations
ZiyiLiubird/Demo
Demo repo for tutotial articles on Opensource.com
ZiyiLiubird/EVO-PopulationBasedTraining
Population-Based Training (PBT) for Reinforcement Learning using Message Passing Interface (MPI)
ZiyiLiubird/RejectSampling
ZiyiLiubird/SRPPO
ZiyiLiubird/tianshou
An elegant PyTorch deep reinforcement learning library.
ZiyiLiubird/AgentLite
Customized AgentLite
ZiyiLiubird/AI-Toolbox
A C++ framework for MDPs and POMDPs with Python bindings
ZiyiLiubird/Bert-VITS2
vits2 backbone with multilingual-bert
ZiyiLiubird/BeyondDialogue
ZiyiLiubird/ContrastiveReflexion
ZiyiLiubird/crazyflie-clients-python
Host applications and library for Crazyflie written in Python.
ZiyiLiubird/Cxx_HOPL4_zh
Chinese translation of Bjarne Stroustrup's HOPL4 paper
ZiyiLiubird/evalscope
A streamlined and customizable framework for efficient large model evaluation and performance benchmarking
ZiyiLiubird/F5-TTS
Official code for "F5-TTS: A Fairytaler that Fakes Fluent and Faithful Speech with Flow Matching"
ZiyiLiubird/faster-whisper
Faster Whisper transcription with CTranslate2
ZiyiLiubird/GPT-SoVITS
1 min voice data can also be used to train a good TTS model! (few shot voice cloning)
ZiyiLiubird/LangChain_Examples
ZiyiLiubird/llm_inference
ZiyiLiubird/LLM_Tree_Search
(ICML 2024) Alphazero-like Tree-Search can guide large language model decoding and training
ZiyiLiubird/open-instruct
ZiyiLiubird/OpenRLHF
A Ray-based High-performance RLHF framework (support 70B+ models)
ZiyiLiubird/reflexion
[NeurIPS 2023] Reflexion: Language Agents with Verbal Reinforcement Learning
ZiyiLiubird/safe-rlhf
Safe-RLHF: Constrained Value Alignment via Safe Reinforcement Learning from Human Feedback
ZiyiLiubird/SIMA
Pytorch Implementation of Deepmind's SIMA: "Scaling Instructable Agents Across Many Simulated Worlds"
ZiyiLiubird/SPIN
The official implementation of Self-Play Fine-Tuning (SPIN)
ZiyiLiubird/TimeChamber
A Massively Parallel Large Scale Self-Play Framework
ZiyiLiubird/vits2
VITS2: Improving Quality and Efficiency of Single-Stage Text-to-Speech with Adversarial Learning and Architecture Design
ZiyiLiubird/ZiyiLiubird.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes