ZhengyaoJiang
Cofounder of @WecoAI , PhD in Machine Learning @ucl-dark. Building AI Agents that build AI
University College LondonLondon, UK
Pinned Repositories
chatarena
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
aideml
AIDE: AI-Driven Exploration in the Space of Code. State of the Art machine Learning engineering agents that automates AI R&D.
GradientInduction
Framework of DataLog Neural Program Synthesis
GTG
Source code of "Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning" (AAMAS 2021).
latentplan
Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.
NLRL
Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)
OLPS
Online Portfolio Selection toolbox
PGPortfolio
PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).
rl-portfolio-management
Attempting to replicate "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" https://arxiv.org/abs/1706.10059 (and an openai gym environment)
SURF2016
ZhengyaoJiang's Repositories
ZhengyaoJiang/PGPortfolio
PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).
ZhengyaoJiang/latentplan
Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.
ZhengyaoJiang/NLRL
Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)
ZhengyaoJiang/GTG
Source code of "Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning" (AAMAS 2021).
ZhengyaoJiang/GradientInduction
Framework of DataLog Neural Program Synthesis
ZhengyaoJiang/rl-portfolio-management
Attempting to replicate "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" https://arxiv.org/abs/1706.10059 (and an openai gym environment)
ZhengyaoJiang/OLPS
Online Portfolio Selection toolbox
ZhengyaoJiang/SURF2016
ZhengyaoJiang/graphbackup
Code release for Graph Backup: Data Efficient Backup Exploiting Markovian Transitions https://arxiv.org/abs/2205.15824
ZhengyaoJiang/awesome-decentralized-llm
Collection of LLM resources that can be used to build products you can "own" or to perform reproducible research.
ZhengyaoJiang/MentalVr
The virtual reality controlled by mental command and voice
ZhengyaoJiang/pdf-to-markdown
Convert PDF files into markdown files
ZhengyaoJiang/RnnFromScratch
build tensorflow high level rnn api from scratch
ZhengyaoJiang/tensorflow
Computation using data flow graphs for scalable machine learning
ZhengyaoJiang/cardboard-unity
Google Cardboard
ZhengyaoJiang/d4rl
A benchmark for offline reinforcement learning.
ZhengyaoJiang/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
ZhengyaoJiang/draw_convnet
ZhengyaoJiang/dreamerv2
Mastering Atari with Discrete World Models
ZhengyaoJiang/Inline_asm_snake
ZhengyaoJiang/neural-style
Neural style in TensorFlow! :art:
ZhengyaoJiang/ntp
End-to-End Differentiable Proving
ZhengyaoJiang/ray
A high-performance distributed execution engine
ZhengyaoJiang/TankAI
a programming game ,in which you can use code to control the tank.
ZhengyaoJiang/TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
ZhengyaoJiang/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
ZhengyaoJiang/tflearn
Deep learning library featuring a higher-level API for TensorFlow.
ZhengyaoJiang/ucl-dark.github.io
UCL Deciding, Acting, and Reasoning with Knowledge (DARK) Lab
ZhengyaoJiang/ucl-latex-thesis-templates
UCL LaTeX thesis templates.
ZhengyaoJiang/ZhengyaoJiang.github.io