ZhengyaoJiang

Cofounder of @WecoAI , PhD in Machine Learning @ucl-dark. Building AI Agents that build AI

University College LondonLondon, UK

Pinned Repositories

chatarena
ChatArena (or Chat Arena) is a Multi-Agent Language Game Environments for LLMs. The goal is to develop communication and collaboration capabilities of AIs.
Language:Python1.4k 19 23140
aideml
AIDE: AI-Driven Exploration in the Space of Code. State of the Art machine Learning engineering agents that automates AI R&D.
Language:Python802 19 15100
GradientInduction
Framework of DataLog Neural Program Synthesis
Language:Python26 3 34
GTG
Source code of "Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning" (AAMAS 2021).
Language:Python27 6 38
latentplan
Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.
Language:Python104 3 212
NLRL
Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)
Language:Python75 4 328
OLPS
Online Portfolio Selection toolbox
Language:Matlab8 2 03
PGPortfolio
PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).
Language:Python1.8k 132 129758
rl-portfolio-management
Attempting to replicate "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" https://arxiv.org/abs/1706.10059 (and an openai gym environment)
Language:Jupyter Notebook16 2 09
SURF2016
Language:Python6 2 03

ZhengyaoJiang's Repositories

ZhengyaoJiang/PGPortfolio
PGPortfolio: Policy Gradient Portfolio, the source code of "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem"(https://arxiv.org/pdf/1706.10059.pdf).
Language:Python1.8k 132 129758
ZhengyaoJiang/latentplan
Code release for Efficient Planning in a Compact Latent Action Space (ICLR2023) https://arxiv.org/abs/2208.10291.
Language:Python104 3 212
ZhengyaoJiang/NLRL
Source code of Neural Logic Reinforcement Learning (https://arxiv.org/abs/1904.10729)
Language:Python75 4 328
ZhengyaoJiang/GTG
Source code of "Grid-to-Graph: Flexible Spatial Relational Inductive Biases for Reinforcement Learning" (AAMAS 2021).
Language:Python27 6 38
ZhengyaoJiang/GradientInduction
Framework of DataLog Neural Program Synthesis
Language:Python26 3 34
ZhengyaoJiang/rl-portfolio-management
Attempting to replicate "A Deep Reinforcement Learning Framework for the Financial Portfolio Management Problem" https://arxiv.org/abs/1706.10059 (and an openai gym environment)
Language:Jupyter Notebook16 2 09
ZhengyaoJiang/OLPS
Online Portfolio Selection toolbox
Language:Matlab8 2 03
ZhengyaoJiang/SURF2016
Language:Python6 2 03
ZhengyaoJiang/graphbackup
Code release for Graph Backup: Data Efficient Backup Exploiting Markovian Transitions https://arxiv.org/abs/2205.15824
Language:Python5 2 11
ZhengyaoJiang/awesome-decentralized-llm
Collection of LLM resources that can be used to build products you can "own" or to perform reproducible research.
1 1 01
ZhengyaoJiang/MentalVr
The virtual reality controlled by mental command and voice
Language:Java1 3 01
ZhengyaoJiang/pdf-to-markdown
Convert PDF files into markdown files
Language:Python1 2 0
ZhengyaoJiang/RnnFromScratch
build tensorflow high level rnn api from scratch
Language:Jupyter Notebook1 2 01
ZhengyaoJiang/tensorflow
Computation using data flow graphs for scalable machine learning
Language:C++1 2 0
ZhengyaoJiang/cardboard-unity
Google Cardboard
Language:C#2 0
ZhengyaoJiang/d4rl
A benchmark for offline reinforcement learning.
Language:Python1 01
ZhengyaoJiang/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
Language:Python1 0
ZhengyaoJiang/draw_convnet
Language:Python2 0
ZhengyaoJiang/dreamerv2
Mastering Atari with Discrete World Models
Language:Python1 0
ZhengyaoJiang/Inline_asm_snake
Language:C++1 0
ZhengyaoJiang/neural-style
Neural style in TensorFlow! :art:
Language:Python2 0
ZhengyaoJiang/ntp
End-to-End Differentiable Proving
Language:NewLisp2 0
ZhengyaoJiang/ray
A high-performance distributed execution engine
Language:Python3 0
ZhengyaoJiang/TankAI
a programming game ,in which you can use code to control the tank.
Language:Java2 0
ZhengyaoJiang/TD3_BC
Author's PyTorch implementation of TD3+BC, a simple variant of TD3 for offline RL
Language:Python1 0
ZhengyaoJiang/tensor2tensor
Library of deep learning models and datasets designed to make deep learning more accessible and accelerate ML research.
Language:Python1 0
ZhengyaoJiang/tflearn
Deep learning library featuring a higher-level API for TensorFlow.
Language:Python2 0
ZhengyaoJiang/ucl-dark.github.io
UCL Deciding, Acting, and Reasoning with Knowledge (DARK) Lab
Language:JavaScript1 01
ZhengyaoJiang/ucl-latex-thesis-templates
UCL LaTeX thesis templates.
Language:TeX1 0
ZhengyaoJiang/ZhengyaoJiang.github.io
Language:HTML2 0