51616
Ph.D student at Information Science and Technology (IST), VISTEC. Looking for research internship!
https://vistec.ist/Thailand
Pinned Repositories
51616
51616.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
advent_of_code_2019
My solution for advent of code 2019 in python
CU_Makhos
Thai Checkers deep reinforcement learning AI
denoise_generated_text
My computer vision project which aims to train an autoencoder model to clean dirty/noisy text images from generated data.
marl-lipo
Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)
openai_requests_for_research
My attemp to do requests for research from OpenAI
split-vae
Original implementation of Separated Paths for Local and Global Information framework (SPLIT) in TensorFlow 2.
vim
Vim config and bundle
wordwar
Object oriented game using java
51616's Repositories
51616/split-vae
Original implementation of Separated Paths for Local and Global Information framework (SPLIT) in TensorFlow 2.
51616/marl-lipo
Official codebase for Generating Diverse Cooperative Agents by Learning Incompatible Policies (notable-top-25% @ ICLR 2023)
51616/openai_requests_for_research
My attemp to do requests for research from OpenAI
51616/vim
Vim config and bundle
51616/wordwar
Object oriented game using java
51616/51616
51616/51616.github.io
Github Pages template for academic personal websites, forked from mmistakes/minimal-mistakes
51616/advent_of_code_2019
My solution for advent of code 2019 in python
51616/aoc2020
my solution for Advent of Code 2020
51616/attentionneuron.github.io
51616/baselines
OpenAI Baselines: high-quality implementations of reinforcement learning algorithms
51616/boids
boids!
51616/brax
Massively parallel rigidbody physics simulation on accelerator hardware.
51616/cleanba
CleanRL's implementation of DeepMind's Podracer Sebulba Architecture for Distributed DRL
51616/cleanrl
High-quality single file implementation of Deep Reinforcement Learning algorithms with research-friendly features (PPO, DQN, C51, DDPG, TD3, SAC, PPG)
51616/dotfiles
51616/dreamerv2
Mastering Atari with Discrete World Models
51616/envpool
C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
51616/es-clip.github.io
Placeholder
51616/evojax
51616/fort
My utility repo for personal work (multi-agent reinforcement learning)
51616/github-pages-javascript-prototype
Experimenting with GitHub pages and JavaScript and local data
51616/gsa-scraper
This codebase implements a scraper that accumulate all GSA emails from your inbox into a single `.md` file.
51616/human_aware_rl
Code for "On the Utility of Learning about Humans for Human-AI Coordination"
51616/IsaacGymEnvs
Isaac Gym Reinforcement Learning Environments
51616/multiagent_mujoco
Benchmark for Continuous Multi-Agent Robotic Control, based on OpenAI's Mujoco Gym environments.
51616/PettingZoo
Gym for multi-agent reinforcement learning
51616/ray
An open source framework that provides a simple, universal API for building distributed applications. Ray is packaged with RLlib, a scalable reinforcement learning library, and Tune, a scalable hyperparameter tuning library.
51616/rebar
Reinforcement learning utils from https://github.com/andyljones/megastep
51616/xmanager
A platform for managing machine learning experiments