Pinned Repositories
a0-jax
AlphaZero in JAX
AlphaRenju
try to solve the renju game with AlphaZero- like algorithm
AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
AQ
Computer Go Program. Download: http://github.com/ymgaq/AQ/releases
Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
awesome-marketing-datascience
Curated list of useful LLM / Analytics / Datascience resources
awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
BPP-3D-Viewer
3D pattern viewer for cutting and packing problems
chatbot-ui
An open source ChatGPT UI.
chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
oriskunk's Repositories
oriskunk/Online-3D-BPP-DRL
This repository contains the implementation of paper Online 3D Bin Packing with Constrained Deep Reinforcement Learning.
oriskunk/IR-BPP
Packing irregular objects with deep reinforcement learning.
oriskunk/vllm
A high-throughput and memory-efficient inference and serving engine for LLMs
oriskunk/ml-papers
My collection of machine learning papers
oriskunk/K-G-OAT
IA3방식으로 KoAlpaca를 fine tuning한 한국어 LLM모델
oriskunk/awesome-marketing-datascience
Curated list of useful LLM / Analytics / Datascience resources
oriskunk/llama.cpp
Port of Facebook's LLaMA model in C/C++
oriskunk/chatbot-ui
An open source ChatGPT UI.
oriskunk/Auto-GPT
An experimental open-source attempt to make GPT-4 fully autonomous.
oriskunk/R-NaD
Experimentation with Regularized Nash Dynamics on a GPU accelerated game
oriskunk/LLM-As-Chatbot
Alpaca-LoRA as Chatbot service
oriskunk/langchain
⚡ Building applications with LLMs through composability ⚡
oriskunk/toolformer-pytorch
Implementation of Toolformer, Language Models That Can Use Tools, by MetaAI
oriskunk/awesome-RLHF
A curated list of reinforcement learning with human feedback resources (continually updated)
oriskunk/Tetris-deep-Q-learning-pytorch
Deep Q-learning for playing tetris game
oriskunk/decision-transformer
Official codebase for Decision Transformer: Reinforcement Learning via Sequence Modeling.
oriskunk/match3
A web match-3 game in C++14 using SDL2 / MVC / Range-v3 / Meta State Machine / Dependency Injection
oriskunk/chatllama
ChatLLaMA 📢 Open source implementation for LLaMA-based ChatGPT runnable in a single GPU. 15x faster training process than ChatGPT
oriskunk/Travelling-Salesman-Visualiser
Algorithm visualiser for the Travelling Salesman Problem
oriskunk/puzzleagent
oriskunk/gbr
Go board image recognition
oriskunk/a0-jax
AlphaZero in JAX
oriskunk/gym-pcgrl
A package for "Procedural Content Generation via Reinforcement Learning" OpenAI Gym interface.
oriskunk/mctx
Monte Carlo tree search in JAX
oriskunk/reverb
Reverb is an efficient and easy-to-use data storage and transport system designed for machine learning research
oriskunk/circuit_training
oriskunk/AlphaZero_Gomoku
An implementation of the AlphaZero algorithm for Gomoku (also called Gobang or Five in a Row)
oriskunk/KataGo
GTP engine and self-play learning in Go
oriskunk/project_MYM
Combined computer vision techniques and convolutional neural networks to accurately classify chess pieces and identified their location on a chessboard. Tools: Python, Google Cloud, Keras, TensorFlow, OpenCV, Pillow, Scikit-learn, NumPy, Seaborn, and others
oriskunk/BPP-3D-Viewer
3D pattern viewer for cutting and packing problems