0xsuu/Project-Mahjong

Implement reinforcement learning roadmap

Opened this issue · 0 comments

0xsuu commented

Implement Route:

  • Traditional Tabular Method:
    • Monte Carlo Value Iteration on Tiny Mahjong ->
    • Q-learning on Tiny Mahjong
  • Learn the rules, learn the strategy:
    • Tiny Mahjong ->
    • Simple Mahjong Random ->
    • Gomoku Random ->
    • Simple Mahjong Greedy ->
    • Gomoku Beginner ->
    • Japanese Mahjong
  • Learn to react:
    • Breakout ->
    • Snake
  • Self-playing:
    • Simple Mahjong Duel/Full ->
    • Japanese Mahjong
  • Searching:
    • Tiny Mahjong (break the optimal) ->
    • Simple Mahjong ->
    • Japanese Mahjong
  • Value prediction:
    • Tiny Mahjong ->
    • Simple Mahjong ->
    • Gomoku ->
    • Japanese Mahjong