/FineTune-LLM-OnlineRL

Fine-tuning LLM agents w online RL for XiangQi (Chinese Chess)

Primary LanguageJupyter Notebook

No issues in this repository yet.