chowfi/FineTune-LLM-OnlineRL
Fine-tuning LLM agents with online RL for XiangQi (Chinese Chess)
Jupyter Notebook
No issues in this repository yet.