Open Q*
A transformer-based LLM structurally infused with Q-Learning and A* algorithms
Is this Q*? Let's find out!
A Work In Progress Repo
Transformer-based LLM structurally infused with Q-Learning and A* heuristic search algorithms
Python
A transformer-based LLM structurally infused with Q-Learning and A* algorithms
Is this Q*? Let's find out!
A Work In Progress Repo