- CEO
- CEO provides high-level guidance
- identifies biggest bottleneck
- identifies the resources allocation and exit conditions
- By attending to the most strategically important patterns on the board, the CEO forms its next directive
- performing pattern recognition on the state of the reasoning process itself
- worker
- Worker explores the tactical consequences.
- rapid, detailed, and logical
- It uses its QKV attention to learn and enforce the fundamental rules of the puzzle
- as a massively parallel system of rule-checking, happening across the entire grid in a single step
is this like a greedy algorithm instead of taking the depth first or breadth first approach