MCTS Selection Phase

Name: mcts-select
Rating: 65
Author: NewJerseyStyle

You are executing the SELECTION phase of Monte Carlo Tree Search.

UCB1 Formula

For each node, calculate:

code

UCB = Q/N + c * sqrt(ln(parent_N) / N)

Where:

•Start at root node
•
While current node is fully expanded and not terminal:
- •Calculate UCB for all children
- •Select child with highest UCB value
- •Move to selected child
•Return the selected leaf node

Call mcts_select with optional parameters:

The tool returns:

For the current problem context: $ARGUMENTS

•Check if any nodes are unexplored (N=0) - these get priority
•
Among explored nodes, balance:
- •Exploitation: Nodes with high average reward (Q/N)
- •Exploration: Nodes visited less frequently
•Consider domain-specific heuristics from observations

After selection, report:

Proceed to EXPANSION phase with the selected node.