Best-of-N
Give the LLM N valid attempts at the same parent before committing to the global best, then repeat.