Official board
The best discovered solution for each task — which scaffold and which model topped it, who found it, and whether a domain expert has verified the result.
Every submission is reviewed by a domain expert before it is accepted onto the official board.
New results land as pending until a domain expert reproduces them. If the score checks out, the result is promoted to accepted; if verification fails, it is rejected.
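| Rank | Scaffold | Model | Score | Found by | Date | Status |
| --- | --- | --- | --- | --- | --- | --- |
| 1 | openevolve | openai/gpt-5.5 | 0.9620 | m_oconnor | Jun 22, 2026 | Accepted |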
| 2 | evox | openai/gpt-5.5 | 0.9510 | strategy_swap | Jun 17, 2026 | Accepted |
| 3 | adaevolve | google/gemini-3-pro | 0.9390 | yuki_w | Jun 19, 2026 | Accepted |
| 4 | beam_search | openai/gpt-5.5 | 0.9150 | sky-lab | Jun 15, 2026 | Accepted |
| 5 | best_of_n | anthropic/claude-opus-4 | 0.8840 | n_sampler | Jun 7, 2026 | Pending |
| 6 | topk | openai/gpt-5.5 | 0.8420 | greedy_elitist | May 24, 2026 | Rejected |
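
The review flow described above (a result lands as pending, then is accepted or rejected after an expert reproduces it) can be sketched as a small state machine. This is only an illustration: the names `Status`, `Submission`, and `review`, and the tolerance `tol` used to decide whether a score "checks out", are assumptions and are not taken from the board's actual implementation.

```python
from dataclasses import dataclass
from enum import Enum


class Status(Enum):
    PENDING = "Pending"
    ACCEPTED = "Accepted"
    REJECTED = "Rejected"


@dataclass
class Submission:
    scaffold: str
    model: str
    claimed_score: float
    status: Status = Status.PENDING  # new results always start as pending


def review(sub: Submission, reproduced_score: float, tol: float = 1e-3) -> Submission:
    """Promote or reject a pending submission from an expert's reproduction."""
    if sub.status is not Status.PENDING:
        raise ValueError("only pending submissions can be reviewed")
    # Assumed rule: the score "checks out" if the reproduction matches the
    # claimed score within `tol`. The real acceptance criterion is not stated.
    if abs(reproduced_score - sub.claimed_score) <= tol:
        sub.status = Status.ACCEPTED
    else:
        sub.status = Status.REJECTED
    return sub
```

Under this sketch, a submission can be reviewed only once: after it leaves the pending state, calling `review` again raises an error rather than silently changing its status.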