
LLM benchmark harness for cellular automata initial-state design
game-of-life-bench is a Python benchmark runner for testing whether LLMs can design long-lived initial boards for Conway's Game of Life.
What It Measures
- Grid: 8 x 8
- Topology: toroi… [+2377 chars]
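
As a rough illustration of the setup described above, the sketch below simulates Conway's Game of Life on an 8 x 8 board with wrap-around edges and counts how long a starting board runs before it dies out or revisits an earlier state. It is not the benchmark's own code. The names `step` and `lifetime`, the `max_steps` cap, and this particular definition of "long-lived" are all assumptions made for illustration, and the wrap-around behavior is inferred from the topology entry in the list.

```python
from typing import Dict, FrozenSet, Tuple

Cell = Tuple[int, int]
SIZE = 8  # grid side length, taken from the "Grid: 8 x 8" entry


def step(live: FrozenSet[Cell], size: int = SIZE) -> FrozenSet[Cell]:
    """Advance one generation, wrapping both axes (assumed edge behavior)."""
    counts: Dict[Cell, int] = {}
    for r, c in live:
        for dr in (-1, 0, 1):
            for dc in (-1, 0, 1):
                if dr == 0 and dc == 0:
                    continue
                # Modulo arithmetic makes opposite edges adjacent.
                key = ((r + dr) % size, (c + dc) % size)
                counts[key] = counts.get(key, 0) + 1
    # Standard rules: birth on 3 neighbors, survival on 2 or 3.
    return frozenset(
        cell for cell, n in counts.items()
        if n == 3 or (n == 2 and cell in live)
    )


def lifetime(initial: FrozenSet[Cell], max_steps: int = 1000) -> int:
    """Hypothetical metric: generations until the board is empty or
    repeats any earlier state, capped at max_steps."""
    seen = {initial}
    state = initial
    for t in range(1, max_steps + 1):
        state = step(state)
        if not state or state in seen:
            return t
        seen.add(state)
    return max_steps


if __name__ == "__main__":
    blinker = frozenset({(1, 0), (1, 1), (1, 2)})
    print(lifetime(blinker))  # a period-2 oscillator repeats after 2 steps
```

Storing boards as frozen sets of live cells keeps each state hashable, so checking for a repeated state is a single set lookup.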
