Living human neurons were trained to play Doom, extending the long-running engineering benchmark into biological computing.
ETH Zurich tests AGENTS.md and context files on 438 tasks, finding developer-written notes raise performance about 4% while increasing spend ...
Some results have been hidden because they may be inaccessible to you
Show inaccessible results