<p>Gave some elevator logic code (Python) to a room of smart coders at a trading firm yesterday and asked "does it have any bugs in it?" Nobody found the bug, but a few noted some hard-to-describe oddities that were bug-adjacent perhaps.</p><p>Just gave the same code to Gemini Pro 2.5 and asked it to find bugs. It found a few superficial Python things and a few bits of odd behavior that none of the human coders found. But, it also didn't find the bug. </p><p>So, the elevator remains undefeated.</p>