Comment on How long do you think until AI writes and debugs code better than the average programmer?

ezchili@iusearchlinux.fyi ⁨7⁩ ⁨months⁩ ago

I’ve had 100% failure rate on simple requirements

“Make a pathfinding function for a 2d grid” - fine

“Make a pathfinding function for a 2d grid, but we can only move 15 cells at a time” - fails on lesser models, it keeps clinging to pulling you the same A* as the first one

“Make a pathfinding function for a 2d grid, but we can only move 15 cells at a time, also, some cells are on fire and must be avoided if possible, but if there is no other path possible then you’re allowed to use fire cells as fallback” - Never works

There for that last one, none of the models give a solution that fits the very simple requirement

source
Sort:hotnewtop